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Preface 


Applied Differential Equations with Boundary Value Problems is a comprehensive exposition of ordinary differential 
equations and an introduction to partial differential equations (due to space constraint, there is only one chapter 
devoted directly to PDEs) including their applications in engineering and the sciences. This text is designed for a 
two-semester sophomore or junior level course in differential equations and assumes previous exposure to calculus. 
It covers traditional material, along with novel approaches in presentation and utilization of computer capabilities, 
with a focus on various applications. This text intends to provide a solid background in differential equations for 
students majoring in a breadth of fields. 

This book started as a collection of lecture notes for an undergraduate course in differential equations taught by 
the Division of Applied Mathematics at Brown University, Providence, RI. To some extent, it is a result of collective 
insights given by almost every instructor who taught such a course over the last 15 years. Therefore, the material 
and its presentation covered in this book were practically tested for many years. 

There is no need to demonstrate the importance of ordinary and partial differential equations (ODE and PDE, for 
short) in science, engineering, and education—this subject has been included in the curriculum of universities around 
the world for almost two hundred years. Their utilization in industry and engineering is so widespread that, without 
a doubt, differential equations have become the most successful mathematical tool in modeling. Perhaps the most 
germane point for the student reader is that many curricula recommend or require a course in ordinary differential 
equations for graduation. The beauty and utility of differential equations and their application in mathematics, 
biology, chemistry, computer science, economics, engineering, geology, neuroscience, physics, the life sciences, and 
other fields reaffirm their inclusion in myriad curricula. 

In this text, differential equations are described in the context of applications. A more comprehensive treatment 
of their applications is given in [14]. It is important for students to grasp how to formulate a mathematical model, 
how to solve differential equations (analytically or numerically), how to analyze them qualitatively, and how to 
interpret the results. This sequence of steps is perhaps the hardest part for students to learn and appreciate, yet 
it is an essential skill to acquire. This book provides the common language of the subject and teaches the main 
techniques needed for modeling and systems analysis. 

The goals in writing this textbook: 


e To show that a course in differential equations is essential for modeling real-life phenomena. This textbook 
lays down a bridge between calculus, modeling, and advanced topics. It provides a basis for further serious 
study of differential equations and their applications. We stress the mastery of traditional solution techniques 
and present effective methods, including reliable numerical approximations. 


e To provide qualitative analysis of ordinary differential equations. Hence, the reader should get an idea of how 
all solutions to the given problem behave, what are their validity intervals, whether there are oscillations, 
vertical or horizontal asymptotes, and what is their long term behavior. So the reader will learn various 
methods of solving, analysis, visualization, and approximation. This goal is hard to achieve without exploiting 
the capabilities of computers. 


e To give an introduction to four of the most pervasive computer software! packages: Maple”, Mathematica® , 
MATLAB®, and Maxima—the first computer algebra system in the world. A few other such solvers are available: 
Sage, R, and SymPy, but we cannot afford to present them in the text and refer the reader to the accompanied 


‘The owner of Maple is Maplesoft (http://www.maplesoft.com/), a subsidiary of Cybernet Systems Co. Ltd. in Japan, which is the 
leading provider of high-performance software tools for engineering, science, and mathematics. Mathematica is the product of Wolfram 
Research company of Champaign, Illinois, USA founded by Stephen Wolfram in 1987; its URL is http: //www.wolfram.com. 

MATLAB® is the product of the MathWorks, Inc., 3 Apple Hill Drive, Natick, MA, 01760-2098 USA, Tel: 508-647-7000, Fax: 508-647-7001, 
E-mail: info@mathworks.com, URL: www.mathworks.com. 
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web site. Some popular software packages have either similar syntax (such as Octave or GiNaC) or include 
engines of known solvers (such as MathCad and MuPad—integrated part of MATLAB). Others should be 
accessible with the recent development of cloud technology such as Sage. Also, simple numerical algorithms 
can be handled with a calculator or a spreadsheet program. 


e To give the lecturer a flexible textbook within which he or she can easily organize a curriculum matched to 
their specific goals. This textbook presents a large number of examples from different subjects, which facilitate 
the development of the student’s skills to model real-world problems. Staying within a traditional context, the 
book contains some advanced material on differential equations. 


e To give students a thorough understanding of the subject of differential equations as a whole. This book 
provides detailed solutions of all the basic examples, and students can learn from it without any extra help. It 
may be considered as a self-study text for students as well. This book recalls the basic formulas and techniques 
from calculus, which makes it easy to understand all derivations. It also includes advanced material in each 
chapter for inquisitive students who seek a deeper knowledge of this subject. 


Philosophy of the Text 
We share our pedagogical approach with famous mathematician Paul Halmos [19, pp. 61-62], who recommended 
the study of mathematics by examples. He goes on to say: 


... it’s examples, examples, examples that, for me, all mathematics is based on, and I always look for 
them. I look for them first, when I begin to study. I keep looking for them, and I cherish them all. 


Pedagogy and Structure of the Book 

Ordinary and partial differential equations is a classical subject that has been studied for about 300 years. 
However, education has changed with omnipresent mathematical modeling technology available to all. This textbook 
stresses that differential equations constitute an essential part of modeling by showing their applications, including 
numerical algorithms and syntax of the four most popular software packages. 

It is essential to introduce information technologies early in the class. Students should be encouraged to use 
numerical solvers in their work because they help to illustrate and illuminate concepts and insights. It should be 
noted that computers cannot be used blindly because they are as smart as the programmers allow them to be—every 
problem requires careful examination. 

This textbook stays within traditional coverage of basic topics in differential equations. It contains practical 
techniques for solving differential equations, some of which are not widely used in undergraduate study. Not every 
statement or theorem is followed by rigorous verification. Proofs are included only when they enhance the reader’s 
understanding and challenge the student’s intellectual curiosity. 

Our pedagogical approach is based on the following principle: follow the author. Every section has many examples 
with detailed exposition focused on how to choose an appropriate technique and then how to solve the problem. 
There are hundreds of problems solved in detail, so a reader can master the techniques used to solve and analyze 
differential equations. 

Notation 

This text uses numbers enclosed with brackets to indicate references in the bibliography, which is located at 
the end of the book, starting on page 669. The text uses only standard notations and abbreviations [et al. (et alii 
from Latin) means “and others,” or “and co-workers;” i.e. from Latin “id est” meaning that is, that is to say, or 
in other words; e.g. stands for the Latin phrase “exempli gratia,” which means for example; and etc. means “and 
the others,” “and other things,” “and the rest”). However, we find it convenient to type Mi at the end of proofs or 
at the end of a topic presented; we also use UH at the end of examples (unless a new one serves as a delimiter). We 


hope that the reader understands the difference between = (equal) and = (equivalence relation). Also = is used for 
short to signal that the expression follows by definition. There is no common notation for complex numbers. Since 
a complex number (let us denote it by z) is a vector on the plane, it is a custom to denote it by z = x + yj rather 
than z = xi+ yj, where the unit vector i is dropped and j is the unit vector in the positive vertical direction. In 
mathematics, this vector j is denoted by 7. For convenience, we present the list of symbols and abbreviations at the 
beginning of the text. 


For students 
This text has been written with the student in mind to make the book very friendly. There are a lot of illustrations 
accompanied by corresponding codes for appropriate solvers. Therefore, the reader can follow examples and learn how 


XV 


to use these software packages to analyze and verify obtained results, but not to replace mastering of mathematical 
techniques. Analytical methods constitute a crucial part of modeling with differential equations, including numerical 
and graphical applications. Since the text is written from the viewpoint of the applied mathematician, its presentation 
may sometimes be quite theoretical, sometimes intensely practical, and often somewhere in between. In addition to 
the examples provided in the text, students can find additional resources, including problems and tutorials on using 
software, at the website that accompanies this book: 
http://www.cfm. brown. edu/people/dobrush/am33/computing33.htm1l 

The focus of the book is upon applications and methods of solutions because most practical problems need 
mathematical and numerical approximations to gain insight into their behavior. For adequate preparation of a 
student for study in her or his respective fields, it is imperative to muster in computer applications, in particular, 
being familiar with numerical solvers and computer algebra systems (CAS for short). In engineering or other 
application oriented courses, CAS and numerical solvers become a part of education in mathematics because of 
following reasons: 


e They are a part of solving tools. 


e CASs allow investigation of algorithms; in particular, they can be helpful to analyze algorithms, their com- 
plexity, dependency on input data, and performance. 


e They help to understand mathematics, illustrate concepts, and boosts the learning process. 


e CAS and numerical solvers usually reveal the underlying mathematics; in particular, its open code could be a 
part of a mathematical proof. 


For instructors 

Universities usually offer two courses on differential equations of different levels; one is the basic first course 
required by curriculum, and the other covers the same material, but is more advanced and attracts students who 
find the basic course trivial. This text can be used for both courses, and curious students have an option to increase 
their understanding and obtain deeper knowledge in any topic of interest. A great number of examples and exercises 
make this text well suited for self-study or for traditional use by a lecturer in class. Therefore this textbook addresses 
the needs of two levels of audience, the beginning and the advanced. 
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Introduction 


The independent discovery of the calculus by I. Newton and G. Leibniz was immediately followed by its intensive 
application in mathematics, physics, and engineering. Since the late seventeenth century, differential equations have 
been of fundamental importance in the study, development, and application of mathematical analysis. Differential 
equations and their solutions play one of the central roles in the modeling of real-life phenomena. 

In this chapter, we begin our study with the first order differential equations in normal form 


dy 
dz 


where f(x,y) is a given single-valued function of two variables, called a slope or rate function. For an arbitrary 
function f(x,y), there does not necessarily exist a function y = ¢(x) that satisfies the differential equation. In fact, a 
differential equation usually has more than one solution. We classify first order differential equations and formulate 
several analytic methods that are applicable to each subclass. 

One of the most intriguing things about differential equations is that for an arbitrary function f, there is no 
general method for finding an exact formula for the solution. For many differential equations that are encountered 
in real-world applications, it is impossible to express their solutions via known functions. Generally speaking, every 
differential equation defines its solution (if it exists) as a special function not necessarily expressible by elementary 
functions (such as polynomial, exponential, or trigonometric functions). Only exceptional differential equations can 
be explicitly or implicitly integrated. For instance, such “simple” differential equations as y’ = y? — x or y’ = e*Y 
cannot be solved by available methods. 


= f(x,y), 


1.1 Motivation 


In applied mathematics, a model is a set of equations describing the relationships between numerical values of 
interest in a system. Mathematical modeling is the process of developing a model pertaining to physics or other 
sciences. Since differential equations are our main objects of interest, we consider only models that involve these 
equations. For example, Newton’s second law, F = ma, relates the force F acting on a particle of mass m with the 


resulting acceleration a = X s d?x/ dt?. The transition from a physical problem to a corresponding mathematical 
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model is not easy. It often happens that, for a particular problem, physical laws are hard or impossible to derive, 
though a relation between physical values can be obtained. Such a relation is usually used in the derivation of a 
mathematical model, which may be incomplete or somewhat inaccurate. Any such model may be subject to refining, 
making its predictions agree more closely with experimental results. 

Many problems in the physical sciences, social sciences, biology, geology, economics, and engineering are posed 
mathematically in terms of an equation involving derivatives (or differentials) of the unknown function. Such an 
equation is called a differential equation, and their study was initiated by Leibniz? in 1676. It is customary to 
use his notation for derivatives: dy/da, d?y/dx?, ..., or the prime notation: y’, y’,.... For higher derivatives, we 
use the notation y‘ to denote the derivative of the order n. When a function depends on time, it is common to 
denote its first two derivatives with respect to time with dots: 4, ¥. 

The next step in mathematical modeling is to determine the unknown, or unknowns, involved. Such a procedure is 
called solving the differential equation. The techniques used may yield solutions in analytic forms or approximations. 
Many software packages allow solutions to be visualized graphically. In this book, we focus on four popular packages: 
MATLAB®, Maple”, Mathematica®, and Maxima. Some attention will be given to (free) computer algebra systems 
Sage and SymPy. To motivate the reader, we begin with two well-known examples. 


Example 1.1.1: (Carbon dating) The procedure for determining the age of archaeological remains was developed 
by the 1960 Nobel prize winner in chemistry, Willard Libby®. Cosmic radiation entering the Earth’s atmosphere is 
constantly producing carbon-14 (gC'*), an unstable radioactive isotope of ordinary carbon-12 (¢C'). Both isotopes 
of carbon appear in carbon dioxide, which is incorporated into the tissues of all plants and animals, including human 
beings. In the atmosphere, as well as in all living organisms, the proportion of radioactive carbon-14 to ordinary 
(stable) carbon-12 is constant. When an organism dies, the absorption of carbon-14 by respiration and ingestion 
terminates. Experiments indicate that radioactive substances, such as uranium or carbon-14, decay by a certain 
percentage of their mass in a given unit of time. In other words, radioactive elements decay at a rate proportional 
to the mass present. Let c(t) be the concentration of carbon-14 in dead organic material at time t, counted since 
the time of death. Then c(t) obeys the following differential equation subject to the initial condition: 

a) = —c(t), t>0, c(0) = co, (1.1.1) 
where at the time of death t = 0, co is the concentration of the isotope that a living organism maintains, and . is 
the characteristic constant (\ ~ 1.24 x 10~+ per year for carbon-14). The technique to solve this type of differential 
equation will be explained later, in §2.1. We guess a solution: c(t) = K e~*’, with constant K, using the derivative 
property of the exponential function (eX) = ke**. Since c(0) = K, it follows from the initial condition, c(0) = co, 
that 

c(t) = coe. 
Suppose we know this formula to be true for c(t). We determine the time of death of organic material from an 
examination of the concentration c(t) of carbon-14 at the time t. The following relationship holds: 


ae =e, 


Applying a logarithm to both sides, we obtain 
—At = In[c(t)/co] = — In[eo/c(t)), 


from which we can find the time ¢t of death of the organism to be 


i=5in(3). 


Recall that the half-life of a radioactive nucleus is defined as the time t, during which the number of nuclei reduces 
to one-half of the original value. If the half-life of a radioactive element is known to be t;,, then the radioactive nuclei 
decay according to the law 

N(0) 


_ —t/th _ 
N(t) = N(Q)2-/ =e, 


(11.9) 


*Gottfried Wilhelm Leibniz (1646-1716) was a German scientist who first solved separable, homogeneous, and linear differential 
equations. He co-discovered calculus with Isaac Newton. 
3 American chemist Willard Libby (1908-1980). 
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where N(t) is the amount of radioactive substance at time ¢ and t, = (In2)/X. Since the half-life of carbon-14 is 
approximately 5,730 years, present measurement techniques utilize this method for carbonaceous materials up to 
about 50,000 years old. 


=— V(t) —— 


Example 1.1.2: (RC-series circuit) The most 
common applications of differential equations occur 
in the theory of electric circuits because of its im- 
portance and the pervasiveness of these equations in Cc R 
network theory. Figure 1.1 shows an electric circuit 
consisting of a resistor R and a capacitor C' in se- 


ries. A differential equation relating current J(t) in I 

the circuit, charge q(t) on the capacitor, and voltage —- 

V(t) measured at the points shown can be derived by ; 

applying Kirchhoff’s voltage law, which states that Figure 1.1: RC-Circuit. 


the voltage V(t) must equal the sum of the voltage 
drops across the resistor and the capacitor (see [14]). 


It is known that the voltage changes across the passive elements are approximately as follows. 


AVr = RI for the resistor, 
AVc =@q/C for the capacitor. 


Furthermore, the current is defined to be the rate of flow of charge: I(t) = dq/dt. By combining these expressions 
using Kirchhoft’s voltage law we obtain a differential equation relating q(t) and V(t): 


1.2 Classification of Differential Equations 


To study differential equations, we need some common terminology and basic classification of equations. If an 
equation involves the derivative of one variable with respect to another, then the former is called a dependent 
variable and the latter is an independent variable. For instance, in the equation from Example 1.1.2, the charge q is 
a dependent variable and the time t¢ is an independent variable. 

Ordinary and Partial Differential Equations. We start classification of differential equations with the 
number of independent variables: whether there is a single independent variable or several independent variables. 
The first case is an ODE (acronym for Ordinary Differential Equation), and the second is a PDE (Partial Differential 
Equation). 

Systems of Differential Equations. Another classification is based on the number of unknown dependent 
variables to be found. If two or more unknown variables are to be determined, then a system of equations is required. 


Example 1.2.1: We will derive a simple model of an arms race between two countries. Let 2;(¢) represent the 
size (or cost) of the arms stocks of country i (i = 1,2). Due to the cost of maintenance, we assume that an 
isolated country will diminish its arms stocks at a rate proportional to its size. We express this mathematically as 
& = da; /dt = —c;x;, c; > 0. The competition between countries, however, causes each one to increase its supply 
of arms at a rate proportional to the other country’s arms supplies. The English meteorologist Lewis F. Richardson 
[43, 44] proposed a model to describe the evolution of both countries’ arsenals as the solution of the following system 
of differential equations: 

Ly = —-CyX1 + dx + gi(x1,X2,t), C1, dy 


Lr = —C2%2 + dz21 + g2(x1, x2, t), C2, dy 


where the c’s are called cost factors, the d’s are defense factors, and the g’s are grievance terms that account for 
other factors. 
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The order of a differential equation is the order of the highest derivative that appears in the equation. More 
generally, an ordinary differential equation of the n-th order is an equation of the following form: 


F (2,y(2),9'(@),---.y(2)) =0. (1.2.1) 
Here y(a) is an unspecified function having n derivatives and depending on « € (a,b), a < b; F(#,y,pi,---,Pn) isa 
given function of n +2 variables. Some of the arguments, x, y,..., y"~)) (or even all of them) may not be present 


in Eq. (1.2.1). However, the n-th derivative, y™, must be present in the ordinary differential equation, or else its 
order would be less than n. If this equation can be solved for y‘” (a), then we obtain the differential equation in the 
normal form: 


y™ (2) = f (e,u's-.¥), x € (a,b). (1,2:2) 


A first order differential equation is of the form 
F(a, y,y') = 0. (1.2.3) 
If we can solve it with respect to y’, then we obtain its normal form: 


d 
7a =f(x,y) or dy=f(a,y)da, (1.2.4) 


where dx and dy are differentials in variables x and y, respectively. 
Linear and Nonlinear Equations. The ordinary differential equation (1.2.1) is said to be linear if F is a linear 
function of the variables y(x), y’(x), ..., y” (x). Thus, the general linear ordinary differential equation of order n is 


dn (x)y + an—r(a)y"—) + +++ + ao(x)y = g(2). (1.2.5) 


An equation (1.2.1) is said to be nonlinear if it is not of this form. 

For example, the van der Pol equation, # — e(1 — y?) y + dy = 0, is a nonlinear equation because of the presence 
of the term y?. On the other hand, y’(a) + (sin x) y(a) = x? is a linear differential equation of the first order because 
it is of the form (1.2.5). In this case, ay = 1, ao(x) = sina, and g(x) = 2”. 

The general forms for the first and second order linear differential equations are: 


ay (x) y'(a) + ao(a) y(e) = f(a) and a(x) y"(a) + a(x) y'(a) + ao(a) y(a) = f(a). 


1.3. Solutions to Differential Equations 


Since the unknown quantity in a differential equation is a function, it should be defined in some domain. The 
differential equation (1.2.1) is usually considered on some open interval (a,b) = {a : a < a < b}, where its solution 
along with the function F(x, y,pi,...,Pn) should be defined. However, it may happen that we look for a solution on 
a closed interval [a, 6] or a semi-open interval, (a, b] or [a,b). For instance, the bending of a plane’s wing is modeled 
by an equation on a semi-closed interval [0, 2), where the point x = 0 corresponds to the connection of the wing to 
the body of the plane, and @ is the length of the wing. At x = @, the equation is not valid and its solution is not 
defined at this point. To embrace all possible cases, we introduce the notation |a, b], which denotes an interval (a, b) 
possibly including the end points; hence, |a, b| can denote the open interval (a,b), the closed interval [a, b], or the 
semi-open intervals (a, b] or [a, 0). 


Definition 1.1: A solution or integral of the ordinary differential equation 


F («, y(x), y'(x),... y(z)) = 0 


on an interval |a,b| (a < b) is a continuous function y(x) such that y, y’, y”, ..., y™ exist and satisfy the 
equation for all values of the independent variable on the interval, « € |a,b|. The graphs of the solutions of a 
differential equation are called their integral curves or streamlines. 


This means that a solution y(x) has derivatives up to the order n in the interval |a,b|, and for every x € |a, d|, 
the point (2, y(x),y'(a),... sy (a) should be in the domain of F'. We can show that a solution satisfies a given 
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differential equation in various ways. The general method consists of calculating the expressions of the dependent 
variable and its derivatives, and substituting all of these in the given equation. The result of such a substitution 
should lead to the identity. 

From calculus, it is known that a differential equation y’ = f(a) has infinitely many solutions for a smooth function 
f(z) defined in some domain. These solutions are expressed either via an indefinite integral, y = f f(x) dx + C, 
or a definite integral with a variable boundary, y = ee f(a) dz — C or y = — f° f(x) dx + C, where zo is some 
fixed value. The constant of integration, C, is assumed arbitrary in the sense that it can be given any value within 
a certain range. However, C' actually depends on the domain where the function y(a) is considered and the form 
in which the integral is expressed. For instance, a simple differential equation y’ = (1 + 2?)~1 has infinitely many 
solutions presented in three different forms: 


dx 
1+ 2? 


1 1— 72 
= arctanz + C = arctan (; **) + Ci = = arccos (5) + Co, 
x 


where arbitrary constants C’, C,, and C2 can be expressed in terms of each other, but their relations depend on the 
domain of x. For example, Cj = 4 +C when x < 1, but C; = C — 34 for x > 1. Also, the antiderivative of 1/x (for 
x #0) will usually be written as In Cz instead of In|C'z| because it would be assumed that C' > 0 for a positive x 
and C' < 0 for a negative xz. In general, a function of an arbitrary constant is itself an arbitrary constant. 

Given the above observation, one might expect that a differential equation y’ = f(x,y) has infinitely many 
solutions (if any). For instance, the function y = x + 1 is a solution to the differential equation 


y +y=242. 


To verify this, we substitute y = 2 +1 and y’ = 1 into the equation. Indeed, y’ + y = 1+(x4+1)=2+2. It is 
not difficult to verify that another function g(x) = «+ 1+ e°® is also a solution of the given differential equation, 
demonstrating that a differential equation may have many solutions. 

To solve a differential equation means to make its solutions known (in the sense explained later). A solution in 
which the dependent variable is expressed in terms of the independent variable is said to be in explicit form. A 
function is known if it can be expressed by a formula in terms of standard and/or familiar functions (polynomial 
functions, exponentials, trigonometric functions, and their inverse functions). For example, we consider functions 
given by a convergent series as known if the terms of the series can be expressed via familiar functions. Also, 
quadrature (expression via integral) of a given function f(a) is regarded as known. 

However, we shall see in this book that functions studied in calculus are not enough to describe solutions of all 
differential equations. In general, a differential equation defines a function as its solution (if one exists), even if it 
cannot be expressed in terms of familiar functions. Such a solution is usually referred to as a special function. Thus, 
we use the word “solution” in a broader sense by including less convenient forms of solutions. 

Any relation, free of derivatives, that involves two variables x and y and that is consistent with the differential 
equation (1.2.1) is said to be a solution of the equation in implicit form. Although we may not be able to solve 
the relation for y, thus obtaining a formula in x, any change in = still results in a corresponding change in y. Hence, 
on some interval, this could define locally a solution y = ¢() even if we fail to find an explicit formula for it or even 
if the global function does not exist. In fact, we can obtain numerical values for y = ¢(x) to any desired precision. 

In this text, you will learn how to determine solutions explicitly or implicitly, how to approximate them numer- 
ically, how to visualize and plot solutions, and much more. 

Let us consider for simplicity the first order differential equation in normal form: y’ = f(#,y). One can rarely 
find its solution in explicit form, namely, as y = ¢(x). We will say that the equation 


P(x, y) =0 
defines a solution in implicit form if ®(x, y) is a known function. How would you know that the equation ®(x, y) = 0 
defines a solution y = ¢(x) to the equation y’ = f(x,y)? Assuming that the conditions of the implicit function 
theorem hold, we differentiate both sides of the equation ®(z, y) = 0 with respect to z: 
®,(z, y) + Oy (x,y) y’ = 0, where ©, = 0®/Ox, ©, = 0®/Oy. 


From the equation y’ = f(x,y), we obtain 


®.(x,y) + Oy (x,y) f(x,y) = 0. (1.3.1) 
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Therefore, if the function y = (2) is a solution to y’ = f(#,y), then the function ®(x,y) = y — (x) must satisfy 
Eq. (1.3.1). Indeed, in this case, we have 6, = —¢’ and ®, = 1. 

An ordinary differential equation may be given either for a restricted set of values of the independent variable 
or for all real values. Restrictions, if any, may be imposed arbitrarily or due to constraints relating to the equation. 
Such constraints can be caused by conditions imposed on the equation or by the fact that the functions involved 
in the equation have limited domains. Furthermore, if an ordinary differential equation is stated without explicit 
restrictions on the independent variable, it is assumed that all values of the independent variable are permitted with 
the exception of any values for which the equation is meaningless. 


Example 1.3.1: The relation 
zx 
nyt y— [ e* dx =0 (y > 0) 
0 
is considered to be a solution in implicit form of the differential equation 


2 ! y —x? 


2 =e = 
(1+ 2y*)y’ — ye =0 or V-Tyap° 


This can be seen by differentiating the given relationship implicitly with respect to z. This leads to 


d a) 1 dy dy 2 
as finy ty [ e 7 an oa e 
Therefore, 
dy (1+ 2y? og? dy _ y 2 
dx ( y = = dx 1+2y? aan 


Example 1.3.2: The function y(z) that is defined implicitly from the equation 


+ 2y7=4 


is a solution of the differential equation « + 2yy’ = 0 on the interval (—2,2) subject to g(+2) = 0. To plot its 
solution in Maple, use the implicitplot command: 
implicitplot (x72+2*y*2=4, x=-2..2, y=-1.5..1.5); 
The same ellipse can be plotted with the aid of Mathematica: 
ContourPlot[x*2 + 2 y*2 == 4, {x, -2, 2}, fy, -2, 2}, 
PlotRange -> {{-2.1, 2.1}, {-1.5, 1.5}}, AspectRatio -> 1.5/2.1, 
ContourStyle -> Thickness[0.005], FrameLabel -> {"x", "y"}, 


RotateLabel -> False] (* Thickness is .5% of the figure’s length *) 
The same plot can be drawn in Mazima with the following commands: 
load(draw) ; 


draw2d(ip_grid=[100,100], /* optional, makes a smoother plot */ 
implicit(x72 + 2*y72 = 4, x,-2.1,2.1, y,-1.5,1.5)); 

MATLAB is capable to perform the same job: 

[x,y] = meshgrid(-2:.1:2,-2:.1:2); contour(x..2 + 2*y.°2) 


The implicit relation x? + 2 y? = 4 contains the two explicit solutions 


y(x) = V2 —0.52? and y(“z) =—-V2-0.542 (-2 <2 < 2), 


which correspond graphically to the two semi-ellipses. Indeed, if we rewrite the given differential equation 1+2y y’ = 0 
in the normal form y’ = —a/(2y), then we should exclude y = 0 from consideration. Since 7 = +2 correspond to 
y = 0 in both of these solutions, we must exclude these points from the domains of the explicit solutions. Note that 
the differential equation «+ 2y y’ = 0 has infinitely many solutions: x? + 2y? = C (|x| < C), where C is an arbitrary 
positive constant. 


Next, we observe that a differential equation may (and usually will) have an infinite number of solutions. A set 
of solutions of y’ = f(a, y) that depends on one arbitrary constant C' deserves a special name. 
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Definition 1.2: A function y = ¢(z, C) is called the general solution to the differential equation y’ = f(z, y) 
in some two-dimensional domain 2 if for every point (x,y) € 9 there exists a value of constant C’ such that the 
function y = ¢(x,C) satisfies the equation y’ = f(x,y). A solution of this differential equation can be defined 
implicitly: 

®(x,y,C) =0 or w(a,y) =C. (1.3.2) 


In this case, ®(x, y,C) is called the general integral, and 7(z, y) is referred to as the potential function of 
the given equation y’ = f(x, y). 


A constant C may be given any value in a suitable range. Since C’ can vary from problem to problem, it is often 
called a parameter to distinguish it from the main variables x and y. Therefore, the equation ®(x, y, C) = 0 defines 
a one-parameter family of curves with no intersections. Graphically, it represents a family of solution curves in the 
xy-plane, each element of which is associated with a particular value of C. The general solution corresponds to the 
entire family of curves that the equation defines. 

As might be expected, the inverse statement is true: the curves of a one-parameter family are integrals of some 
differential equation of the first order. Indeed, let the family of curves be defined by the equation ®(z, y,C) = 0, 
with a smooth function ®. Differentiating with respect to a yields a relation of the form F(z, y,y’,C) = 0. By 
eliminating C’ from these two equations, we obtain the corresponding differential equation. 


Example 1.3.3: For an arbitrary constant C, show that 


the function y = C ++—-Ges is the solution of the nonlinear 207 
VI+C? y/ 
differential equation y — xy! = ——=—— ol 


J1+y? 


Solution. Taking the derivative of y shows that y’ = C. = 

Substitution y= Ca + y aero and y’ = C into the differ- a : g 

ential equation yields -10f 
C C et 


Cat 


———. -— r C = —. 
V14+cC? 7 V14+C? 


2 : oo, : Figure 1.2: Example 1.3.3. A one- 
This identity proves that the function is a solution of the 


parameter family of solutions, plotted with 


given differential equation. Setting C’ to some value, for le eee 
instance, C = 1, we obtain a particular solution y = x + 
1/V2. 


Example 1.3.4: Show that the function y = (x) in para- 
metric form, y(t) = te’, x(t) = e*, is a solution to the 


differential equation x? y’ = 1— xy. 0a [\ 
} 
Solution. The derivatives of x and y with respect to t are sf | 
dx : dy , 0.25 
—-—eE and —=e 1-¢t > 02 
dt dt ( ) 


respectively. Hence, 


dy = dy/dt = eo (1 _ t) = et (1 _ t) 0.05 
da da/dt et r : : : . J 
= et po 
a a Figure 1.3: Example 1.3.4. A solution of 
because 2-? = e~* and y/a =te-*. the differential equation x7y’ = 1 — xy 


(plotted with MATLAB). 


Example 1.3.5: Consider the one-parameter (depending on C) family of curves 


_# +y? 


get+y?+Cy=0 or C= (y #0). 
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On differentiating, we get 2x + 2yy’ + Cy’ =0. Setting C = —(x? + y”)/y in the latter, we obtain the differential 


equation 
ree ) 


ae + 2yy'—y' ( =0 or y’ = 2xy/(x? — y?). 


This job can be done by Mazima in the following steps: 


depends (y,x); /* declare that y depends on x */ 

soln: x°2 + y°2 + Cey = 0; /* soln is now a label for the equation */ 
diff (soln,x); /* differentiate the equation */ 
eliminate([%,soln], [C]); /* eliminate C from these two equations */ 
solve(4%, ’diff(y,x)); /* solve for y’ */ 


Sometimes the integration of y’ = f(x,y) leads to a family of integral curves that depend on an arbitrary constant 
C in parametric form, namely, 
x=p(t,C), y=v(t,C). 


This family of integral curves is called the general solution in parametric form. 

In many cases, it is more convenient to seek a solution in parametric form, especially when the slope function is 
a ratio of two functions: y’ = P(x, y)/Q(x,y). Then introducing a new independent variable t, we can rewrite this 
single equation as a system of two equations: 


& 2 dax/dt=Q(a,y), 9 = dy/dt = P(a,y). (1.3.3) 


1.4 Particular and Singular Solutions 


A solution to a differential equation is called a particular (or specific) solution if it does not contain any arbitrary 
constant. By setting C' to a certain value, we obtain a particular solution of the differential equation. So every specific 
value of C in the general solution identifies a particular solution or curve. Another way to specify a particular solution 
of y’ = f(x,y) is to impose an initial condition: 


y(Xo) = Yo, (1.4.1) 


which specifies a solution curve that goes through the point (ao, yo) on the plane. Substituting the general solution 
into Eq. (1.4.1) will allow you to determine the value of the arbitrary constant. Sometimes, of course, no value of 
the constant will satisfy the given condition (1.4.1), which indicates that there is no particular solution with the 
required property among the entire family of integral curves from the general solution. 


Definition 1.3: A differential equation y’ = f(x,y) (or, in general, F(x,y,y’) = 0) subject to the initial 


condition y(xo) = yo, where xo and yo are specified values, is called an initial value problem (IVP) or a 
Cauchy problem. 


COS & 


Example 1.4.1: Show that the function y(x) = x i + / a] is a solution of the following initial value 
1 


x 
problem: 
xy -—y=xcosx, y(1)=1. 


Solution. The derivative of y(a) is 


” cos cos * cos 
y(o)=1+ f Sde+a == 1+cosr+ | = de. 
1 av av 1 x 


Hence, 


, ” cosx * cosx 
ry —-y=x+n2cosx+u2 da—a}1+ dz} = x cosa. 
1 zt 1 zt 


The initial condition is also satisfied since 


rhea +f  as| =i, 


x 


We can verify that y(x) is the solution of the given initial value problem using the following steps in Mathematica: 
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y[x_]=x + x*Integrate[Cos[t]/t, {t, 1, x}] 

x*Dly[x], x] - ylx] 

Simplify([%] 

ytt] (* to verify the initial value at x=1 *) 


Definition 1.4: A singular solution of y’ = f(z,y) is a function that is not a special case of the general 


solution and for which the uniqueness of the initial value problem has failed. 


Not every differential equation has a singular solution, but if it does, its singular solution cannot be determined 
from the general solution by setting a particular value of C’, including too, because integral curves of the general 
solution have no common points. A differential equation may have a solution that is neither singular nor a member 
of the family of one-parameter curves from the general solution. According to the definition, a singular solution 
always has a point on the plane where it meets with another solution. Such a point is usually referred to as a branch 
point. At that point, two integral curves touch because they share the same slope, y’ = f(x,y), but they cannot 
cross each other. For instance, functions y = x? and y = «+ have the same slope at « = 0; they touch but do not 
cross. 

A singular solution of special interest is one that consists entirely of branch points—at every point it is tangent 
to another integral curve. An envelope of the one-parameter family of integral curves is a curve in the xy-plane 
such that at each point it is tangent to one of the integral curves. Since there is no universally accepted definition of 
a singular solution, some authors define a singular solution as an envelope of the family of integral curves obtained 
from the general solution. Our definition of a singular solution includes not only the envelopes, but all solutions 
that have branch points. This broader definition is motivated by practical applications of differential equations in 
modeling real-world problems. The existence of a singular solution gives a warning signal in using the differential 
equation as a reliable model. 

A necessary condition for the existence of an envelope is that x, y, C satisfy the equations: 

O® 


®(x,y,C) =0 and 30 7 0, (1.4.2) 


where ®(x,y,C) = 0 is the equation of the general solution. Eliminating C may introduce a function that is not a 
solution of the given differential equation. Therefore, any curve found from the system (1.4.2) should be checked on 
whether it is a solution of the given differential equation or not. 


Example 1.4.2: Let us consider the equation 


y =2/y (y= 0), (1.4.3) 


where the radical takes positive sign. Suppose y > 0, we divide both sides of Eq. (1.4.3) by 2,/y, which leads to a 
separable equation (see §2.1 for detail) 


/ d 12p 
y 1 or avy =1 


2/5 dc nf 


From chain rule, it follows that 


dy day 1 ape, 
ae age ee 


Hence \/y = x+C, where x > —C. The general solution of 
Eq. (1.4.3) is formed by the one-parametric family of semi- 
parabolas 


Figure 1.4: Example 1.4.2: some solutions 


y(z)=(e+C), or C= Vy-2, r>-C. to y’ = 2,/y along with the singular solu- 
tion y = 0, plotted with Mathematica. 


The potential function for the given differential equation is y(x,y) = /y— «x. Eq. (1.4.3) has also a trivial 
(identically zero) solution y = 0 that consists of branch points—it is the envelope. This function is a singular 
solution since y = 0 is not a member of the family of solutions y(a) = (2 + C)? for any choice of the constant C. 
The envelope of the family of curves can also be found from the system (1.4.2) by solving simultaneous equations: 
(x + C)? —y = 0 and 00/0C = 2(x+ C) = 0, where ®(x, y,C) = (x +C)? — y. We can plot some solutions together 
with the singular solution y = 0 using the folowing Mathematica commands: 
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qi = Plot[Evaluate[(x + C[1])°2 /. C[1] -> {0, 1, -1}], {x, -3.5, 3.5}, 

AxesLabel -> {x, Y}] 
q2 = Plotly = 0, {x, -3.5, 3.5}, PlotStyle -> Thick] 
Show[qi, q2] 
Actually, the given equation (1.4.3) has infinitely many singular solutions that could be constructed from the singular 
envelope y = 0 and the general solution by piecing together parts of solutions. An envelope does not necessarily 
bound the integral curves from one side. For instance, the general solution of the differential equation y! = 3 y?/* 
consists of y = (2 + C)° that fill the entire zy-plane. Its envelope is y = 0. 


Example 1.4.3: The differential equation 5y’ = 2y~3/?, y 4 0, has the one-parameter family of solutions y = 
(a — C)?/>, which can be written in an implicit form (1.4.2) with ®(a,y,C) = y° — (a — C)?. Differentiating with 
respect to C and equating to zero, we obtain y = 0, which is not a solution. This example shows that conditions 
(1.4.2) are only necessary for the envelope’s existence. 


Example 1.4.4: Prove that the function y() defined implicitly from the equation y = arctan(x + y)+C, where C 
is a constant, is the general solution of the differential equation (x + y)? y’ = 1. 
Solution. The chain rule shows that 


dy d d 
= - aaa [arctan(a + y) + C] Ae [x + y] 
1 dy dy dy 2 dy 
_ 1 eed Peasy ieee — 1 oe 
1+(a+y)? | oa ae tae et Tt ae 


From the latter, it follows that y/(a@ + y)? = 1. 


The next example demonstrates how for a function that contains an arbitrary constant as a parameter we can 
find the relevant differential equation for which the given function is its general solution. 


Example 1.4.5: For an arbitrary constant C, show that the function y = C ae is the solution of the differential 
equation 
(1 + 2ary) dx + (1 +27) dy =0. (1.4.4) 
Prove that this equation has no other solutions. 
Solution. The differential of this function is 
—(1+ 27) -(C—2)2x x? —1-2Cx 
dy = y' dz = rll + 2") — (C ~ 2)2e dz = ————_ dz. 
ng G+2)? Tp ae 
Multiplying both sides by 1 + 2”, we have 
x? —1—2Cx —a? —14 227 —2Cx C-x 
iL 2) dy = ————— dg = da = — [1+ 22 ——_ ) a: 
eae 14+ 2? “ 14+ 2? . ( " 5) . 
and, since y = (C — x)/(1+ 27), we get 
21-26 
—(1+27)dy= ie (1 + 2ay) da. 
1+? 


We are going to prove now that there is no solution other than y = (C — x)/(1 +27). Solving for C, we find 
the potential function ~(a, y) = (1+ 27)y +a. Suppose the opposite, that other solutions exist; let y = (x) be a 
solution. Substituting y = ¢(x) into the potential function w(x, y), we obtain a function that we denote by F(z), 
that is, F(x) = (1+ 2?) 6(x) +2. Differentiation yields 

F'() = 2a¢(a) + (1+ 27) ¢'(z) +1. 
1+ 2x 


Since ¢/(x) = ge and we get 


F'(x) = 2x $(x) — (1+ 27 ¢(x)) +1=0. 


Therefore, F(x) is a constant, which we denote by C. That is, ¢(a) = 
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1.5 Direction Fields 


A geometrical viewpoint is particularly helpful for the first order equation y’ = f(x, y). The solutions of this equation 
form a family of curves in the xy-plane. At any point (a, y), the slope dy/dz of the solution y(x) at that point is 
given by f(z, y). We can indicate this by drawing a short line segment (or arrow) through the point (x, y) with the 
slope f(x,y). The collection of all such line segments at each point (a, y) of a rectangular grid of points is called a 
direction field or a slope field of the differential equation y’ = f(z, y). 

By increasing the density of arrows, it would be possible, in theory at least, to approach a limiting curve, the 
coordinates and slope of which would satisfy the differential equation at every point. This limiting curve—or rather 
the relation between x and y that defines a function y(x)—is a solution of y’ = f(a, y). Therefore the direction field 
gives the “flow of solutions.” Integral curves obtained from the general solution are all different: there is precisely 
one solution curve that passes through each point (x,y) in the domain of f(#,y). They might be touched by the 
singular solutions (if any) forming the envelope of a family of integral curves. At each of its points, the envelope is 
tangent to one of integral curves because they share the same slope. 

Direction fields can be plotted for differential equations even if they are not necessarily written in the normal 
form. If the derivative y’ is determined uniquely from the general equation F(x, y, y’) = 0, the direction field can be 
obtained for such an equation. However, if the equation F(a, y,y’) = 0 defines multiple values for y’, then at every 
such point we would have at least two integral curves with distinct slopes. 


Example 1.5.1: Let us consider the differential equation not in the normal form: 
a(y’)* — 2yy' +2 =0. (1.5.1) 


At every point (x,y) such that y? > x? we can assign to y’ two distinct values 


ef. 2 m2 
pi - YRVE TY (ye —2? > 0). 


Y= 
x 


When y? < x”, Eq. (1.5.1) does not define y’ since the root becomes imaginary. Therefore, we cannot draw a direction 
field for the differential equation (1.5.1) because its slope function is not a single-value function. Nevertheless, we 
may try to find its general solution by making a guess that it is a polynomial of the second degree: y = Cxz?+ Br+A, 
where coefficients A, B, and C are to be determined. Substituting y and its derivative, y’ = 2C'x+ B, into Eq. (1.5.1), 
we get B=0 and A=1/(4C). Hence, Eq. (1.5.1) has a one-parametric family of solutions 


1 
=Cr*+—. 152 
y= Cx +75 (1.5.2) 
For any value of C, C #0, y? is greater than or equal to x”. To check our conclusion, we use Maple: 
dsolve(x* (diff (y (x) ,x))°2-2*y (x) *diff (y (x) ,x)+x=0,y(x)); 
phi:=(x,C)->C*#x*x+0.25/C; # the general solution 
plot ({subs (C=.5,phi(x,C)) ,phi(x,-1),x},x=-1..1,y=-1..1,color=blue) ; 


Let us consider any region R of the xy-plane in which f(z, y) is a real, single-valued, continuous function. Then 
the differential equation y’ = f(x,y) defines a direction field in the region R. A solution y = (a) of the given 
differential equation has the property that at every point its graph is tangent to the direction element at that point. 
The slope field provides useful qualitative information about the behavior of the solution even when you cannot 
solve it. Direction fields are common in physical applications, which we discuss in [14]. While slope fields prove 
their usefulness in qualitative analysis, they are open to several criticisms. The integral curves, being graphically 
obtained, are only approximations to the solutions without any knowledge of their accuracy and formulas. 

If we change for a moment the notation of the independent variable x to t, for time, then we can associate the 
solution of the differential equation with the trajectory of a particle starting from any one of its points and then 
moving in the direction of the field. The path of such a particle is called a streamline of the field. Thus, the 
function defined by a streamline is an integral of the differential equation to which the field applies. A point through 
which just one single integral curve passes is called an ordinary point. 

When high precision is required, a suitably dense set of line segments on the plane region must be made. The labor 
involved may then be substantial. Fortunately, available software packages are very helpful for practical drawings of 
direction fields instead of hand sketching. 

There is a friendly graphical program, Winplot, written by Richard Parris, a teacher at Phillips Exeter Academy 
in Exeter, New Hampshire. Mr. Parris generously allows free copying and distribution of the software and provides 
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As we see from Figure 1.5, integral curves intersect each 
other, which would be impossible for solutions of a dif- 
ferential equation in the normal form. Indeed, solving 
Eq. (1.5.2) with respect to C, we observe that for every 
point (x,y), with y? > x”, there are two distinct values of 
C= (y + VP — 2) /x?. For instance, Eq. (1.5.1) defines 
two slopes at the point (2,1): 2+ V3. 

Let us find an envelope of singular solutions. According to 
Eq. (1.4.2), we differentiate the general solution (1.5.2) with 
respect to C, which gives x? = —1/(407). Eliminating C 
from these two equations, we obtain 2? — y? = 0 or y = £2. 
Substitution into Eq. (1.5.1) yields that these two functions 
are its solution. 

Hence, the given differential equation has two singular so- 


lutions y = +2 that form the envelope of integral curves Figure 1.5: Example 1.5.1: lou solutions 
corresponding to the general solution. along with two singular solutions, plotted in 
Maple. 


frequent updates. The latest version can be downloaded from the website: 
http://math.exeter.edu/rparris/winplot.html. The program is of top quality and is easy to use. 

You can find many online applications for plotting direction fields by entering ”desmos direction fields” into any 
search engine. These include https://bluffton.edu/homepages/facstaff/nesterd/java/slopefields.html, 
http://www. geogebra.org, http: //www.mathscoop.com, and http://slopefield.nathangrigg.net. 


Maple 

It is recommended to clear the memory before starting a session by invocation of either restart or gc( ) for 
garbage collection. Maple is particularly useful for producing graphical output. It has two dedicated commands 
for plotting flow fields associated with first order differential equations—DEplot and dfieldplot. For example, the 
commands 
restart; with(DEtools): with(plots): 
dfieldplot (diff (y(x) ,x)=y(x)+x, y(x), x=-1..1, y=-2..2, arrows=medium) ; 
allow you to plot the direction field for the differential equation y’ = y+. To include graphs of some solutions into 
the direction field, we define the initial conditions first: 
inc:=[y(0)=0.5,y(0)=-1]; 
Then we type 
DEplot (diff (y(x) ,x)=y(x)+x, y(x), x=-1..1, y=-2..2, inc, arrows=mediun, 
linecolor=black, color=blue,title=‘Direction field for y’=y+x‘) ; 


There are many options in representing a slope field, which we demonstrate in the text. A special option, dirgrid, 
specifies the number of arrows in the direction field. For instance, if we replace Maple’s option arrows=medium with 
dirgrid=[16,25], we will get the output presented in Figure 1.10. 

The computer algebra system (CAS for short) Maple also has an option to plot the direction fields without arrows 
or with comets, as can be seen in Figures 1.8 and 1.9, plotted with the following script: 
dfieldplot (x*diff (y(x) ,x)=3*y(x)+2*x,y(x), 
x=-1..1,y=-2..2,arrows=line,title=‘Direction field for xy’=3y+2x‘); 
DEplot (x*diff (y (x) ,x)=3*y(x)+x73,y(x), 
x=-1..1,y=-2..2,arrows=comet ,title=‘Direction field for xy’=3y+x*x*x‘); 

You may draw a particular solution that goes through the point x = 7/2, y = 1 in the same picture by typing 
DEplot (equation, y(x), x-range, y-range, [y(Pi/2)=1], linecolor=blue). 
Maple also can plot direction fields with different colors: 
dfieldplot (diff (y(x) ,x)=f(x,y), y(x), x-range, y-range, color=f(x,y)). 


Mathematica 

It is always a good idea to start Mathematica’s session with clearing variables or the kernel. With Mathematica, 
only one command is needed to draw the direction field corresponding to the differential equation y’ = f(t, y). By 
choosing, for instance, f(t, y) = 1 —t? — y, we type: 
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dydt=1-1'2-y 
anh Vy 
v1 1 
vA \ 
a Vy] 
\\ \ 
\\ Vy 
YA Vy 
o}-y ; yt 
YN N24 «% 4fe7- =~ \X\\ 
\ Ns - 4 4 A poorer ~~ \\ 
es Se PS EM LP fe YJ 
SENN eA PAM A Ped 
Nos eo LP AEP ff toe 
Nw er PAA Sf ff ee ~s 
2+» - 4 Af AA hd df goer ew 5 
-2 -l 0 1 2 
Figure 1.6: Direction field for the equation y’ = 
(y+2)/(y — 2), plotted using Winplot. There is no Figure 1.7: Direction field along with two solutions 
information in a neighborhood of the singular line for the equation y’(t) = 1 — t? — y(t), plotted with 
y=. Mathematica. 
Direction field_for xy’=3y+2x wMUeceee, ) , , 
VVV VV AA LL a3 | 
STO Tie aaa d Satecesccllaoeeorae.. 
SNNAVUL HELE LT SVAVALILEU LET Peper 
\N\NAVAVV AAS LETT id de SNA NT DT a ay 
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SNAAAN\\\ 1 I ELLIS NNNNANAVAVYUE EEL LLLP SS 
SN de Pee SNA SANA DU Eg ee ee Se 
eS NA LL SSSNSNSANVA GD DEL ZAL ASS 
42 NM III ILS SSSSNSNNNAV UD DISA AASASSS 
LALA AA A-eN\IL LL LLL LL ————— NN A 7 24 2 2 2 27 
EY PL Aaa eee! AOE Ag? AN NSS 
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PEELE REAL FESS PEAS 
PITT TTT Pao PTAA AVAANA PPPPP Pit PA bya VV NANA 
Figure 1.8: Direction field for xy’ = 3y + 22, plotted Figure 1.9: Direction field for ry’ = 3y + x3, plotted 
with Maple. with Maple. 


dfield = VectorPlot[{1,1-t*2-y}, {t, -2, 2}, fy, -2, 2}, Axes -> True, 
VectorScale -> {Small,Automatic,None}, AxesLabel -> {"t", "dydt=1-t*2-y"}] 


The option VectorScale allows one to fix the arrows’ sizes and suboption Scaled[1] specifies arrowhead size relative 
to the length of the arrow. To plot the direction field along with, for example, two solutions, we use the following 
commands: 

soli = DSolve[{y’[t] == 1 - y[t] - t*2, y[0] 
sol2 = DSolve[{fy’ [t] == 1 - y[t] - t°2, y[0] 
pp1 = Plot[y[t] /. soli, {t, -2, 2}] 

pp2 = Plot[y[t] /. sol2, {t, -2, 2}] 
Show[dfield, pp1, pp2] 


AF Fle) s t) 
-i}; y(t], €) 


For plotting streamlines/solutions, CAS Mathematica has a dedicated command: StreamPlot. If you need to plot 
a sequence of solutions under different initial conditions, use the following script: 
myODE = t~2*y’[t] == (y[t])°3 - 2+t*y[t] 
Ic = {{0.5, 0.7}, {0.5, 4}, {0.5, 1}}; 
Do[ansODE[i] = 
Flatten[DSolve[{myODE, y(CIC[{[i, 1]]] == IC([(i, 2]]}, y(t], t]]; 
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Figure 1.10: Direction field for y’ = y + x using (a) arrows=medium and (b) dirgrid, plotted with Maple. 


myplot[i] = Plot[Evaluatel[y[t] /. ansODE[i]], {t, 0.02, 5}]; 
Print[myplot[i]]; , fi, 1, Length[IC]}] 


Note that Mathematica uses three different notations associated with the symbol “=.” Double equating “==” is 
used for defining an equation or for testing an equality; regular “=” is used to define instant assignments, while “:=’ 
is used to represent the left-hand side in the form of the right-hand side afresh, that is, unevaluated. 


MATLAB® 

MATLAB is a numerical computing environment that also includes CAS subroutines: MuPad (based on Maple) 
and Live Editor. Before beginning a new session, it is recommended that you execute the cle command to clear the 
command window and the clear command to remove all variables from memory. In order to plot a direction field 
with MATLAB, you have several options. One of them includes creation of an intermediate file’, say function1.m, 
yielding the slope function f(x,y). Let us take a simple example f(x,y) = xy”. This file will contain the following 
three lines (excluding comments followed after %): 
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y% Function for direction field: (function1.m) 
function F=function1 (x,y); 
Fex*y*y ; 
F=vectorize(F); % to get a vectorized version of the function, which is optional 


If a function f(x,y) is not complicated, it could be defined directly within MATLAB code. We demonstrate it in the 
case of the rational slope function f(x,y) = (w+ 2y — 5)/(2% + 4y — 7) that is used in Example 2.2.3 on page 56: 


der=@(x,y) (x+2*y-5) ./(2*x+4*y+7); % define slope function 


equ=@(x) (3-4*x)/8; % define equilibrium 
sig=@(x) -(7+2*x)/4; % define singular 
xmin=-5; xmax=5; ymin=-5; ymax=5; % set the frame 
dx=(xmax-xmin) /20; % set spatial steps 


dy=(ymax-ymin) /20; 

[X,Y]=meshgrid(xmin:dx:xmax, ymin:dy:ymax); % generate mesh 
Dx=ones(size(X)); % Unit x-components of arrows 
Dy=der (X,Y); % Computed y-components 
L=sqrt(Dx.~2 + Dy.*2); % Initial lengths 

Dx1=Dx./L; Dy1=Dy./L; % Unit lengths for all arrows 
quiver(X,Y, Dx1,Dy1, ’b-’); % draw the direction field 

axis tight; % sets the axis limits 

% to the range of the data 


4All script files in MATLAB must have extension m. 
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xlabel(’x’,’FontSize’ ,16); % set labels, fontsize for them 
ylabel(’y’,’FontSize’ ,16,’rotation’,0); % and direction for letter y 
set(gca, ’FontSize’, 12); % set fontsize for axis 
hold on 
xx=xmin:dx/10:xmax; 
plot(xx, equ(xx), ’k’, ’LineWidth’, 3); 
plot(xx, sig(xx), ’k--’, ’LineWidth’, 3); 
% below are several solutions based on different initial conditions 
[x,yl]=ode45(der, [-5 5], 3.1); % IC: y(-5)=3.1 
plot(x,yl,’r’, ’LineWidth’, 2); 
[x,y2]=ode45(der, [-5 5], 4.0); % IC: y(-5)=4.0 
plot(x,y2,’r’, ’LineWidth’, 2); 
print -deps direction_field.eps; % or print -deps2 direction_field.eps; 
print -depsc direction_field.eps; % for color image 


In the above code, the subroutine quiver(z, y, u,v) displays velocity vectors as arrows with components (uw, v) 
at the points (x,y). To draw a slope field without arrows, quiver is not needed as the following code shows. If a 
graph of a function needs not to be plotted along with the slope field, comment out the last line. 


func = @(x) 3*x.72 - 8*x; % slope function 

ifunc = @(x) x.73 - 4*x.°2; % integral from 0 to x 

xmin = -2; xmax = 5; % set limits for x 

dx = (xmax - xmin)/100; % step for computing ifunc 

XX = xmin:dx:xmax; % arguments for computing ifunc 

sing = ifunc(xx); % computing ifunc 

dy = (max(sing)- min(sing))/100; %step along y-axis for slope field 
figure; axis tight; % set the axis limits to the range of the data 


for y = min(sing) :6*dy:max(sing) % loop for plotting a slope field 
for x = xmin:6*dx:xmax 
c=2/sqrt((1/dx)°2 + (func(x)/dy).*2 ); d=func(x).*c; 
tpt = [x -c, x + c]; ypt=ly - d, y + d]; 
line(tpt,ypt); % plot an element of slope field 
end; 
end 
hold on; plot(xx, sing, ’k-’, ’LineWidth’, 3); hold off; 


As you have seen from the above two scripts, although resourceful, MATLAB does not have the ability to plot 
direction fields naturally. However, many universities have developed some software packages that facilitate drawing 
direction fields for differential equations. For example, John Polking at Rice University has produced dfield and 
pplane programs for MATLAB. The MATLAB versions of dfield and pplane are copyrighted in the name of John 
Polking [38, 39]. While they are not in the public domain, these subroutines are being made available free of charge 
to educational institutions. Another possibility to plot slope fields provides MuPad, an integrated CAS in MATLAB. 


Maxima 

Mazima and its popular graphical interface wrMazima’ are free software® projects. This means that you have the 
freedom to use them without restriction, to give copies to others, to study their internal workings and adapt them to 
your needs, and even to distribute modified versions. Maxima is a descendant of Macsyma, the first comprehensive 
computer algebra system, developed in the late 1960s at the Massachusetts Institute of Technology. 

Maxima provides two packages for plotting direction fields, each with different strengths: plotdf supports 
interactive exploration of solutions and variation of parameters, whereas drawdf is non-interactive and instead 
emphasizes the flexible creation of high-quality graphics in a variety of formats. Let us first use drawdf to plot the 
direction field for the differential equation y’ = e~* + y: 
load (drawdf) ; 
drawdf(exp(-t)t+y, [t,y], [t,-5,10], [y,-10,10]); 


5See http://maxima.sourceforge.net/ and http://wxmaxima.sourceforge.net/ 
°See http://www.fsf.org/ for more information about free software. 
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(a) (b) 


Figure 1.11: Direction fields and streamlines for the equations (a) x’(t) = x?(t)+#? and (b) 2’(t) = 2?(t) —¢? using 
dfield in MATLAB. 


Note that drawdf normally displays graphs in a separate window. If you are using wxeMaxima (recommended for 
new users) and would prefer to place your graphs within your notebook, use the wxdrawdf command instead. The 
load command stays the same, however. 
Solution curves passing through points y(0) = 0, y(0) = —0.5, and y(0) = —1 can be included in the graph as 

follows: 
drawdf(exp(-t)+y, [t,y], [t,-5,10], [y,-10,10], 

solns_at([0,0], [0,-0.5], [0,-1])); 
By adding field_degree=2, we can draw a field of quadratic splines (similar to Maple’s comets) which show both 
slope and curvature at each grid point. Here we also specify a grid of 20 columns by 16 rows, and draw the middle 
solution thicker and in black. 
drawdf(exp(-t)+y, [t,y], [t,-5,10], [y,-10,10], 

field_degree=2, field_grid=[20,16], solns_at([0,0], [0,-1]), 

color=black, line_width=2, soln_at(0,-0.5)); 


We can add arrows to the solution curves by specifying soln_arrows=true. This option removes arrows from 
the field by default and also changes the default color scheme to emphasize the solution curves. 
drawdf(exp(-t)t+y, [t,y], [t,-5,10], [y,-10,10], field_degree=2, 

soln_arrows=true, solns_at([0,0], [0,-1], [0,-0.5]), 
title="Direction field for dy/dt = exp(-t) + y", 
xlabel="t", ylabel="y") ; 


Actual examples of direction fields plotted with Maxima are presented on the front page of Chapter 1, and scattered in 
the text. The following command will save the most recent plot to an encapsulated Postscript file named “plotl.eps” 
with dimensions 12cm by 8cm. Several other formats are supported as well, including PNG and PDF. 


draw_file(terminal=eps, file_name="ploti", 
eps_width=12, eps_height=8) ; 


Since drawdf is built upon Mazima’s powerful draw package, it accepts all of the options and graphical objects 
supported by draw2d, allowing the inclusion of additional graphics and diagrams. To investigate differential equations 
by varying parameters, plotdf is sometimes preferable. Let us explore the family of differential equations of the 
form y’ = y—a+bcost, with a solution passing through y(0) = 0: 
plotdf(y-atb*cos(t), [t,y], [t,-5,9], [y,-5,9], 

[sliders,"a=0:2,b=0:2"], [trajectory_at,0,0]); 

You may now adjust the values of a and 6 using the sliders in the plot window and immediately see how they affect 
the direction field and solution curves. You can also click in the field to plot new solutions through any desired 
point. Make sure to close the plot window before returning to your Mazima session. 


Sage 
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Figure 1.12: Direction field for the equation y'(x) = 6 x? — 3y + 2 cos(x + y), plotted with Sage. 


SageMath is a free open-source mathematics software system licensed under the GPL. It builds on top of many 
existing open-source packages Python, R, Julia (computational geometry), GAP (discrete algebra), Octave, including 
computer algebra systems Maxima and SymPy, and much more. Sage comes with two options: one can download 
it’ (for free) or use it interactively through cloud version. SageMath is available for every platform, and is ubiqui- 
tous throughout industry. SageMathCloud supports authoring documents written in 4T@X, Markdown, or HTML. 
SageMathCloud also allows you to publish documents online. 

To plot a direction field for first order differential equation y’ = f(x,y) using Sage we first declare variables and 
then we use a standard command, which we demonstrate in the following example: y’ = 62? — 3y +2 cos(x + y). 
x,y = var(’x,y’) 
plot_slope_field(6*x*2 -3*y+2*cos(xty), (x,-3,3), (y,-2,4), xmax=10) 


Python 


Python® is a high-level and general-purpose programming language (free of charge). Part of the reason that it 
is a popular choice for scientists and engineers is the language versatility, online community of users, and powerful 
analysis packages such as NumPy, SciPy, and, of course, SymPy, a CAS written completely in Python. Anaconda is 
a free Python distribution from Continuum Analytics that includes many useful packages for scientific computing. 

The function odeint is available in SciPy for integrating first order vector differential equations. A higher order 
ordinary differential equation can always be reduced to a differential equation of this type by introducing intermediate 
derivatives into the vector (see §6.3). There are many optional inputs and outputs available when using odeint that 
can help tune the solver. 


1. For each equation below, determine its order, and name the independent variable, the dependent variable, and any 
parameters. 


(a) y=y? +27; (b) P=rP(1— P/N); (c) mt+ra+ka=sint; 
(d) Lé+gsin0=0; (e) (ry (x)? =2? y(x); (f) 2y”+3y' +5y=e7°. 
2. Determine a solution to the following differential equation: 


(1 — 227) y” — ay’ + 6y =0 


of the form y(«) = a + bx + cx? satisfying the normalization condition y(1) = —1. 


3. Differentiate both sides of the given equation to eliminate the arbitrary constant (denoted by C’) and to obtain the 
associated differential equation. 


"See http: //www.sagemath. org/. 
Shttps://www.python.org/ 
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10. 


11. 


12. 


13. 
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(a) a? 4+4ry?=C; (b) 2? 4222y=C; (c) ycosa—ay?=C; 
(d) Cx? =y?+2y; (e) e?—Iny=C; (f) 3tana + cos’ y = C; 
(g) Cx =In(cy); (h) Cy+Inz=0; (Gi) y-lt+peR =9; 
G@ yer (k) yt+Cur=2'; (1) sin? y—tanz =C. 


. Find the differential equation of the family of curves yn(ax) = (1 + 2)" , where n # 0 is a parameter. Show that y = e” 


is a solution of the equation found when n — oo. 


. Find a differential equation of fourth order having a solution y = Ax cosx + Ba sin, where A and B are arbitrary 


constants. 


. Which of the following equations for the unknown function y(x) are not ordinary differential equations? Why? 


(a) Seer =F '@))s (6) va) = ule - 1) - v(@)/35 
(c) f% cos(ha)y/(x) dx = VEmy'(k); (a) y"(a) = 9 -y'(@)ly'(@)h 


(ec) [en*y(a) dx = et /t (f) y(a) =1+4 f y(t)at. 
. Determine the order of the following ordinary differential equations: 
(a) (y')? +07? y = 0; (b) = (yy’) =sing; (c) a +y+(y')? =1; 
(d) 2” y+ a(ty)+y=0; (e) gly’ sina) =0; (f) Bul (t) +t? u(t) + tu(t) =0; 
(g) [tu (t)] =0; (bh) (w(t) =#°; Gy = 4+’)? 
G) ay" +) +e7=0; (Kk) ay” Fsign(z)=0; (1) (y")? Fy! sina = cose. 
. Which of the following equations are linear? 
(a) y 40° y =O; (b) ae (yy') = sina; (c) gley) = 0; 
d +y sing = 1; e —2° =0; f —¢° =0, 
) . : 7) 1: S) (y) " 2 <A) (f) Y ‘= 2 
8) alt +¥]=h (bh) y"(a)+ey"(a)+y(@)=0; (i) y= Vay; 
(i) u(x) +2 y(x) = cosa; (k) y' =a? +47; (1) y" +y! sinz = cose. 


. Let y(x) be a solution for the ODE y"(x) = xy(x) that satisfies the initial conditions y(0) = 1, y’/(0) = 0. (You will 


learn in this course that exactly one such solution exists.) Calculate y’’(0). The ODE is, by its nature, an equation 
that is meant to hold for all values of x. Therefore, you can take the derivative of the equation. With this in mind, 
calculate y’”"(0) and y (0). 
In each of the following problems, verify whether or not the given function is a solution of the given differential equation 
and specify the interval or intervals in which it is a solution; C always denotes a constant. 

(a) y”+4y=0, y=sin2x+C. (b) y’—y?(x) sing = 0, y(x) = 1/cosz. 


(c) 2yy’ = 1, y(x) = Ve $1. (d) wy" y’ +423 y = 0, y = sin(x? +1). 
(ce) y! =ky, y(x) =Ce. (f) y= 1—hy, kyla) =1+Ce™. 
(g) y’” =0, y(x) = Ca”. (h) y+ 2y =2 cos’ a, y(x) = sin? x. 


(i) y" —5y'+6y=0,y=Ce™*. (j)_ yy" $2y' ty =0, w(x) =Caze*. 

y’ — 2ry = 1, xy —y=x sina, 
9 y=e™ ie ef? dt+ Ce®’. (1) y(x) =a fy Btdt+ Cz. 
Solutions to most differential equations cannot be expressed in finite terms using elementary functions. Some solutions 
of differential equations, due to their importance in applications, were given special labels (usually named after an 
early investigator . its properties) 2 therefore are referred to as special functions. For example, ee are two known 
sine ibe oe Si(x =f; = sint dt, si(x) = — oie smt dt = Si(x) — = and three cosine integrals: Cin(x =f docost dt, 
ci(w) = — fre ss a i and Gila )=%7 ie Ina + {> “* dt, where 7 © 0.5772 is Euler’s constant. Use eae ieeeeaton 
to find an eplcil solution to the initial value problem xy’ = sin zx, subject to y(1) = 1. 


Verify that the indicated function is an implicit solution of the given differential equation. 
(a) (yt+l)y’ +24+2=0, (c+ 2)? + (yt+1)? =C’; 

(b) xy’ =(1-y*)/y, — yw-Iny =C; 

(c) y =(/Gt+y)+1+y)/1+4+2), CV¥1i+a-/IFy=1; 

(d) y =(y—2)?/2?,  (y—2)(1+ Cx) =a. 
Show that the functions in parametric form satisfy the given differential equation. 

@ 3a dx + 2/4 — x2 dy = 0, (b) (a + 3y)dy = (5a + 3y)da, 


x =2cost, y= 3sint; cae 430%, y=—e 7% 456%: 
(c) 2y’ =y-1,¢2=t?, y=t+1; (d) yy =2, x=tant, y=sect; 
(e) ay’ =3y, r=t?, y=?; (f) ary’ =by, r=t*, y=? 
(g) y =—4z, x = sint, y=cos2t; (h) 9yy' =42, x =3cosht, y = 2sinht; 
. 2yt+ly =1, . Yt ee mee . 


(e+1)y' =1, (1) ayy’ = ba, 
x=t—-1, y=Int, t>0; x =acosht, y = bsinht. 


(k) 
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Determine the value of \ for which the given differential equation has a solution of the form y = e”’. 


(a) y'—3y=0; (b)  y"—4y'+3y=0; (ec) y"—y'—2y=0; 
(d)  y"—2y'—3y=0;  (e) y+ 38y"+2y'=0; (f)  y — 3y" + 8y’-y =0. 
Determine the value of \ for which the given differential equation has a solution of the form y = x. 
(a) wy" + 2ey'—2y=0; (b) azy’—2y=0; (ce) ay" — 3ay’ + 8y = 0; 
(d)  «x?y"+2ay’—6y=0; (e) x?y”-—6y=0; (f) a?y” —5ay’+5y=0. 
Show that (a) the first order differential equation |y’| + 4 = 0 has no solution; (b) |y’| + y? +1 = 0 has no real 
solutions, but a complex one; (c) |y’| + 4|y| = 0 has a solution but not one involving an arbitrary constant. 


Show that the first order differential equation y’ = 4,/y has a one-parameter family of solutions of the form y(x) = 
(2x + C)*, 22 + C > 0, where C is an arbitrary constant, and a singular solution y(2#) = 0 which is not a member of 
the family (22 + C)? for any choice of C. 


Find a differential equation for the family of lines y = Cx — C?. 
For each of the following differential equations, find a singular solution. 
(a) y! = 30? —(y— 2°); (b) y= V(@+1(y- 1); 
(c) y= Va? —y + 2a; (d) y= (2y)*+(y?- 2)"; 
(e) yo! = (a? 4+3y—2)? 422; (f) yy! = 2(@ +1)(2? + 2a — 3)7/9. 
Show that y = +a are singular solutions to the differential equation yy’ = \/a? — y?. 
Verify that the function y = 7+4V/x +1 is a solution of the differential equation (y—x) y’ = y—x+8 on some interval. 


The position of a particle on the x-axis at time t is x(t) = t@9 for t > 0. Let v(t) be the velocity of the particle at time 
t. Find lim¢-4o v(t). 


An airplane takes off at a speed of 225km/hour. A landing strip has a runway of 1.8km. If the plane starts from rest 
and moves with a constant acceleration, what is this acceleration? 


Let m(t) be the investment resulting from a deposit mo after t years at the interest rate r compounded daily. Show 
that 

[1 : r ia 

m(t) =m — 

Osi 365 


From calculus we know that [1 + =] "’_y ert as n— oo. Hence, m(t) > mo exp{rt}. What differential equation does 
the function m(t) satisfy? 


A particle moves along the abscissa so that its instantaneous acceleration is given as a function of time t by a(t) = 2—3t?. 
At times t = 1 and t = 4, the particle is located at x = 5 and x = —10, respectively. Set up a differential equation and 
associated conditions describing the motion. 


A particle moves along the abscissa in such a way that its instantaneous velocity is given as a function of time t by 
v(t) = 6— 3t?. At time t = 0, it is located at 2 = 1. Set up an initial value problem describing the motion of the 
particle and determine its position at any time t > 0. 


A particle moves along the abscissa so that its velocity at any time t > 0 is given by u(t) = 4/(t? +1). Assuming that 
it is initially at 7, show that it will never pass x = 2. 


The slope of a family of curves at any point (x,y) of the plane is given by 1+ 2x. Derive a differential equation of the 
family and solve it. 


The graph of a nonnegative function has the property that the length of the arc between any two points on the graph 
is equal to the area of the region under the arc. Find a differential equation for the curve. 


Geological dating of rocks is done using potassium-40 rather than carbon-14 because potassium has a longer half-life, 
1.28 x 10° years (the half-life is the time required for the quantity to be reduced by one half). The potassium decays 
to argon, which remains trapped in the rocks and can be measured. Derive the differential equation that the amount 
of potassium obeys. 


Prove that the equation y’ = (ay + b)/(cy +d) has at least one solution of the form y = kz if either b = 0 or ad = be. 


Which straight lines through the origin are solutions of the following differential equations? 


(a) y= 24, ey yf = SPH, (ey yf = SY, 

er y-x ar 
d peed: i a ere 6 (pmuieel 
(d) y fea (e) y oy (f) y reas 


Phosphorus (?!P) has multiple isotopes, two of which are used routinely in life-science laboratories dealing with DNA 
production. They are both beta-emitters, but differ by the energy of emissions—*’P has 1.71 MeV and *°P has 0.25 MeV. 
Suppose that a sample of 32-isotope disintegrates to 71.2mg in 7 days, and 33-isotope disintegrates to 82.6mg during 
the same time period. If initially both samples were 100mg, what are their half-life periods? 
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: Direction field for Problem 36. 


0.5 
3 


Affe t 

fap fp f 

POTS f 

A444 4 

APPA f 

AALA A 
AAPAALALALA LAP AP AP AP AP AA 


AAA AAA AAAAAAAAAAAAAAAAAA 
A AAA AAA AAA AAA AAA AA AAAAAA 


AAA AAA AAA AAA AAA AAA AAA AAS 
ly or oe oe 


| ee ee ee 
[> > > — ee 


ee 
Je te 
SANS ANAS NAN SN AS SNA NSS SANA ASAIN 


! 
! 
Z 
/ 
7 


LOLLSLLELLLLLLLLLSLLSLLLLLELELSLLES 


1 
1 
! 
LL 
7 
/ 
be 


Paw paeae Ca 


Figure 1.15: Direction field for Problem 38. 
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y=y(y+1); 
y=yt3; 
y=3yt1. 


(1) 


39. The direction field of Figure 1.16. 
40. The direction field of Figure 1.17. 
41. The direction field of Figure 1.18. 


(d) 
(h) 


Figure 1.18: Direction field for Problem 41. 


0 has infinitely many solutions. 
y(0) = 2; 
y(y +3)"; 
(y + 3)?; 
y=—y +3; 


y 
y 


(c) 
(g) 


2 (2 = ye are their solutions, respectively. What are their long-term 
(k) 


y =y(—y), 


(b) 
y = 3y— 1; 
y=y'(y +3); 
y=y—3; 


y(0) = 3; 
(1+ er) and yp 


(b) 
(f) 
(i) 


y’ = y(1 — y); 
show that ya(x) = 
— 3; 


behaviors? 
=y 


o= y??: ; 
y=yyt+3); 


(a) 
y 


(i) 


36. The direction field of Figure 1.13. 
37. The direction field of Figure 1.14. 


Figure 1.17: Direction field for Problem 40. 
38. The direction field of Figure 1.15. 


(a) 
(e) 


Consider the following list of differential equations, some of which produced the direction fields shown in Figures 1.13 
through 1.18. In each of Problems 36 through 41, identify the differential equation that corresponds to the given direction 


The following problems require utilization of a computer package. 
field. 


34. Show that the initial value problem y’ = 42,/7, y(0) 


35. For the following two initial value problems 
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1.6 Existence and Uniqueness 


An arbitrary differential equation of the first order y’ = f(x,y) does not necessarily have a solution that satisfies it. 
Therefore, the existence of a solution is an important problem for both the theory of differential equations and their 
applications. 

If some phenomenon is modeled by a differential equation, then the equation should have a solution. If it does not, 
then presumably there is something wrong with the mathematical modeling and the simulation needs improvement. 
So, an engineer or a scientist would like to know whether a differential equation has a solution before investing time, 
effort, and computer applications in a vain attempt to solve it. An application of a software package may fail to 
provide a solution to a given differential equation, but this doesn’t mean that the differential equation doesn’t have 
a solution. 

Whenever an initial value problem has been formulated, there are three questions that could be asked before 
finding a solution: 


1. Does a solution of the differential equation satisfying the given conditions exist? 


2. If one solution satisfying the given conditions exists, can there be a different solution that also satisfies the 
conditions? 


3. What is the reason to determine whether an initial value problem has a unique solution if we won’t be able to 
explicitly determine it? 


A positive answer for the first question is our hunting license to go looking for a solution. In practice, one wishes 
to find the solution of a differential equation satisfying the given conditions to less than a finite number of decimal 
places. For example, if we want to draw the solution, our eyes cannot distinguish two functions which have values 
that differ by less than 1%. Therefore, for printing applications, the knowledge of three significant figures in the 
solution is admissible accuracy. This may be done, for instance, with the aid of available software packages. 

In general, existence or uniqueness of an initial value problem cannot be guaranteed. For example, the initial 
value problem y! = y?, x < 1, y(1) = —1 has a solution y(x) = —a~1+, which does not exist for x = 0. On the other 
hand, Example 1.4.2 on page 9 shows that the initial value problem may have two (or more) solutions. 

For most of the differential equations in this book, there are unique solutions that satisfy certain prescribed 
conditions. However, let us consider the differential equation 


ry’ — 5y = 0, 


which arose in a certain problem. Suppose a scientist has drawn an experimental curve as shown on the left side of 
Fig.1.19 (page 22). 

The general solution of the given differential equation is y = C x° with an arbitrary constant C. From the initial 
condition y(1) = 2, it follows that C = 2 and y = 2°. Thus, the theoretical and experimental graphs agreed for 
x > 0, but disagree for x < 0. 

If the scientist had erroneously assumed that a unique solution exists, s/he may decide that the mathematics 
was wrong. However, since the differential equation has a singular point x = 0, its general solution contains two 
arbitrary constants, A and B, one for domain x > 0 and another one for x < 0. So 


Therefore, the experimental graph corresponds to the case A = 2 and B = 0. 

Now suppose that for the same differential equation xy’ = 5y we have the initial condition at the origin: y(0) = 0. 
Then any function y = Cz° satisfies it for arbitrary C and we have infinitely many solutions to the given initial 
value problem (IVP). On the other hand, if we want to solve the given equation with the initial condition y(0) = 1, 
we are out of luck. There is no solution to this initial value problem! 

In this section, we discuss two fundamental theorems for first order ordinary differential equations subject to 
initial conditions that prove the existence and the uniqueness of their solutions. These theorems provide sufficient 
conditions for the existence and uniqueness of a solution; that is, if the conditions hold, then uniqueness and/or 
existence are guaranteed. However, the conditions are not necessary conditions at all; there may still be a unique 
solution if these conditions are not met. The following theorem guarantees the uniqueness and existence for linear 
differential equations. 
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Figure 1.19: Experimental curve at the left and modeled solution at the right. 


Theorem 1.1: Let us consider the initial value problem for the linear differential equation 
y +a(x)y = f(a), (1.6.1) 


y(Lo0) = Yo, (1.6.2) 


where a(x) and f(x) are known functions and yo is an arbitrary prescribed initial value. Assume that the 
functions a(x) and f(a) are continuous on an open interval a < x < @ containing the point 29. Then the initial 
value problem (1.6.1), (1.6.2) has a unique solution y = $(z) on the same interval (a, 3). 


In §2.5 we show that if Eq. (1.6.1) has a solution, then it must be given by the following formula: 


oie / evi hee c| :. HOS Sexe a Aa) ar} (1.6.3) 


When u(x) is a nonzero differentiable function on the interval (a, 8), we have from Eq. (2.5.2), page 86, that 


& [u(a)y(a)] = lo) (2), 
x 
Since both u(x) and f(x) are continuous functions, its product p(x) f(x) is integrable, and formula (1.6.3) follows 
from the latter. Hence, from Eq. (1.6.3), the function y(a) exists and is differentiable over the interval (a, 3). By 
substituting the expression for y(a) into Eq. (1.6.1), one can verify that this expression is a solution of Eq. (1.6.1). 
Finally, the initial condition (1.6.2) determines the constant C' uniquely. 

If we choose the lower limit to be zo in all integrals in the expression (1.6.3), then 


1 x zx 
ie) => | [ wisysie)as+u0], ule) ex | a(s)as} 
xo xO 
is the solution of the initial value problem (1.6.1), (1.6.2). o 
In 1886, Giuseppe Peano” gave sufficient conditions that only guarantee the existence of a solution for initial 
value problems (IVPs). 


°Giuseppe Peano (1858-1932) was a famous Italian mathematician who worked at the University of Turin. The existence theorem 
was published in his article [36]. 
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The Peano existence theorem can be viewed as a generalization of the fundamental theorem of calculus, which 
makes the same assertion for the first order equation y’ = f(x). Geometrical intuition suggests that a solution curve, 
if any, of the equation y’ = f(x,y) can be obtained by threading the segments of the direction field. We may also 
imagine that a solution is a trajectory or path of a particle moving under the influence of the force field. Physical 
intuition asserts the existence of such trajectories when that field is continuous. 


Theorem 1.2: [Peano] Suppose that the function f(x, y) is continuous in some rectangle: 


OSM a =O Ret, By = SO el (1.6.4) 

Let 5 
M= : ; h = mi > SE Po 1.6.5 
anax, (sev) = min fa, 7} (1.65) 


Then the initial value problem 
y! = fea); y(xo) = Yo; (1.6.6) 


has a solution in the interval [a9 — h, ao + Al. 


Corollary 1.1: If the continuous function f(x,y) in the domain 2 = {(2,y) : a <a < B, —co < y < cw} 
satisfies the inequality |f(«, y)| < a(x)|y| +b(x), where a(x) and b(x) are positive continuous functions, then the 
solution to the initial value problem (1.6.1), (1.6.2) exists in the interval a < x < £6. a 


In most of today’s presentations, Peano’s theorem is proved with the help of either the Arzela—Ascoli compactness 
principle for function sequences or Banach’s fix-point theorem, which are both beyond the scope of this book. 

In 1890, Peano showed that the solution of the nonlinear differential equation y/ = 3y?/? subject to the initial 
condition y(0) = 0 is not unique. He discovered, and published, a method for solving linear differential equations 
using successive approximations. However!®, Emile Picard!! had independently rediscovered this method and applied 
it to show the existence and uniqueness of solutions to the initial value problems for ordinary differential equations. 
His result, known as Picard’s theorem, imposes a stronger condition” on f(a, y) to prevent the equation y’ = f(z, y) 
from having singular solutions. 


Theorem 1.3: [Picard] Let f(x,y) be a continuous function in a rectangular domain 2 containing the point 
(xo, yo). If f(a, y) satisfies the Lipschitz condition 


[eye — fe, 42) Lea = yo! 


for some positive constant L (called the Lipschitz constant) and any x, yi, and y2 from 2, then the initial value 
problem (1.6.6) has a unique solution in some interval x) — h < « < a +h, where h is defined in Eq. (1.6.5). 


We cannot guarantee that the solution y = $(«) of the initial value problem (1.6.6) exists in the interval 
(xo — a, Xo + a) because the integral curve y = ¢(a) can exist outside of the rectangle 0. For example, if there exists 
x, such that v9 —-a < 41 < a +a and yo +b = (21), then for x > x, (if x1 > xo) the solution (2) cannot be 
defined. 

We definitely know that the solution y = ¢(z) is in the range yo — b < d(x) < yo + b when a -—h<uxcapth 
with h = min{a, b/M} since the slope of the graph of the solution y = (x) is at least —M and at most M. If 
the graph of the solution y = (x) crosses the lines y = yo + b, then the points of intersection with the abscissa are 
xo +b/M. Therefore, the abscissa at the point where the integral curve goes out of the rectangle 2 is less than or 
equal to 7p + b/M and is greater than or equal to ap — b/M. 


10Tn 1838, Joseph Liouville first used the method of successive approximations in a special case. 

11Charles Emile Picard (1856-1941) was one of the greatest French mathematicians of the nineteenth century. In 1899, Picard lectured 
at Clark University in Worcester, Massachusetts. Picard and his wife had three children, a daughter and two sons, who were all killed 
in World War I. 

121t is called the Lipschitz condition in honor of the German mathematician Rudolf Lipschitz (1832-1903), who introduced it in 1876 
when working out existence proofs for ordinary differential equations. 


24 Chapter 1. Introduction 


yytb 


Figure 1.20: The domain of existence. 


To prove the theorem, we transform the initial value problem (1.6.6) into an integral equation. After integrating 
both sides of Eq. (1.6.6) from the initial point zo to an arbitrary value of x, we obtain 


ya) =w+ f fls.y(s))ds (1.6.7) 


Since the last equation contains an integral of the unknown function y(z) it is called an integral equation. More 
precisely, the equation of the form (1.6.7) is called a Volterra integral equation of the second kind. This integral 
equation is equivalent to the initial value problem (1.6.6) in the sense that any solution of one is also a solution of 
the other. 

We prove the existence and uniqueness of Eq. (1.6.7) using Picard’s iteration method or the method of 
successive approximations. We start by choosing an initial function ¢o, either arbitrarily or to approximate 
solution of Eq. (1.6.6) in some way. The simplest choice is 


bo(x) = yo- 


Then this (constant) function satisfies the initial condition y(ao) = yo. The next approximation ¢, is obtained by 
substituting ¢9 for y(s) in the right side of Eq. (1.6.7), namely, 


x(t) =vot f fls.¢0)ds=yo+ f f(s.40) ds. 
zo xo 
Let us again substitute the first order approximation in the right-hand side of Eq. (1.6.7) to obtain 
dale) =w+ | f(s.0u(9))as. 
xO 


Each successive substitution into Eq. (1.6.7) results in a sequence of functions. In general, if the n-th approximation 
¢n(s) has been obtained in this way, then the (n + 1)-th approximation is taken to be the result of substituting ¢, 
in the right-hand side of Eq. (1.6.7). Therefore, 


don4i1(@) = yo + s f(s, bn(s)) ds. (1.6.8) 


Jo 


All terms of the sequence {¢,,(x)} exist because 


< max | f(x, y)| = M|z —a0| <b 


[teas 
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for |x — xo| < 6/M, where M = max,z,y)eq |f (x, y)|- 

The method of successive approximations gives a solution of Eq. (1.6.7) if and only if the successive approxima- 
tions dn41 from Eq. (1.6.8) approach uniformly to a certain limit as n > oo. Then the sequence {¢,,(x)} converges 
to a true solution y = ¢(2) as n + co, which in fact is the unique solution of Eq. (1.6.7): 


y = $(x) = lim ¢n(2). (1.6.9) 


We can identify each element ¢,,(x) on the right-hand side of Eq. (1.6.9), 


bn(x) = bo + [¢1(x) — Go] + [b2(x) — b1(x)] + +++ + [On (%) — Gn-1(2)], 


as the n-th partial sum of the telescopic series 
= $0 + > [en() — bn—1(2))- (1.6.10) 


The convergence of the sequence {¢,,(x)} is established by showing that the series (1.6.10) converges. To do this, 
we estimate the magnitude of the general term |¢n(2) — dn—1(2)). 
We start with the first iteration: 


|o1(x) — do| = 


“Slewod j<o f ds = M|a — zo. 
xo 


For the second term we have 


Ido() oi< fits ,41(8)) — f(s, 0(s))| ds < Lf M(s~ a0 as = MEG to)" 


where L is the Lipschitz constant for the function f(x,y), that is, | f(x,y) — f(x, y2)| < Lily: — y2|. For the n-th 
term, we have 


lOn (x) =] én—1(#)| 


IN 


J \F(6-4-1(6)) = F(6,¢n-2(8))| ds 
ro 
7 ML"-*(s —a)""! 
< L |dn—1(8) _ on— 2( s)| \| ds < Lf ane a ds 
xo (n 7 a) 
ML"-1 _ n ML"- pn 
2 i ee eet 
nN. nN. 


Substituting these results into the finite sum 


n(x) = o + i bx (x) — de—1(2)], 


we obtain 
ML |x — xo|? ML"""|a2 — axo|” 
lon — bo] < Mp — ao + MER Sel yy ME ea ol! 
M L? |x — xo|? L”"\x — xo|" 
= — Popa ili. OU a dage 1 O 
L 2 ni 


When n approaches infinity, the sum 


L?\x2 — xo|? L" |x — ao|" 
PE 6 oo, 


Lla — 
Je — aol + 2 n! 


approaches e/!*—*ol — 1. Thus, we have 


M 


len(z) — vol < = [e271 — 3] 
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for any n. Therefore, by the Weierstrass M-test, the series (1.6.10) converges absolutely and uniformly on the interval 
|z —2ao| < h. It follows that the limit function (1.6.9) of the sequence (1.6.8) is a continuous function on the interval 
|x — xo| < h. Sometimes a sequence of continuous functions converges to a limit function that is discontinuous, as 
Exercise 26 on page 35 shows. This may happen only when the sequence of functions converges pointwise, but not 
uniformly. 


Next we will prove that $(z) is a solution of the initial value problem (1.6.6). First of all, ¢(a) satisfies the initial 
condition. In fact, from (1.6.8), 
on(xo) = Yo, n=0,1,2,... 
and taking limits of both sides as n > ov, we find ¢(29) = yo. Since (x) is represented by a uniformly convergent 
series (1.6.10), it is a continuous function on the interval 79 —h << @ < ao +h. 
Allowing n to approach oo on both sides of Eq. (1.6.8), we get 


a(e) = 0+ Jim, f f(s, 9n(s)) ds. (1.6.11) 


Recall that the function f(x,y) satisfies the Lipschitz condition for |s — xo| < h: 


f(s, bn(s)) — f(s, 6(s))] < Llon(s) — (9). 


Since the sequence ¢,(s) converges uniformly to $(s) on the interval |s — x9| < h, it follows that the sequence 
f(8,¢n(s)) also converges uniformly to f(s, ¢(s)) on this interval. Therefore, we can interchange integration with 
the limiting operation on the right-hand side of Eq. (1.6.11) to obtain 


(2) =w + flim F(s.bn(s))ds=y+ fo F(s,08)) As. 


Thus, the limit function ¢(z) is a solution of the integral equation (1.6.7) and, consequently, a solution of the initial 
value problem (1.6.6). In general, taking the limit under the sign of integration is not permissible, as Exercise 27 
(page 35) shows. However, it is true for a uniform convergent sequence. Differentiating both sides of the last equality 
with respect to x and noting that the right-hand side is a differentiable function of the upper limit, we find 


(x) = f(x, O(@)). 


This completes the proof that the limit function ¢(z) is a solution of the initial value problem (1.6.6). | 
Uniqueness. Finally we will prove that ¢(x) is the only solution of the initial value problem (1.6.6). To start, 
we assume the existence of another solution y = w(x). Then 


for |x — ao| < h. Setting U(x) = |d(ax) — (x), we have 


x 


Ue) < / " fG,40) = FO, 9O)ide <T / U(s) ds. 


0 zrO 
By differentiating both sides with respect to x, we obtain U’(a) — LU(x) < 0. Multiplying by the integrating factor 
e~/* reduces the latter inequality to the following one: 


[e** U(z) i <0. 


The function e~/* U(x) has a nonpositive derivative and therefore does not increase with x. After integrating from 
xo to x (> Xo), we obtain 
e -* U(x) <0 


since U(0) = 0. The absolute value of any number is positive; hence, U(#) > 0. Thus, U(x) = 0 for > a. The 
case x < Xo can be treated in a similar way. Therefore ¢(x) = (2). 
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Corollary 1.2: If the functions f(x,y) and Of /Oy are continuous in a rectangle (1.6.4), then the initial value 
problem y’ = f(x,y), y(%o) = yo has a unique solution in the interval |a — xo| < h, where h is defined in 
Eq. (1.6.5) and the Lipschitz constant is L = max |Of (x, y)/Oy|. 


Corollary 1.3: If the functions f(x,y) and Of /0x are continuous in a neighborhood of the point (a9, yo) and 
f (xo, yo) #0, then the initial value problem y’ = f(x,y), y(xo) = yo has a unique solution. 


The above statements follow from the mean value relation 


f(x, y1) — f(a, ye) = fy(x, €)(y1 — y2), 


where € € [yi, y2] and fy = Of /Oy. The proof of Corollary 1.3 is based on conversion of the original problem to its 
reciprocal counterpart: Ox/Oy = 1/f(a, y). | 

The proof of Picard’s theorem is an example of a constructive proof that includes an iterative procedure and 
an error estimate. When iteration stops at some step, it gives an approximation to the actual solution. With 
the availability of a computer algebra system, such an approximation can be exactly found for many analytically 
defined slope functions. The disadvantage of Picard’s method is that it only provides a solution locally—in a small 
neighborhood of the initial point. 

Usually, the solution to the IVP exists in a wider region (see, for instance, Example 1.6.3) than Picard’s theorem 
guarantees. Once the solution y = ¢(z) of the given initial value problem is obtained, we can consider another initial 
condition at the point « = x) + Aw and set yo = 6(ap + Az). Application of Picard’s theorem to this IVP may 
allow us to extend the solution to a larger domain. By continuing in such a way, we could extend the solution of 
the original problem to a bigger domain until we reach the boundary (this domain could be unbounded; in this case 
we define the function in the interval x9 — h < x < co). Similarly, we can extend the solution to the left end of 
the initial interval zo —h < « < a +h. Therefore, we may obtain some open interval p < x < q (which could be 
unbounded) on which the given IVP has a unique solution. Such an approach is hard to call constrictive. 

The ideal existence theorem would assure the existence of a solution in a longest possible interval, called the 
validity interval. It turns out that another method is known (see, for example, {24]) to extend the existence theorem 
that furnishes a solution in a validity interval. It was invented in 1768 by Leonhard Euler (see §3.2). However, the 
systematic method was developed by the famous French mathematician Augustin-Louis Cauchy (1789-1857) between 
the years 1820 and 1830. Later, in 1876, Rudolf Lipschitz substantially improved it. The Cauchy—Lipschitz method 
is based on the following fundamental inequality. This inequality can be used not only to prove the existence theorem 
by linear approximations, but also to find estimates produced by numerical methods that are discussed in Chapter 3. 


Theorem 1.4: [Fundamental Inequality] Let f(x,y) be a continuous function in the rectangle |a, b] x [c,d] and 
satisfying the Lipschitz condition 


[een — f(a ,2i5))|| = Eley 7 


for some positive constant L and all pairs y1, y2 uniformly in x. Let y;(x) and y2(a) be two continuous piecewise 
differentiable functions satisfying the inequalities 


yi (x) — f(x, y1(@))| < 1, |y2(x) — f(x, y2(x))| < €2 
with some positive constants €1,9. If, in addition, these functions differ by a small amount 6 > 0 at some point: 
ly1(o) — ya(2o)| < 4, 
then 
He) nell eee a (ehe-eol — 1). (1.6.12) 
The Picard theorem can be extended for non-rectangular domains, as the following theorem’® states. 


13Tn honor of Sergey Mikhailovich Lozinskii/Lozinsky (1914-1985), a famous Russian mathematician who made an important contri- 
bution to the error estimation methods for various types of approximate solutions of ordinary differential equations. 
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Theorem 1.5: [Lozinsky] Let f(x,y) be a continuous function in some domain 2 and M(x) be a nonnegative 
continuous function on some finite interval I (a < x < 2) inside 2. Let |f(x,y)| < M(x) for « € I and 
(a, y) € Q. Suppose that the closed finite domain Q, defined by inequalities 


zx 
my Qe< 2X1, ly—wl < | M(u) du, 
x0 


is a subset of 2 and there exists a nonnegative integrable function k(x), x € I, such that 


[Fhe en) — F(a, 84)| < k(x) ly2 — yl; LoQe<L AX, (x, yo), (@, yr) © Q. 


Then formula (1.6.8) on page 24 defines the sequence of functions {¢,(a)} that converges to a unique solution of 
the given IVP (1.6.1), (1.6.2) provided that all points (a, ¢,(a)) are included in Q when wp < x < 21. Moreover, 


ly(z) — on(a)] < 4 i M(u) du if k(u) au] (1.6.13) 


Actually, the Picard theorem allows us to determine the accuracy of the n-th approximation: 


|9(z) — or(z)| < 


y] ‘Geea)—feeaees 


i) 


< | If(w, (2) — f(@, bn—1(@)) dal < / 


x 


“DO =Waae ae 


< pte | L|0(x) — on—a(a)| da <.... 


Therefore, 


M (Dla - n+l ML" M (Lh)"+1 
|\d(x) _ on(x)| <— (Lx = xol)"™ < —-— p_rtt each (Lh)yrr 


L n! n! L n! 


(1.6.14) 


which is in agreement with the inequality (1.6.13). 


Example 1.6.1: Find the global maximum of the continuous function f(x,y) = (~+2)? +(y—1)?+1 in the domain 
Q={(2,y): |v+2| <a, |ly—1| <b}. 


Solution. The function f(a, y) has the global minimum at « = —2 and y = 1. This function reaches its maximum 
values in the domain 2 at such points that are situated furthest from the critical point (—2,1). These points are 
vertices of the rectangle 0, namely, (—2+a,1+b). Since the values of the function f(z, y) at these points coincide, 
we have 


max f(x,y) = f(-2+4,1+6)=a@?+0? +1. 
(x, y)EQ 


Example 1.6.2: Show that the function 


_ Jf ylniyl, if y 40, 


is not a Lipschitz function on the interval [—b, b], where b is a positive real number. 

Solution. We prove by contradiction, assuming that f(y) is a Lipschitz function. Let y; and y2 be arbitrary 
points from this interval. Then | f(y1) — f(y2)| < L|y1 — y2| for some constant L. We set y2 = 0 (limy+o yln|y| = 0), 
making |f(yi)| = ly: In|yi|| < Lly1| or |In|yi|| < L for all y; € [0,6], which is impossible. Indeed, for small values 
of an argument y, the function Iny is unbounded. Thus, the function f(y) does not satisfy the Lipschitz condition. 


Example 1.6.3: Let us consider the initial value problem for the Riccati equation 
y =a? ty’, y(0)=0 
— gone which does 


a2 + b2 
not exceed 1. According to Picard’s theorem, the solution of the given IVP exists (and could be found by successive 


in the rectangle {(2, y):|z| <a, |y| < b}. The maximum value of h is max min {a 
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approximations) within the interval |x| < h < 1, but cannot be extended on whole line. The best we can get from 
the theorem is the existence of the solution on the interval |z| < a = .7071067810. 
This result can be improved with the aid of the Lozinsky theorem (on page 28). We consider the domain Q 


defined by inequalities (containing positive numbers x; and A to be determined) 


0O<a<gn, 0<|y|< Aa. 
Then M(x) = max(z,yeq (2? + y”) = 2? + A?x®, 0 < x < 21, and the set Q is defined by inequalities 
3 7 
O<a<n, 0<Wl< pts. 
In order to have Q C Q, the following inequality must hold: 
1 c 
age SA, 
3 —_ 
141-441 
7 + pipet 
This is equivalent to the quadratic equation A? a A+$% = 0, which has two roots: A = a Hence, the 
oy 


quadratic equation has two real roots when 1 satisfies enequality 7; < ¢ 2 ~ 1.513700052. Therefore, Lozinsky’s 


theorem gives a larger interval of existence (more than double) than Picard’s theorem. 
As shown in Example 2.6.10, page 103, this Riccati equation y’ = x? + y? has the general solution of the ratio 
—u’/(2u), where wu is a linear combination of Bessel functions (see §4.9): 


1 Siyq(a?/2) + C¥f,,(0?/2) 


ue) = FF? FA @/2) 4 OV y(e2/2) 


A constant C’ should be chosen to satisfy the initial condition y(0) = 0 (see Eq. (2.7.1) on page 114 for the explicit 
expression). As seen from this formula, the denominator has zeroes, the smallest of them is approximately 2.003, so 
the solution has the asymptote x = h, where h < 2.003. 

The situation changes when we consider another initial value problem (IVP): 


y=x?—-y, y(0)=0. 


The slope function x? — y? attains its maximum in a domain containing lines y = +2: 
Pp y y 


2 +6925 72 
2 2 a, ifa- zy’, 
max |a* — y*| = 


y*, ify? >a’. 
For the function x? — y? in the rectangle || < a, |y| < b, the Picard theorem guarantees the existence of a unique 


solution within the interval |a| < h, where h = min < a, \ <1. To extend the interval of existence, we 


max {a?, b?} 
apply the Lozinsky theorem. 

First, we consider the function x? — y? in the domain Q bounded by inequalities 0 < x < -, 
where A and p are some positive constants, and xj, will be determined shortly. Then 


2 7 2 2 
ee = 38 x, if ° 2 (Aa?)’, 
go < M(x) = max |x — = 

OS = ae eee eee 


and |y| < Ag?, 


Now we define the domain Q by inequalities 0 < x < x*, |y| < f° M(u) du, where 


[ Meau= | 


_ 1 ys) 1 gatsy@-1) 4 AY ppt 
3 2p+1 2p+1 , 


Al/(e-1) x 
uw? du+ | A2u?? du 
Al/(p-1) 


In order to guarantee inclusion Q C Q, the following inequality should hold: ih M(u)du < Ax?. It is valid in the 
interval € < x < xj, where «7, is the root of the equation i, M(u) du = Az? and ¢€ is a small number. When A — +0 
and p + 1+ 0, the root, x), could be made arbitrarily large. For instance, when A = 0.001 and p = 1.001, the root 
is xj, © 54.69. Therefore, the given IVP has a solution on the whole line —oo < & < oo. 
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Example 1.6.4: (Example 1.4.2 revisited) Let us reconsider the initial value problem (page 9): 


dy = 2yi/? 

da ; 

Peano’s theorem (page 23) guarantees the existence of the initial value problem since the slope function f(x,y) = 

2y'/? is a continuous function. The critical point y = 0 is obviously a solution of the initial value problem. We 

show that f(x,y) = 2y'/? is not a Lipschitz function by assuming the opposite. That is to say suppose there exists 
a positive constant D such that 


y(0) = 0. 


lyr? — yo!" | < Lys — yal. 
Setting y2 = 0, we have 
ni71<Llyil or 1 < Lyi. 
The last inequality does not hold for small y;; therefore, f(y) = 2y'/? is not a Lipschitz function. In this case, we 
cannot apply Picard’s theorem (page 23), and the given initial value problem may have multiple solutions. According 


to Theorem panes (page 48), since the integra 
[ 2 
0 VV 


converges, the given initial value problem doesn’t have a unique solution. Indeed, for arbitrary ro > 0, the function 


ee 0, for —o <2 <4, 
Y= PMI) (¢ —29)?, for «> xo, 


is a singular solution of the given initial value problem. Note that y = 0 is the envelope of the one-parameter family 
of curves, y = (a — C)?, x > C, corresponding to the general solution. 


Example 1.6.5: Consider the autonomous equation 
y' = lyl- 


The slope function f(x,y) = |y| is not differentiable at x = 0, but it is a Lipschitz function, with L = 1. According 
to Picard’s theorem, the initial value problem with the initial condition y(0) = yo has a unique solution: 


Yo e*, for Yo > 0, 
y(x) = 0, for Yo = 0, 
yoe*, for yo <0. 


Since an exponential function is always positive, the integral curves never meet or cross the equilibrium solution 
y=0. 


Example 1.6.6: Does the initial value problem 
— =14+=(y—2)¥3, y(0) = 0, 


have a singular solution? Find all solutions of this differential equation. 
Solution. Changing the dependent variable to y — x = u, we find that the differential equation with respect to 
u is 
ul = 3u4/3/2. 
The derivative of its slope function f(z,u) = f(u) = 2ul/3 is f’(u) = 1/2u7?/%, which is unbounded at u = 0. 
In this case, Picard’s theorem (page 23) is not valid and the differential equation u’ = Zul/ 3 may have a singular 
solution. Since the equation for u is autonomous, we apply Theorem 2.2 (page 48). The integral 


“du 3 [ 1/3 2/3)u 2/3 
wm=s U du =u*!"|i_9 =u 
o fu) 2 Jo fimo 


converges. Hence, there exists another solution besides the general one, y = (a + C)°/*, where 7 = +1 and C is 
an arbitrary constant. Notice that the general solution of this nonlinear differential equation depends on more than 
the single constant of integration; it also depends on the discrete parameter 7. Returning to the variable y, we get 
the singular solution y = x (which corresponds to u = 0) and the general solution y = x + n(a — C)3/?, « > C. 


3/2 
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Example 1.6.7: Let us find the solution of the problem 
y=y’, y(0)=1, 


by the method of successive approximations. We choose the first approximation ¢9 = 1 according to the initial 
condition y(0) = 1. Then, from formula (1.6.8), we find 


dnaa(e) =14 [ 6(s)as (n = 0,1,2,...). 


Hence, we have 


l| 
= 
| 


oi (x) 
& 3 

do(x) = i+ f (l+s)?)ds=l+a+a?+—; 
0 


3 
x 3\ 2 
d3(z) = 1+ f (1+s+2+5) ds 
0 
7 


2 1 1 
- l+ata?+a+sa too t+ oa", 


+ 23 


and so on. The limit function is 


To check our calculations, we can use either Mathematica: 


Clear [phi] 
T[phi_] := Function[x, 1 + Integrate[phi[t]~2, {t, 0, x}]]; 
f[x_] = 1; (* specify the initial function *) 


Nest(T, £, 5] [x] (* Find the result of 5th iterations *) 


or Maple: 
yO:=1: T(phi,x) :=(phi,x)->y0+eval (int (phi(t)°2,t=0..x)): 
y:=array(O..n): Y:=array(0..n): 
y lO] :=x->y0: 
for i from 1 to n do 
y Ci] :=unapply (T(y[i-1] ,x),x): 
Y [i] :=plot(y[i] (x) ,x=0..1): 
od: 
display ([seq(Y[i],i=1..n)]); 
seq(eval(y[i]),i=1..n); 


Example 1.6.8: Using the Picard method, find the solution of the initial value problem 
y =art+2y, y(0)=1. 


Solution. From the recursion relation 
x 2 xr 
On4i(x) = 1+ f (t +2 ¢,(t))dt=1+4+ = +2 Z gn(t) dt, n=0,1,2,..., 
0 0 


where ¢9(x) = 1 is the initial function, we obtain 


x 2 
oi(z) = 1 fe 2)dt=1+ 22+, 

x 3 
da(z) = 1 fe 2di(t))dé= 14+ 2e+ Fa? +5, 

G 4 
o3(z) = 1 [« 2do(i))dt= 1420+ sa? +208 +o, 
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Picard iteration 


Figure 1.21: First four Picard approximations, plotted with MATLAB. 


and so on. 
To check calculations, we compare these approximations with the exact solution 


1 @« 5 », 
ui) = —g-gtae 

_ 1 a5 14% Awe Bat at wt 

yo a 31 rT 5! 
1 @2@ 5 5 5 58 5 16 5 32 

= we ee tH, 2 = Aq? LO ae En A a 5 
1 oo asa ae ae ed 

5 5 
— 1+2%e+5a°+5a°+-a'+ea° + 


Maple can also be used: 

restart: picard:=proc(f,x0,y0,n) #n is the number of iterations 
local s,y; y:=y0; # yO is the initial value at x=x0 
for s from 1 to n do 

y:=y0+int (f(a, subs (x=a,y)) ,a=x0..x); 

y:=sort(y,x,ascending) ; 

od; return(y) ; end 


Example 1.6.9: Find successive approximations to the initial value problem 
y=z—y’, y(0)=0 


in the square |x| < 1, |y| < 1. On what interval does Picard’s theorem guarantee the convergence of successive 
approximations? Determine the error in the third approximation. 

Solution. The slope function f(x,y) = x—y® has continuous partial derivatives; therefore, this function satisfies 
the conditions of Theorem 1.2 with the Lipschitz constant 


Calculations show that 


b ij. 7 
M= = —y|=2, h=mi — +} =min41;=} =-. 
met lf(z.y)| mt jz — y"| = 2, min fay ah min { 3} 5 


The successive approximations converge at least on the interval [—1/2,1/2]. From Eq. (1.6.8), we get 


x 2 x 
bun = f (-oar=>— f #(t)dt, n=0,1,2,...,  do(e) =O. 
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For n = 0,1,2, we have 


a [ @-oa=5, 


x 8 a at 
= t—_—) dt=— 
ba(x) i ( =) 2 56 
x 2 47 3 2 Pid wi2 Ry 17 22 
= a Nee 
3(2) | (5 =) | 2 56 896 106,624 ' 3,963,552 


Their graphs along with the exact solution are presented in Fig. 1.21. The absolute value of the error of the third 
approximation can be estimated as follows: 

Mis .4 O23? 1 9 

ly(x) — b3(a)| < Y= —— + 5g = Fg = (0-5625. 

Of course, the estimate could be improved if the exact solution were known. We plot the four Picard approximations 
along with the solution using the following MATLAB commands (which call for quad1, adaptive Gauss—Lobatto rules 
for numerical integral evaluation): 

t = linspace(time(1),time(2) ,npts); % Create a discrete time grid 


y = feval(@init,t,y0); % Initialize y = yO 

window = [time,space]; 

for n = 1:N % Perform N Picard iterations 
[t,y] = picard(@f,t,y,npts); % invoke picard.m 


plot(t,y,’b--’,’LineWidth’,1); % Plot the nth iterant 
axis(window); drawnow; hold on; 


end 

[t,y] = ode45(@f,time,y0); % Solve numerically the ODE 
plot(t,y,’k’,’LineWidth’ ,2); % Plot the numerical solution 
hold off; 


axis([min(t) max(t) min(y) max(y)]) 
function [t,y] = picard(f,t,y,n) % picard.m 


tol = 1e-6; % Set tolerance 
phi = y(1)*ones(size(t)); % Initialize 
for j=2:n 
phi(j) = phi(j-1)+quadl(@fint,t(j-1),t(j),tol, 0],f,t,y); 
end y = phi; 


Example 1.6.10: Let 
0, ifa <0, -wo<y<o; 
Hei< = if0<a<l, -w<y<QO0; 
: a=), if0<2<1,0<y<2’; 
—a2, if0<a2<l,a2?<y<o. 


Prove that the Picard iterations, 9, ¢1, ..., for the solution of the initial value problem y’ = f(x,y), y(0) = 0, do 
not converge on [0,¢] for any0<e <1. 

Solution. It is not hard to verify that the function f(x,y) is continuous and bounded in the domain 2 = {0 < 
x <1, -—co < y< oo}. Moreover, |f(x,y)| < 1. Hence, the conditions of Theorem 1.2 are satisfied and the initial 
value problem y’ = f(x,y), y(0) = 0 has a solution. Let us find Picard’s approximations for 0 < x < 1, starting with 


oo = 0: 


dt = | fe0d=; 


x x 2 
2 svolyar= f° ¢(4F) arso: 


x x 2 
do = f soanar= f° se.oae= 4: 
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and so on. Hence, dan(x) = 0 and dam4i(v) = -. m =0,1,.... Thus, the sequence ¢,,(a) has two accumulation 
points (0 and x?/2) for every x # 0. Therefore, Picard’s approximation does not converge. This example shows that 
continuity of the function f(x,y) is not enough to guarantee the convergence of Picard’s approximations. 


There are times when the slope function f(z, y) in Eq. (1.6.6), page 23, is piecewise continuous. We will see such 
functions in Example 2.5.4 and Problems 5 and 13 (§2.5). When abrupt changes occur in mechanical and electrical 
applications, the corresponding mathematical models lead to differential equations with intermittent slope functions. 
So, solutions to differential equations with discontinuous forcing functions may exist; however, the conditions of 
Peano’s theorem are not valid for them. Such examples serve as reminders that the existence theorem 1.2 provides 
only sufficient conditions needed to guarantee a solution to the first order differential equation. 

Computer-drawn pictures can sometimes make uniqueness misleading. Human eyes cannot distinguish drawings 
that differ within 1% to 5%. For example, solutions of the equation 2’ = x? — ¢? in Fig. 1.11(b) on page 16 and 
Fig. 2.32(b) on page 111 appear to merge; however, they are only getting very close. 


2 
i, Shaw that luca S { Ce aes) 


Ge. tres 0 is the general solution of the differential equation zy’ — 2y = 0. 
Qu", , 
2. Prove that no solution of x? y' — 2a? y = 4 can be continuous at x = 0. 


3. Show that the functions y = 0 and y = x«*/16 both satisfy the differential equation y’ = x yl! ? and the initial conditions 
y(0) = 0. Do the conditions of Theorem 1.2 (page 23) hold? 


4. Show that the hypotheses of Theorem 1.3 (page 23) do not hold in a neighborhood of the line y = 1 for the differential 
equation y’ = |y — 1|. Nevertheless, the initial value problem y’ = |y — 1|, y(1) = 1 has a unique solution; find it. 


5. Does the initial value problem y’ = \/|y|, x > 0, y(0) = 0 have a unique solution? Does f(y) = ./ly| satisfy the 
Lipschitz condition? 


6. Show that the function f(a,y) = y? cosa + e?* is a Lipschitz function in the domain 0 = {(a,y) : |y| < 6} and find 
the least Lipschitz constant. 


7. Show that the function f(a, y) = (2+sin x)-y?/? —cos x is not a Lipschitz function in the domain Q = { (x,y) : |y| <b}. 
For what values of a and £ is this function a Lipschitz function in the domain Q; = { (#,y):0<a<y< fh}? 


8. Could the Riccati equation y’ = a(x) y? + b(x)y + c(a), where a(x), b(x), and c(a) are continuous functions for 
x € (—oo, 00), have a singular solution? 


9. Find the global maximum of the continuous function f(x,y) = 2? + y? + 2(2—2—y) in the domain 2 = {(z,y) : 
|x —1| <a, |y—1] < Bf. 


10. Consider the initial value problem y’ = —2x + 2(y + #?)'/?, y(—1) =-1. 
(a) Find the general solution of the given differential equation. Hint: use the substitution y(#) = —x? + u(z). 
(b) Derive a particular solution from the general solution that satisfies the initial value. 


(c) Show that y = 2Cx + C?, where C is an arbitrary positive constant, satisfies the differential equation y’ + 2a = 
2,/x? + y in the domain « > —C/2. 


(d) Verify that both yi(z) = 2% +1 and y2(x) = —a? are solutions of the given initial value problem. 
(e) Show that the function y2(a) = —2? is a singular solution. 


f) Explain why the existence of three solutions to the given initial value problem does not contradict the uniqueness 
y & 
part of Theorem 1.3 (page 23). 


11. Determine a region of the ry-plane for which the given differential equation would have a unique solution whose graph 
passes through a given point in the region. 


(a) (y-y)y’ =2, y(1) =2; (b) x =y?/, y(0) =1; 
(c) (y—a)y’ =y+2’, y(1) =2; (d) y'=Very, y(0) =1; 
(e) (#2? +y)y'=siny, y(m/2)=0; (f) zy’ =y, y(2) =3; 
(g) (1+y*)y’ =a, y(1) =0; (bh) y’ —y=27,y(2) =1. 


12. Solve each of the following initial value problems and determine the domain of the solution. 


(1+sin x) y’ + cot(x) y =cosz, (1 —23)y! — 8a? y = 42°, 


yan) = 1. OP Ay St. 
(c) (sinx)y’ + cosry = cota, (d) xy +ay//1— 2 =0, 
y(3n/4) = 2. v(V3/2) = 1. 


(xsinx)y’ + (sina + xcosx)y =e", 


(2) ¥(-0.5) =0. (f) y= 2axy?, (0) = 1/k?. 


1.6. 


13. 


14. 
15. 


16. 


17. 


18. 


19. 


20. 


2. 
22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 
30. 
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Determine (without solving the problem) an interval in which the solution of the given initial value problem is certain 
to exist. 


(a) (Int)y’+ty=In*t, y(2)=1;  (b) 2(@—2)y'+(a—-lyta*y=0, a(1) =2; 

(c) yi +(cotz)y=x, y(n/2)=9; (d) (—-9)y'+ty=t*, — y(-1)=2; 

(ec) @-9ytty=t, y4)=2 (f) (@-ly'+(sinz)y=2°, 2(2)=1. 
Prove that two distinct solutions of a first order linear differential equation cannot intersect. 


For each of the following differential equations, find a singular solution and the general solution. 
a2 ¢ 
(a) y= 2e43Qy— 27)"; (b) w= tS =a) (C) yo =a? - Va —3y; 


(dd) y=, (ec) y = JS/uy—D(et2); (f) y! =2/2e+y — 2; 
(g) y= Vy —1; (h) yf = Ye. @ v= /7e>: 
G) ¥=W-DWy; (k)  (y')?+ay' = y; Q) yy =(1—-y?)”. 


Compute the first two Picard iterations for the following initial-value problems. 
(a) y’=1—-(l+a)yt+y’, yO)=1; (b) y=ar-y’, yO)=1 
(ce) y'=1+easiny, y() = 2n; (dq) y=l1+a-y’, y(0)=0. 


Compute the first three Picard iterations for the following initial value problems. On what interval does Picard’s 
theorem guarantee the convergence of successive approximations? Determine the error of the third approximation. 


(a) y’ = ey, y(1) = 15 (b) y =e y?, yO =); 
(c) sy yy y)=1,; (d) y= 3y? + 4x7, y(0) =0; 
(e) sy =y-2y, yA)=2; (f) yi =sing—-y, y(0)=0. 


Compute the first four Picard iterators for the differential equation y’ = x+y” subject to the initial condition y(0) =0 
and then another initial condition y(1) = 2. Estimate the error of the fourth approximation for each. 


Find the general formula for n-th Picard’s approximation, ¢n(x), for the given differential equation subject to the 
specified initial condition. 


(a) yo =8e%*+y, y(1)=0; (b) y’=e**-y, y(0)=1; 
(c) y=at+y, y(0)=1; (d) y= -y?, y(0)=1. 


Let f(x,y) be a continuous function in the domain 2 = { (2, y) : zo <x <aot+e, |y—yo| <b}. Prove the uniqueness 
for the initial value problem y’ = f(x,y), to <e@<aote, O<e<a, y(xo) = yo if f(x, y) does not increase in y for 
each fixed x. 


For which nonnegative values of a does the uniqueness theorem for the differential equation y’ = |y|° fail? 


For the following [VPs, show that there is no solution satisfying the given initial condition. Explain why this lack of 
solution does not contradict Peano’s theorem. 


(a) ay’-y=27,y(0)=1; (b) ay’ =2y—-2%, (0) =1. 


Convert the given initial value problem into an equivalent integral equation and find the solution by Picard iteration: 


y = 6x — 2xy, y(0) = 1. 


Prove that an initial value problem for the differential equation y’ = 2,/|y| has infinitely many solutions. 
Does the equation y’ = y?!/ 4 have a unique solution through every initial point (xo, yo)? Can solution curves ever 
intersect for this differential equation? 


Show that the sequence of continuous functions y,(«) = 2” for 0 < « < 1 converges as x — 0 pointwise to a discontinuous 
function. 


Find a sequence of functions for which limn-.. i fn(x) dx # So limnoo fn(x) da. 


Find successive approximations ¢0(x), ¢1(x), and ¢2(x) of the initial value problem for the Riccati equation y’ = 
1—(1+a)y+y’, y(0) = 1. Estimate the difference between ¢2(x) and the exact solution y(a) on the interval 
[—0.25, 0.25]. 


Does the initial value problem y’ = g 2, y(0) = 1 have a unique solution? 


Consider the initial value problem y’ = /a + Jy, y(0) = 0. Using substitution y = t®v*, « = t*, derive the differential 
equation and the initial conditions for the function v(t). Does this problem have a unique solution? 
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Review Problems 


Review Questions for Chapter 1 


1. 


15. 


16. 


Eliminate the arbitrary constant (denoted by C) to obtain a differential equation. 


(a) y= Se (b) Jy —« —-InCx = 0; (c) e+ Cy? = 1; 

(d)  y+1=Ctang; (ec) Ce*+e¥=1; (f) Ce =s+ a3 

(g) ye? = 1; (h) Ce® =arcsinx +arcsiny; (i) Cy—Ing =0; 

(Gj) Wite2=14+CV/i+y; (k) y=C@- ©)’; ()s-y(w +1) + Ce~* +e” =0. 


In each of the following problems, verify whether or not the given function is a solution of the given differential equation 
and specify the interval or intervals in which it is a solution; C always denotes a constant. 


(a) (y)?-zyt+y=0, 3/y=(2-1)°”?; (b) yy =a, y?—-2?=C; 


(c) yy’ = 2a cos(a*)y, y= Ce; (d) yy’ +y?=0, y(x) =1/z; 
(e) y' +2ay=0, y(«)=Ce*; (f) ay =2y, y=Cx’; 
(g)  y'+2/y=0, y(x) = VCP a; (h) = -y’ =cos?y, tany=a+C. 


The curves of the one-parameter family x? + y? = 3Cxy, where C is a constant, are called folia of Descartes. By 
eliminating C’, show that this family of graphs is an implicit solution to 


dy _ y(y* = 2x*) 
dx x(2y3 — #3)" 


Show that the functions in parametric form satisfy the given differential equation. 

(a) ry =2y, «=t?, y=t*; (b) a’yy’ =b?x, «x =asinht, y= b cosht. 
A particle moves along the abscissa in such a way that its instantaneous velocity is given as a function of time t¢ by 
v(t) =4- 3t?. At time t = 0, it is located at x = 2. Set up an initial value problem describing the motion of the 
particle and determine its position at any time t > 0. 


Determine the values of \ for which the given differential equation has a solution of the form y = e*°. 


(a) oy" —y'—-6y=0; (Bb) oy + 3y" + 3y' + y = 0. 
Determine the values of for which the given differential equation has a solution of the form y = <>. 
(a) Qa7y" — 3xy’+3y=0; (b) Qa7y" + day’ — 2y = 0. 
Find a singular solution for the given differential equation 
(a) oy! = 32/1 —y?; (b) y= 7 Vin Fy- 2; 
(cc) (@ +1)y=VHy-T, (dy =-Vy-4 
Verify that the function y = \/(x? — 2a + 3)/(x? + x — 2) is a solution of the differential equation 2yy’ = (9— 3x?) /(x? + 
x — 2)? on some interval. Give the largest intervals of definition of this solution. 


The sum of the z- and y-intercepts of the tangent line to a curve in the xy-plane is a constant regardless of the point 
of tangency. Find a differential equation for the curve. 


. Derive a differential equation of the family of circles having a tangent y = 0. 


. Derive a differential equation of the family of unit circles having centers on the line y = 2x. 


Show that «* and («*/? +5)? are solutions of the nonlinear differential equation (y’)? — 9xy = 0 on (0,00). Is the sum 
of these functions a solution? 


In each of the following problems, verify whether or not the given function is a solution of the given differential equation 
and specify the interval or intervals in which it is a solution; C' always denotes a constant. 


(a) cy’ =yt+a, y(x)=xlnx+Cz; (b) y’ cosr+ysinvx=1, y=Ccosx+sing; 


(c) y'+ye™=0,C=e"+Iny; (d) y+ 2xy=e™, y=(xt+C)e™; 
(e) zy’+y=yln|zy|, njay|=Cz; (hb) y!=sec(y/x)+y/e, y= xarcsin(Ina); 


/ 
1 
(f) 2 + a = aresina + aresin Y, Ce* = arcsin x + arcsin y; 


1l-y2 vVl-«? 
(g) ty’ +2ty—t+1=0, y=4-1/t+C/t?. 


Show that y’ = y*, where a is a constant such that 0 < a < 1 has the singular solution y = 0 and the general solution 
is y = [(1—a)(a + C)|‘/"-.. What is the limit of the general solution as a — 1? 


Which straight lines through the origin are solutions of the following differential equations? 


pho , , ox + 4y ,  “+3y 
(a) sy iy (b) yxy; (ec) y Te + By (dq) oy eG 
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17. 


18. 


19. 


20. 


21. 
22. 
23. 


24. 


25. 


26. 


27. 


28. 


29. 
30. 


3l. 


32. 


33. 


34. 


35. 
36. 


Show that y = C'x — C®, where C is an arbitrary constant, is an equation for the family of tangent lines for the parabola 
y= a? /4, 

Show that in general it is not possible to write every solution of y’ = f(x) in the form y(x) = / f(t) dt and compare 
this result with the fundamental theorem of calculus. . 


Show that the differential equation y (y’)? + y? = 1 has the general solution family (x + C)? + y? = 1 and also singular 
solutions y = +1. 


The charcoal from a tree killed in the volcanic eruption that formed Crater Lake in Oregon contained 44.5% of carbon-14 
found in living matter. The half-life of C!* is 5730 + 40 years. About how old is Crater Lake? 


Derive a differential equation for uranium with half-life 4.5 billion years. 
With time measured in years, the value of \ in Eq. (1.1.1) for cobalt-60 is about 0.13. Estimate the half-life of cobalt-60. 


The general solution of 1— y” = (yy’)? is (ce —c)? +y = 1, where c is an arbitrary constant. Does there exist a singular 
solution? 


Use a computer graphic routine to display the direction field for each of the following differential equations. Based on 
the slope field, determine the behavior of the solution as rx — +00. If this behavior depends on the initial value at 
x = 0, describe the dependency. 

(a) y=ax+y?, -2<2<2; (b) y' =cos(x+y), -4<a2<4; 

(c) yi =a%4+y?, -3<a<3; (d) y=ay?, -2<2<2. 
Using a software solver, estimate the validity interval of each of the following initial value problems. 

(a2) oy =y?-aeytl, y(0)=1, (b+) yo =y?teyt1, y(0) =1; 

(c) yi =ay?-a*, yO) = (a) y= 1/(e? +y*), yO) = 1. 
Using a software solver, draw a direction field for each of the given differential equations of the first order y’ = f(z, y). 
On the same graph, plot the graph of the curve defined by f(x, y) = 0; this curve is called the nullcline. 

(a) oy’ =y?-2; (b) yi =yt+ayta?s (ec) yay 2’; 

(a) y’=ay/(x?+y); (0) y= ay"; (f) y= Vieyl- 
A body of constant mass m is projected away from the earth in a direction perpendicular to the earth’s surface. Let the 
positive x-axis point away from the center of the earth along the line of motion with x = 0 lying on the earth’s surface. 
Suppose that there is no air resistance, but only the gravitational force acting on the body given by w(x) = —k(a+R)~?, 
where k is a constant and R is the radius of the earth. Derive a differential equation for modeling body’s motion. 


Find the maximum interval for which Picard’s theorem guarantees the existence and uniqueness of the initial value 
problem y’ = (2? + y? +1) ie oP y(0) = 0. 
Find all singular solutions to the differential equation y’ = y?! 3 (y? —1). 


Under what condition on C does the solution y(a) = y(x,C) of the initial value problem y’ = ky(1+ y?), y(0) =C 
exist on the whole interval [0, 1]? 


Determine whether Theorem 1.3 (page 23) implies that the given initial value problem has a unique solution. 
(a) -y’+yt=sin?t,y(m)=1, (b) oy’ = a? +y°, yO) = 1; 


(ce) y' =2/y, y(2) =0; (dq) y' =y/2, y(2) = 0; 
(ec) y=2-Yy-T,y6)=1; (f) y' =siny+cosy, y(r) =0; 
(zg) y =2/%, y(0) =0. (h) yy’ =a Injy|, y(1) = 0. 


Convert the given IVP, y’ = x(y—1), y(0) = 1, into an equivalent integral equation and determine the first four Picard 
iterations. 


Consider the initial value problem y’ = y? + 4sin? 2, y(0) = 0. According to Picard’s theorem, this IVP has a unique 
solution in any rectangle D = {(x,y) : |x| < a, |y| < b/M}, where M is the maximum value of | f(z, y)|. Show that the 


b 
unique solution exists at least on the interval [—h, h], h = min {a awl \ 


Use the existence and uniqueness theorem to prove that y = 2 is the only solution to the IVP y’ = 


4), (0) = 2. 
Prove that the differential equation y’ = xz — 1/y has a unique solution on (0, 00). 


Determine (without solving the problem) an interval in which the solution of the given initial value problem is certain 
to exist. 
(a) oy’ + pSry=z, y(0) =0; (b)  (1+2" 
(c) y+ (tant)y=t?, y(m/2)=9; (d) (a? + 
(ce) ty +? y=t*, ym) =1; (f) (sin? t 
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Review Problems 


Compute the first two Picard iterations for the following initial-value problems. 

(a) oy =a? +y", y(0)=0; (b) yo =(e? +9"), yO) =1; 

(c) yy’ =sin(xy), y(m)=2; (d) y’=e™, y(O)=1. 
Compute the first three Picard iterations for the following initial value problems. On what interval does Picard’s 
theorem guarantee the convergence of successive approximations? Determine the error of the third approximation when 
the given function is defined in the rectangle |x — xo| < a, |y — yo| < b. 


(a) y' = a + zy”, y(0) = 1; (b) wy =o —1, y(1) = 2; 
(c) y = ey’ y(1) =1; (d) yao uy y(0) = 0; 
(e) ry =y, y(1) = 1; (f) y’ = 2y? + 3x”, y(0) = 0; 


(g) zy’ = 2y?— 32, y(0)=0; (bh) yy! =y + cosa, y(m/2) = 0. 
Compute the first four Picard iterations for the given initial value problems. Estimate the error of the fourth approxi- 
mation when the given function is defined in the rectangle |x — xo| < a, |y — yo| < b. 

(a) oy’ = 3a? +ay?+y, y(0)=0; (b) y= y—e* +2, (0) =0; 

(c)  y'+y=2sinz, y(0) = 1; (4) yy =—y— 2t, (0) =1. 
Find the general formula for n-th Picard’s approximation, ¢n(x), for the given differential equation subject to the 
specified initial condition. 

(a) of =y-e*, yO) = 1; (b) oy’ = 2y +a, y(1) = 0; 

(c) y=y-e,y0)=1 GQ y=y’,¥(1)=1. 
Find the formula for the eighth Picard’s approximation, ¢g(x), for the given differential equation subject to the specified 
initial condition. Also find the integral of the given initial value problem and compare its Taylor’s series with ¢g(z). 

(a) yy’ =2y+cosz, y(0)=0; (b) yy’ =sing—y, y(0) = 1; 

(ce) y=a*yt1,y(0)=0; 9 (d) oy’ + 2y = 8a”, y(1) = 1. 
Sometimes the quadratures that are required to carry further the process of successive approximations are difficult 
or impossible. Nevertheless, even the first few approximations are often quite good. For each of the following initial 


value problems, find the second Picard’s approximation and compare it with the exact solution at points x, = k/4, 
k = -1,0,1, 2,3, 4,5, 6,7, 8. 

@ y=277, yO=h ©) v=, 70) =0. 
The accuracy of Picard’s approximations depends on the choice of the initial approximation, ¢o(x). For the following 


problems, calculate the second Picard’s approximation for two given initial approximations, ¢o(x) = 1 and yo(x) = 2, 
and compare it with the exact solution at points x, = k/4, k = —1,0,1,3,4,5, 6, 7,8. 


(—) y=y,¥)=b (b+) yay’, yD=HL 
The Grénwall™ inequality: Let x, g, and h be real-valued continuous functions on a real t-interval I: a < t < b. Let 
h(t) > 0 on I, g(t) be differentiable, and suppose that for t € J, 


Prove that on I : 


a(t) < g(t) + i 


a 


t 
h(r)g(7) exp {/ h(s) as} dr. 
Hint: Differentiate the given inequality and use an integrating factor u(t) = exp {- ic h(r) dr}. 


For a positive constant k > 0, find the general solution of the differential equation y = \/|y| + &. Show that while the 


slope function ,/|y| + & does not satisfy a Lipschitz condition in any region containing y = 0, the initial value problem 
for this equation has a unique solution. 


14Tn honor of the Swedish mathematician Thomas Hakon Grénwall (1877-1932), who proved this inequality in 1919. 


Chapter 2 


f(a, y) is called the equilibrium 


a, 


SF ger 


7 Pe ree ea 


| 7774+ 


™ 


e , or Maxima. 


ee EE NEN «SRO WE z 
Fh ee ek see a) 
Eee eee an 
Ff o 
es as | 
ee ee ~ 
NA A447 eon nr eee ee ee ee » i 
NNN | 744 er ero tooo ee eee @ 
a “> 
a a a <— 
NO A 
Se | Oe ee ee rr er n 
| 
is] 
> 
| 
> 
N 


x 
\ 
\ 
\ 
\ 
\ 
\ 


v 


ee >_> > 


> 
| 
) 
—_ 
aie) 
oS 
=. 
a 
oO 
[ee | 
3 
8 
=| 
o 
5 
& 
ac} 
o 
| 
_ 
a4 
is) 
* 
> 


ae 


= 0 for all x. 


KARR RRR RRR NRK 
PAANRRRRANA A 
ZT NNNNNNNNA 


OT oe 


f(x,y). 


First Order Equations 


Linearization of the equation y 
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In calculus, points where the derivative vanishes are called the critical points. Therefore, sometimes equilibrium 


solutions are called the critical points of the slope function. 
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Definition 2.2: We say that the differential equation 


Definition 2.1: A constant solution y 


This chapter presents some wide classes of first order differential equations that can be solved at least implicitly. 
or stationary solution, if f(z, y*) 


The last section is devoted to the introduction of qualitative analysis. All pictures are generated with one of the 
In this section, we consider a class of differential equations for which solutions can be determined by integration 


(called quadrature). However, before doing this, we introduce the definition that is applicable for arbitrary differential 


equations in normal form y’ 


following software packages: MATLAB®, Mathematica® , Mapl 


2.1 Separable Equations 
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Let us consider the separable equation y’ = p(x)q(y). For values of y from interval I, (which is abbreviated as 
y € I,) where q(y) is not zero, we can divide both sides by q(y) and rewrite the equation as 


ae) =p(x), gy) #0 forye ly. 


When we integrate both sides, the corresponding integrals will be the same up to an additive constant (denoted by 


C): 
lum (st) de = f p(a)de+C. 


If we substitute in the left-hand side integral u = y(x), then du = se dx and 


[apna [r@aete. 


Replacing u again by y, we obtain the general solution in implicit form: 


[+ = J v(e)ae +C, ay) #0, (2.1.2) 


q(y) 


where C is a constant of integration. We separate variables such that the multiplier of dz is a function of only one 
variable x, and the multiplier of dy is a function of another variable y. That is why we call Eq. (2.1.1) a separable 
equation. Each member of the equation q~'(y) dy = p(x) dz is now a differential, namely, the differential of its 
own!’ quadrature. Generally speaking, formula (2.1.2) only gives an implicit general solution of Eq. (2.1.1). An 
explicit solution is obtained by solving Eq. (2.1.2) for y as a function of xz. Note that the equilibrium solutions (if 
any) of the equation y’ = p(x)q(y) which satisfy q(y) = 0 do not appear in Eq. (2.1.2). 


Definition 2.3: When the differential equation is given in an irreducible differential form 


M(a,y) dx + N(a,y) dy = 0, (2.1.3) 


we say that it is separable if each function M(x, y) and N(a,y) is a product of a function of x by a function of 
y: M(a,y) = pi(x) qi (y) and N(a,y) = po(x) q2(y). Hence, a separable differential equation is an equation of 
the form 

pi(x)qi(y) dx + p2(x)q2(y) dy = 0. (2.1.4) 


A differential equation may have either horizontal or vertical constant solutions. Their determination becomes 
easier when a differential equation is written in the differential form!® Mdx + Ndy = 0, with some continuous 
functions M(a, y) and N(x, y). It is assumed that functions p1(x), po(x), and qi(y), go(y) are defined and continuous 
on some intervals J; and I, respectively, where neither p;, pz nor q1, gz are zeroes. 

The equation p;(x)qi(y) dx + p2(%)q2(y) dy = 0 can be solved in a similar way: 


pi(2) oe qa(y) = qa(y) __ pi(2) 98 
Eo) dx AG) dy =0 => / a) dy [2s dz+C. (2.1.5) 


It is assumed that there exist intervals I, and I, such that the functions po(x) and qi(y) are not zero for all 
values of x € I, and y € I,, respectively. Therefore, solutions to a separable differential equation (2.1.4) reside 
inside some rectangular domain bounded left-to-right by consecutive vertical lines where p2(a%) = 0 (or by too) and 
top-to-bottom by consecutive horizontal lines where qi(y) = 0 (or by +00). 


Definition 2.4: A point (ao, yo) on the (2, y)-plane is called a singular point of the differential equation 


(2.1.3), written in an irreducible differential form, if M(xo, yo) = 0 and N(ao, yo) = 0. 


15The word quadrature means an expression via an indefinite integral. 
16Modeling physical problems, mechanical ones in particular, usually leads to differential equations in the form (2.1.3) rather than 
(2.1.1). Because of that, these equations are historically called “differential equations” rather than “derivative equations.” 
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If M(a,y*) = 0 but N(a,y*) A 0, then the given differential equation has the horizontal equilibrium solution 
y = y*. On the other hand, if there exists a point x = x* such that the function M(a2*,y) 4 0 but N(a*,y) = 0, 
then the solution to the differential equation M(x, y) dv+ N(x, y) dy = 0 (or y! = —M (a, y)/N(a, y)) has an infinite 
slope at that point. Since numerical determination of integral curves in the neighborhood of a point with large 
(infinite) slope becomes problematic, it is convenient to switch to a reciprocal form and consider the equation 
da/dy = —N(x,y)/M(a, y) for which « = «x* is an equilibrium solution. Then y is assumed to be an independent 
variable. A special case when both functions, M(x,y) and N(a,y), vanish at the origin is considered in §2.2.2. 

From a geometrical point of view, there is no difference between horizontal lines (y = y*) and vertical lines 
(a = «*), so these two kinds of points should share the same name. However, the vertical line is not a function as it 
is defined in mathematics. Another way to get around this problem is to introduce an auxiliary variable, say t, and 
rewrite the given differential equation (2.1.3) in an equivalent form as the system of equations 


da/dt = N(x, y), dy/dt = —M(z,y). 


When solving a differential equation, try to find all constant solutions first because they may be lost while reducing 
it to the differential form (2.1.3) and following integration. For instance, dividing both sides of the equation y’ = 
p(x)q(y) by q(y) assumes that q(y) 4 0. If there is a value y* at which q(y*) = 0, the equation y’ = p(x)q(y) has a 
constant solution y = y* since y’ = 0 or dy = 0. Sometimes, such constant solutions (horizontal y = y* or vertical 
x = x"), if they exist, cannot be obtained from the general solution (2.1.5), but sometimes they can. These constant 
solutions may or may not be singular. It depends on whether an integral curve from the general solution touches 
the singular solution or not. 

We would like to classify equilibrium solutions further, but we will use only descriptive definitions without heavy 
mathematical tools. When every solution that starts “near” a critical point moves away from it, we call this constant 
solution an unstable equilibrium solution or unstable critical point. We refer to it as a repeller or source. If every 
solution that starts “near” a critical point moves toward the equilibrium solution, we call it an asymptotically 
stable equilibrium solution. We refer to it as an attractor or sink. When solutions on one side of the equilibrium 
solution move toward it and on the other side of the constant solution move away from it, we call the equilibrium 
solution semi-stable. 


Example 2.1.1: (Population growth) The growth of the U.S. population (excluding immigration) during the 20 
years since 1980 may be described, to some extent, by the differential equation 


dP 
dt #241 


P(t), 


where P(t) is the population at year t beginning in 1980. To solve the given equation, we separate variables to 


obtain 
dP tdt 


P > BT 
Integration yields 
1 
InP= 5 In(t? +1) + nC =nvV#?+14+mC=nCV# +1, 


where C is an arbitrary (positive) constant. Raising to an exponent, we get P = C (t?+1)!/?. The given differential 


equation has the unstable equilibrium solution P = 0, which corresponds to C = 0. To check our answer, we ask 
Mathematica for help: 


DSolve[{y’ [t] == t+ty[t]/(t+t + 1)}, yl[t], t] 
Plot([{Sqrt[1 + t*t], 4*Sqrt[it+t*t], 0.4*Sqrt[1t+t*t], 8*Sqrt[it+t*t]}, 
{t, 0, 5}, PlotLabel -> "Solutions to Example 1.2.1"] 
Example 2.1.2: Find the general solution of the equation 
Qx(y — 1) dx + (2? — 1) dy =0. (2.1.6) 


Solution. The variables in Eq. (2.1.6) can be separated: 


aoe fora A+1, yAl. 
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Solutions to Example 1.2.1 


Figure 2.1: Example 2.1.1. Some solutions of the dif- 


ferential equation y’ = ty/(1+ t?), plotted in Math- Figure 2.2: Example 2.1.2. Solutions (2? — 1)(y — 
ematica. 1) = C of Eq. (2.1.6), plotted with MATLAB. 


The expression in the left-hand side is the ratio where the numerator is the derivative of the denominator. Recall 
the formula from calculus: 


/ 
/ ae dx = In |v(x)| + C = InCv(2), u(x) £0. (2.1.7) 
Integration of both sides yields In |a? — 1| = —In|y — 1| + C, where C is an arbitrary constant. Therefore, 


In |x? —1)+Inly—1|=C or In |(z? — 1)(y — 1)| =C 


Taking an exponential of both sides, we obtain 

(2? =y-Dl=e2 or (a? = 1)(y-1) = 4°, 
where sign plus is chosen when the product (x? —1)(y—1) is positive, otherwise it is negative. Since C is an arbitrary 
constant, clearly te© is also an arbitrary nonzero constant. For economy of notation, we can still denote it by C, 
where C' is any constant, including infinity and zero. Thus, the general integral of Eq. (2.1.6) in implicit form is 


(a? — 1)(y—1) =C. The general solution in explicit form becomes y = 1 + C (a? — iy) 4 ee 

Finally, we must examine what happens when « = +1 and when y = 1. Going back to the original equation 
(2.1.6), we see that the vertical lines z* = +1 are solutions of the given differential equation. The horizontal line 
y = 1 is an equilibrium solution of Eq. (2.1.6), which is obtained from the general solution by setting C = 
Therefore, the critical point y* = 1 is a nonsingular stationary solution. MATLAB codes to plot solutions are as 
follows. 


C1 = 1; C2 = 10; C3 = - 

xO = 1.1; x = x0:.01:x0 + 2.2; 

x2d = 1./(x.72 - 1); 

yi=1+C1.*x2d; y2=1+C2.*x2d; y3=1+C3.*x2d; 
plot(x,yl,’r’,x,y2,’k-.’,x,y3,’b--’, linewidth’ ,2) 
legend(’C = 1’,’C = 10’,’C = -2’); 


Example 2.1.3: Solve the initial value problem first analytically and then using an available software package: 


(1+ e¥) da — ec sin'ady=0, y(a/2) = 


Solution. The given differential equation has infinitely many constant solutions « = z* = nz (n = 0, ag Pads 
each of them is a vertical asymptote for other solutions. Outside these points, division by (1 + e¥) sin? x reduces 
this differential equation to a separable one, namely, 


dx e2y , Ee Y+e¥ 
Gc ot dy or y= = 3 
sinv’x I1l+ey sin? x 


(x Ann), 
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where n is an integer. Integrating both sides, we obtain the general solution in implicit form, 


dx ey 
—— dy + C, 
sin? x i 1+eY # 


where C is an arbitrary constant. To bypass this integration problem, we use the available software packages: 

Maple: int((sin(x))*(-3),x); Mathematica: Integrate [(Sin[x])*(-3) ,x] 
Maxima: integrate((sin(x))*(-3) ,x); 
and Sage: from sage.symbolic.integration.integral import indefinite_integral 
indefinite_integral(1/(sin(x))3, x) 

It turns out that all computer algebra systems give different forms of the antiderivative; however, all corresponding 
expressions are equivalent. In Maple, we define the function of two variables: 

restart; with(plots): with(DEtools): 

f:=(x,y) -> (ltexp(y) )/(exp(2*y)*sin(x)*sin(x)*sin(x)); 

After that we solve and plot the solution in the following steps: 


dsolve( diff(y(x),x) = f(x,y(x)), y(x)); # find the general solution 
dsolve( {diff (y(x),x) = f(x,y(x)), y(Pi/2) =O}, y(x)); # or 

soln := dsolve( {diff(y(x),x) = f(x,y(x)), y(Pi/2) =O}, y(x), numeric); 
odeplot(soln, [x,y(x)], 0..2); # plot solution 
fieldplot([x,soln], x=0..2, soln=-2..2); # direction field 


Example 2.1.4: Draw the integral (solution) curves of the differential equation 


dy siny 

dx sing 
Solution. The function on the right-hand side is unbounded at the points «* = nz, n =0,+1,+2,.... The 
constant functions y= kz, k =0,+1,+2,..., are equilibrium solutions (either stable or unstable depending on the 


parity of k) of the given differential equation except points 2* (n = 0,+1,+2,...). At the singular points (nz, k7) 
where siny = 0 and sinx = 0 simultaneously, the slope function sin y/ sina is undefined. For other points, we have 


dy da 5 dy daz 
= r — i 
siny sing sin y sin x 
Integration leads to 
In |tan 3] = In|tan S| + In = n|C'tan 5]. : 
2 2 2 
Hence, the general solution is tan¥4 a 
C tan $ or y = 2 arctan (C tan a) if 
|C tan $| < §. Using the following MATLAB com- ; 
mands, we plot a few solutions. 4 
C= [4j,,02;.2,2,8,221; ¢ = [¢,=cl; ; 
epsilon = .001; x0 = pi - epsilon; ‘i 
x = -x0:0.01:x0; 44 ae i 0 ‘ 2 3 4 
for i = 1:numel(C) 
y(:,i) = 2*atan(C(i)*tan(0.5*x)); Figure 2.3: Example 2.1.4. The so- 
end lutions of the initial value problem, 
plot(x,y,’k’,’linewidth’ ,2); plotted with MATLAB. 


Several integral curves are shown in Fig. 2.3. If we set C = 0 in the general solution, then the equation tan 4 = 0 
has infinitely many constant solutions y = 2km (k = 0,41,+2...). The other constant solutions, y = (2k + 1)z, are 
obtained by setting C' = oo in the general formula. Therefore, all equilibrium solutions are not singular because they 
can be obtained from the general solution. On the other hand, the vertical lines « = na (n = 0,+1,+2,...) that we 
excluded as singular points for the given differential equation dy/dx = sin y/ sina become equilibrium solutions for 


the reciprocal equation dx/dy = sin z/ sin y. 
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Example 2.1.5: (Young leaf growth) A young leaf of the vicuna plant has approximately a circular shape. Its 
leaf area has a growth rate that is proportional to the leaf radius and the intensity of the light beam hitting the 
leaves. This intensity is proportional to the leaf area and the cosine of the angle between the direction of a beam of 
light and a line perpendicular to the ground. Express the leaf area as a function of time if its area was 0.16 cm? at 6 
a.m. and 0.25 cm? at 6 p.m. Assume for simplicity that sunrise was at 6 a.m.!’ and sunset was at 6 p.m. that day. 

Solution. Let S = S(t) be the area of the leaf at instant t. We set t = 0 for 6a.m.; hence, $(0) = 0.16 cm?. It is 
given that dS/dt = arlI, where a is a coefficient, r = r(t) is the radius of the leaf at time ¢, and J is the intensity 
of light. On the other hand, I(t) = 8(t)cos0, where ( is a constant, and @ is the angle between the light beam 
and the vertical line (see Fig. 2.4(a)). Assuming that at midday the sun is in its highest position, and the angle 0 is 
approximately a linear function with respect to time t, we have 


A(t) =at+b, 0(0)=-=, 0(6)=0, 0(12) = . 
Therefore, a = 7/12, b = —7/2, and 6(t) = 7 (t — 6)/12. Then we find the intensity: 
I(t) = BS(t) cos = (t —6). 


Since S(t) = mr?(t) and r = \/S/z, we obtain the differential equation for S(t): 


dS T 
ee es —(t— 
eT k SVS cos 13! 6), 

, ds 1 
where k = af/./7. Separating variables ——= = k cos —(t — 6) dt which, when integrated, yields 
SVS 12 

2 


12 T 
ee eee ey | 
Ts — sin 75 ( 6)+C 


We use the conditions $(0) = 0.16 and S(12) = 0.25 to determine the values of the unknown coefficients & and C' to 
be k = 7/24 and C = —9/2. Hence 


s(t) = 16 : 
[sin 4 (t — 6) —9] 
\/ “ ie 
! me “ ra 
! — J 
I TNS . Z 
oe itl ‘i J 
rp ‘ oft 
of ee 
ee a8 2 4 6 8 10 12 
(a) (b) 


Figure 2.4: Example 2.1.5; (a) The growth rate of a vicuna leaf, and (b) integral curve S(t). 


Example 2.1.6: (Curve on the plane) Let us consider a curve in the positive half plane y > 0 that passes 
through the origin. Let M(x, y) be a point on the curve and OBMA be the rectangle (see Fig. 2.5, page 45) with the 
vertices at M and at the origin O. The curve divides this rectangle into two parts; the area under the curve is half 
the area above or 1/3 the total area. Find the equation of the curve. 


17It is a custom to use a.m. for abbreviation of the Latin expression ante meridiem, which means before noon; similarly, p.m. 
corresponds to the Latin expression post meridiem (after noon). 
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Solution. Suppose y = y(«) is the equation of the curve to be determined. The area of the rectangle OBMA is 
zx 
Aosma = Xy since the curve passes through the point M(x,y). The area under the curve is Q = i y(t) dt. We 
0 
know that Aogma — Q = 2Q or Aosma = 3Q. Thus we have 


zy = 3 | y(t) dt. 
0 


Differentiation yields 


d ; d [* 
— =yt = 3— t) dt = : 
ds (vy) =ytxy Sa [ y(t)d 3y(x) 


Hence, 
ytay =3y or sry’ =2y. 


This is a separable differential equation. Since the curve is situated in the positive half plane y > 0, we obtain the 
required solution to be y(x) = C'2?. 


Next, we present several applications of separable differential equations in physics and engineering. We refer the 
reader to [14], which contains many supplementary examples. 


Example 2.1.7: (Temperature of an iron ball) Let us consider a hollow iron ball in steady state heat; that is, 
the temperature is independent of time, but it may be different from point to point. The inner radius of the sphere 
is 6cm and the outer radius is 10cm. The temperature on the inner surface is 200°C and the temperature on the 
outer surface is 100°C. Find the temperature at the distance 8 cm from the center of the ball. 


Solution. Many experiments have shown that the quan- 
tity of heat Q passing through a surface of the area A is 
proportional to A and to the temperature gradient (see 
[14]), namely, 
M(x.y) 
dT 
Q=-kAVT =-kA—, 
dr 
where T is the temperature, V is the gradient operator, 
k is the thermal conductivity coefficient (& = 50 W/ (m-K) 
for iron), and r is the distance to the center. For a sphere 
of radius r, the surface area is A = 4rr?. In steady heat 
state, Q is a constant; hence, 


O B 


Figure 2.5: Example 2.1.6. The 
SA? dT =; curve to be determined. 
dr 
Separation of variables and integration yields 
—4tk far =Q [s or 4tk K —4rtkT = 2 for some constant K. Thus, T = ae + Kk. From the 


boundary conditions T(6) = 200 and T(10) = 100, we obtain the system of algebraic equations 


Q Q 


a ay, ie 1b 


200 + Kk, 100= 
Solving for Q and K, we get 


Q=60007k and K = —50. 
Therefore, T(r) = 1500/r — 50 and, in particular, T'(8) = 137.5°C. 


Example 2.1.8: (Light passing through water) A light beam is partially absorbed by water as it passes 
through. Its intensity decreases at a rate proportional to its intensity and the depth. At 1m, the intensity of a light 
beam directed downward into water is three quarters of its surface intensity. What part of a light beam will reach 
the depth of h meters? 
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Solution. Let us denote the intensity of the beam at the depth h by Q(h). Passing through a water layer of 
depth dh, the rate of absorption dQ equals —k Q dh with some coefficient k. Hence, Q(h) = Ce~*". 

Suppose that at the surface Q(0) = Qo. Then Q(h) = Qo e7*". We know that at the depth h = 1, Q(1) = 2 Qo. 
Thus, 


3 _ —k -~_ 3 
7 @o = Qe = ee. 


Therefore, Q(h) = Qo (3)". 


Figure 2.6: The infinitesimal work done by the system during a small compression dz. 


Example 2.1.9: (Work done by compression) In a cylinder of volume Vo, air is adiabatically (no heat transfer) 
compressed to the volume V. Find the work done by this process. 

Solution. Suppose A is the cross-sectional area of the cylinder. We assume that the pressure exerted by the 
movable piston face is p. When the piston moves an infinitesimal distance dz, the work dW done by this force is 
dW = —pAdz. 

On the other hand, Adz = dV, where dV is the infinitesimal change of volume of the system. Thus, we can 
express the work done by the movable pistol as dW = —pdV. For an adiabatic compression of an ideal gas, 


R 
pV’ =constant, y=—-+1, 
Cy 


where Cy is the molar heat capacity at constant volume and R is the ideal'® gas constant (~ 8.3145 J/mol-K). 
From the initial condition, it follows that 


pV" =p V," => p=poV) V”. 


dV 
Substituting this expression into the equation dW = —pdV, we obtain dW = —po V;’ VT Since W(Vo) = 0, 
integration yields 
Po Vo’ PoVo 
W(V) = ——— - —.. 
”) (G-1)VE* yal 


Example 2.1.10: (Fluid motion) Let us consider the motion of a fluid in a straight pipeline of radius R. The 
velocity v of the flow pattern increases as it approaches the center (axis) of the pipeline. Find the fluid velocity 
v = u(r) as a function of the distance r from the axis of a cylinder. 

Solution. From a course in fluid mechanics, it is known that the fluid velocity v and its distance from the axis 
of a cylinder are governed by the separable differential equation: 


where p is the density, js is the viscosity, and A is the pressure difference. The minus sign shows that the fluid 
velocity decreases as the flow pattern approaches the boundary of a cylinder. Integration yields 


A 
v(t) = per4id 


Aw 
with an arbitrary constant C. We know that on the boundary r = R the velocity is zero, so 
_ _ PA pe _ = PA: pe 8 
o(R) = rt +C=0 => he oar (=r). 


18, — 5/3 for every gas whose molecules can be considered as points. 
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2.1.1 Autonomous Equations 


Definition 2.5: A differential equation in which the independent variable does not appear explicitly, i-e., an 
equation of the form 


y = dy/dt = f(y) (2.1.8) 


is called autonomous (or self-regulating); here y = y(t) is a function of variable t. 


For example, a simple autonomous equation y’ = y is a topic in calculus; it leads to the definition of an exponential 
function with base e. Autonomous equations arise naturally in the study of conservative mechanical systems. For 
instance, the equations used in celestial mechanics are autonomous. The book [14] contains numerous examples of 
autonomous equations. 

As we discussed previously (page 41), the constant, or equilibrium solution, is particularly important to au- 
tonomous equations. The constant solution is called an asymptotically stable equilibrium if all sufficiently small 
disturbances away from it die out. Conversely, an unstable equilibrium is a constant solution for which small pertur- 
bations grow with ¢ rather than die out. By definition, stability is a local characteristic of a critical point because 
it is based on small disturbances, but certain large perturbations may fail to decay. Sometimes we observe global 
stability when all trajectories approach a critical point. 

To determine stability of a critical point y*, it is convenient to apply the Stability Test: If f’(y*) > 0, then the 
equilibrium solution is unstable. However, if f’(y*) < 0, then the equilibrium solution is stable. 

The implicit solution of the Eq. (2.1.8) can be determined by separation of variables: 


dy = Y dy 
f(y) ie FO) 


if the function f(y) 4 0 on some interval. Otherwise there exists a point y* such that f(y*) = 0, and the horizontal 
line y(t) = y* is an equilibrium solution of Eq. (2.1.8). 


=t—to, 


Example 2.1.11: (Example 1.6.7 revisited) Solve the initial value problem 


y=y, ya) = yo. 


Solution. The given differential equation has one equilib- 
rium solution y = 0 for which the stability test is inconclu- 


sive. Let us show that the solution is 350 
Yo coop 
y(t) = —————__.. 2.1.9 
( ) 1 — yo(t — to) ( ) 250+ 


It is easy to see that after separation of variables we have i 


d i 
a — dt, or = yt = t + C, 4007; I | 
" | 
y 50 i J 
which is a solution in implicit form. Some algebra work is Op ! rere a 
required to obtain an explicit solution: so} " | 
i 
y(t) = a, (2.1.10) ‘i - | | | | 
t + C ar 15 2 25 3 3.5 
Substituting the initial point ¢ = to, y = yo into Figure 2.7: Example 2.1.11. Solutions (2.1.9) 
Eq. (2.1.10), we find the value of an arbitrary constant C of the autonomous differential equation y = 
als _ 
to be —yg = to + C or y’, plotted with MATLAB. 
= to+1 
Cecpay = 
Yo 


If yo # 0, then the integral curve (2.1.10) is a hyperbola. If yo = 0, then we get the integral curve to be 
nonsingular equilibrium solution y = 0, which is the abscissa (horizontal axis). Indeed, assuming the opposite is also 


48 Chapter 2. First Order Equations 


true, we should have y(t1) = yi 4 0 at some point ¢; and also y(to) = 0. According to formula (2.1.9), the initial 
value problem y = y?, y(t1) = y1 has the solution y(t) = == that should go through the point y(t) = 0. 
Since y(t) 4 0 for any t, it is impossible to satisfy the condition y(to) = 0 and we get a contradiction. Therefore 
such a point (t1, y1), y1 4 0, does not exist and the integral curve should be y(t) = 0. 

Example 2.1.11 shows that the solution (2.1.9) may reach infinity in finite time. In fact, if yo > 0, then the 
solution (2.1.9) reaches infinity at t = to + 1/yo. 

The next theorem shows that solutions of an autonomous differential equation may reach infinity at the endpoints 
of an interval if the slope function in Eq. (2.1.8) is positive or negative on this interval and vanishes at the endpoints. 


Theorem 2.1: Let f(y) be a continuously differentiable function on the closed interval [a,b], where f(y) is 
positive except for the only two endpoints, f(a) = f(b) = 0. If the initial value, yo, is chosen within the open 
interval (a,b), the initial value problem 


¥= f(y), y(0) = yo € (a,b) 
has a solution y(t) on (—oo, co) such that lim; y(t) = a and lim:,+4.0 y(t) = 0. 


The constant functions y(t) = a and y(t) = 6 are solutions of the autonomous equation (2.1.8) since 
f(a) = f(b) =0. Let y = y(t) be a solution of the given initial value problem. This function monotonically increases 
if f(y) > 0 and it monotonically decreases if f(y) < 0 since p = f(y). The function y(t) cannot pass through 6 (or 
a, respectively). We assume the opposite: that there exists t = t* such that y(t*) = b for some t = t*. This leads to 
a contradiction of the uniqueness of the solution (see Theorem 1.3 on page 23) because we have two solutions y = b 
and y = y(t) that pass through the point ¢t = t*. Therefore, the bounded solution y = y(t) monotonically increases; 
hence, there exists the limit limz,. y(t) = C. 

We have to prove that C = b (C = a, respectively). Suppose the opposite: that C 4 b (C ¥ a, respectively). 
Then C <b (C >a) and f(C) > 0 (f(C) < 0). Since y(t) > C as t > ov, we have y’(t) > 0 as t > oo. Thus 


This contradiction proves that C = b. 


Theorem 2.2: Let f(y) be a continuous function on the closed interval [a, b] that has only one null y* € (a,b), 
namely, f(y*) =0 and f(y) £0 for all other points y € (a,b). If the integral 


y" dy 
i Fw) (211i) 


diverges, then the initial value problem for the autonomous differential equation 


def 


y =dy/dt= f(y), y(t)=y" (2.1.12) 


has the unique constant solution y(t) = y*. 


(2.1.12) has multiple solutions. 


If the integral (2.1.11) converges, then the initial value problem 


Let y = y(t) be a solution of the differential equation y = f(y) such that y(to) = yo < y*. Suppose 
f(y) > 0 for y € [yo, y*). Hence, separation of variables and integration yield that the function y = y(t) satisfies the 


equation 
aus -{~ dy 
—to= ee, 
VO f(y) 


p(t) dy 
I, fy) 


diverges as y(t) approaches y*. Therefore, the variable ¢ grows with no limit. This means that the integral curve of 
the solution y = y(t) asymptotically approaches the line y = y* as t increases infinitely, but does not cross the line. 
Thus, the initial value problem (2.1.12) has a unique solution. 


The integral 
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Let us consider the case when the integral 
yd 
/ a ee 
Yo 


converges. Then the curve y = y(t) goes through the point (tp + A, y*) on the line y = y*. Hence, at this point, the 

initial value problem (2.1.12) has no unique solution since the function y = y(t — to) is the solution of y = f(y) for 

an arbitrary constant to € R. Thus, y = y* is a singular solution. 

Note: As can be seen from the proof, Theorem 2.2 can be extended for separable equations y = f(y)g(t) if g(t) isa 

monotonically increasing continuous function without bounds. a 
The following examples present an array of physical models and mathematical equations where autonomous 

differential equations occur. 


Example 2.1.12: (Cell dynamics) The bone marrow of a 
human constantly produces red blood cells (erythrocytes) because 
they have a finite life span (approximately 120 days). Let us 
denote the number of erythrocytes in the blood at time t by u(t). 
This number may be modeled by the solution of the autonomous 
differential equation (see [1]) | 


du b 

— = —aul(t) + — 

a a 
with positive constants a and b. The coefficient a determines the Figure 2.8: Example 2.1.12. Erythro- 
rate of natural loss of erythrocytes. The oxygen level in kidney cyte dynamics, plotted with MATLAB. 


tissue determines the production of erythropoietin (Epo), and the 
level of Epo is proportional to the rate of red blood cell production. 
This means that the oxygen level is inversely proportional to the 
rate of erythrocyte production. 


To find the general solution of the equation, we separate variables to obtain 


eek eee 
—au+b/u 
Integration yields 
udu 1 d(b — au”) 1 4 
on [Ss 2a b— au? 2a a 


with an arbitrary constant C;. Raising both sides of the equation —2a(t + C,) = In |b — au?| to an exponent, we get 
b-—auw =aCe >, 


with another arbitrary constant C = e~?°°1/a. Hence, 


a 


b 1/2 
u(t) = F _ Cont 
is the general solution of the differential equation that serves as a mathematical model of the red blood cell system 
(see its graph in Figure 2.8). 


Example 2.1.13: (Investment) Suppose you learned that the U.S. government!’ borrowed from one of your 
ancestors $24 two hundred years ago at an interest rate of 8% compounded continuously. How much should you get 
for this investment? Find the amount after 300 years. 


19This example is based on a story about George Washington (1732-1799), who borrowed $24 on behalf of the U.S. government. 
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Solution. Let s(t) be the amount of money (capital plus interest) at time ¢ measured in years. Initially, s(0) = $24. 
The balance s(t) grows by the accumulated interest, which is proportional to the interest rate of 8%, or 0.08 in 


absolute numbers. Thus, 


ds(t) _ 
— = 0.08 s(t). 


This differential equation is separable and its solution is 
s(t) = Ce?! where C= s(0) = $24. 


At t = 200, we have s(200) = 24 e'© = $2.1326665 x 10°, which is more than 200 million dollars. 
The amount after 300 years will be s(300) = 24e74 ~ $6.357398 x 101! (in the short scale, one billion is 10°), 
which is about half the property value of New York City. 


Example 2.1.14: (Population growth) The simplest mathematical model of population growth is obtained by 
assuming that the rate of population increase at any time is proportional to the size of the population at that time: 


P = dP/dt =rP, (2.1.13) 


where P(t) is the population at an instant t. For other population models, see [14]. 

If r is a positive constant, then Eq. (2.1.13) is called the Malthusian?? growth model, which gives a reasonably 
accurate description of the population of certain algae, bacteria, and cell cultures. This differential equation is easily 
solved by simple integration. Collecting like variables in Eq. (2.1.13) gives 


P : 
< =—rdt => PO = Pie", 


The time taken for a population to double in size is called the doubling time. For the Malthusian model, this 
time, ta, is determined from the algebraic equation P(ta) = 2P(0) which leads to 


P(0)e"*4 =2P(0) and hence e”4 = 2. 
Thus, the doubling time is tg = (In2)/r. 


1. Determine which of the following differential equations are separable. 
(a) (4+y*)edr+ydy=0; (b) 27y'=3-y?;  (c) (a? +y’) da —2aydy =0; 


(d)  y' =cosy; (e) y' =cos(xy); (f) yy! =ax—2xy-—yt4; 
(g) vy’ -y=ay’; (h) y!=cos(et+y); (i) yy =a? +y; 
G) yy’ =er*4; (k) ay’—y=2*; (I) (1 +y") cosada = (1 —sin® x) ydy. 
2. Find the general solution of the following differential equations. Check your answer by substitution. 
(a) dy’ + (1+ 4x) y* = 0; (b) yy! =sin’ wa; (c) ry? y' = er; 
(d) e” dx + 2x7y dy = 0; (e) 4y' =y' sing; (f) y' =y° cosa; 
(g) cosxcosyy’=sinzsiny; (h) y' =ytanhz; (i) y' = cos’ xcosy; 
Gj) y= (1+ 2a)(1 + 4y”); (k) y= V1l-y9; (Il) y’=4-y; 
(m) dr=rcosOd6—sinOdr; (n) (a? +4)y! = 2zy; (0) y'=27y; 
(Pp) yl’ =ay?—3y?+e-3, (q) VI-ay’+2=0; (r) yf +ay=ca/y; 
(s) y’ = (tany)/(x +1); (t) y =(44a)y’. 


3. Solve the following separable equations 
(a) e*cosyda+secydy=0; (b) (y+1)?da=2?dy; (ce) y!=(e-1D(yt1)?”; 


(d) (1+?) dy—e7¥ dx =0; (e) (x+1)y' =ay; (f) (3y?+1)y' =cosz; 
(g) dy = cos? xcos*(2y) da; (h) «dx+ye "dy=0; (i) y’y’ =a/V1+ 2. 
4. Solve each of the following equations 

dy _ (yt+1)(@-1)(y—2), dy _ eel: 4s — 9rty. 
(a) $= Gace (b) de — Jatyaa) (c) yl = 2°"; 
(dq) cgt—-y=2"-y; e:) Y=; (ff) wy’ =14+y'; 

we é 3/2 

() Y=V¥4+1/@'+); ) v=pae Wo =40% (ty). 


20The British economist Thomas Malthus (1766-1834) discovered this law at the end of the 18th century. 
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5. Find all solutions (including singular) of the following differential equations. 
(a) x?y’y’+1=y; (b) ayy’ = (x? +1)(1—y”). 


6. Obtain the particular solution satisfying the initial condition. 


(a) yy’ =1, y(1)=1; (b) yi’ =a7e%, y(1) =0; 

(c) 6ryy’=1+y*, y(4)=1; (4) y=y?+y-6, y(1) =-1/2; 

(e) y' =y(yt+2), (2) =4; (f) y’=1+y?, y(0) =0; 

(g) 4rdx+9ydy=0, y(3) =0; (h) y' =2y/e, y(1) = —2; 

(i) y’=(1-a2)/t+y), yQ)=0; (Gj) 3a7ty?dr—22ydy=0, y(4) =8; 

(k) y’+ysin2z=0, y(0)=1; (l) y=ylny, y(0) =e; 
(m) y’=(y+1)’, y(0) =0; (n) y' =(2y+1)?, y(0) =-1; 

(0) y'=e""*, (0) =0; (p) y'=ysecx, y(0) =1; 

(q) 2ayy’=1+y’, y(1) =2; (vr) azy?dx+e*dy=0, y>1/2 when x > oo. 


7. Solve the given initial value problems and determine the interval of definition for the explicit solution. Hint: To find 
the validity interval (= the longest interval where the solution is defined), look for points where the integral curve has 
vertical tangent. 


x—x? x—x? 
(a) 28 yy=1; (bh) Bases ya, 
8. Find the most general function that has the property that its square plus the square of its derivative equals 1. 


9. The initial population in a village is 1,000. After five years, it reaches 1,848. Using the logistic population model 
P’ = 0.00025(k — P)P, determine the population after 15 years. 


10. The initial population in a town is 10,000. After five years, it reaches 13,564, and after 10 years it equals 14,929. Is 
there a solution to the logistic equation P’ = r(k — P)P that fits this data? 


11. Six of the 1,000 passengers, crew, and staff that board a cruise ship have the flu. After one day of sailing, the number 
of infected people rises to 12. Assuming that the rate at which the flu virus spreads is proportional to the product of 
the number of infected individuals times the number of people not yet infected, determine how many people will have 
the flu at the end of the 7-day cruise. 


12. The population of insects in a certain area increases at a rate proportional to the current population and, in the absence 
of other factors, the doubling time is one month. There are 500 insects in the area initially, and predators eat 100 
insects/month. Determine the population of insects in the area at any time. 


13. Suppose that a certain population has the growth rate (1 + cos(0.5t)) P(t), where P(t) is the population at time t. 
Determine the population at any time. 


14. A population of birds increases at a rate proportional to the number present. If the number of birds doubles in eight 
years, how long will it take to triple the initial number? 


15. The half-life of radioactive white lead is 22 years. Find the general formula modeling its radioactive decay. 


16. In winter, the thickness of the ice on a lake increases at a rate proportional to the square root of the time t. Write a 
differential equation for the thickness at any time. 


17. Observations show that the rate of change of atmospheric pressure with altitude is proportional to the pressure. At 
6km (about 18,000 ft), the pressure is half of its value at sea level. Find the formula for the pressure at any height. 


18. Solve the differential equation (that is, find its general solution and equilibrium solutions) 
daz 
a =k(a—a)(b—2), k>0,a>0, b>0, 
which occurs in chemistry (see [14]). What value does x approach as t — oo? 


19. For the given differential equation 2yy’ = (1 + y) cost, 


(a) find its general solution; 


(b) determine a particular solution subject to y(a/2) = —1 and identify its interval of existence. 


20. A person invested some amount of money at the rate 4.8% per year compounded continuously. After how many years 
would he/she double this sum? 


21. What constant interest rate compounded annually is required if an initial deposit invested into an account is to double 
its value in seven years? Answer the same question if an account accrues interest compounded continuously. 


22. In acar race, driver A had been leading archrival B for a while by a steady 5 km (so that they both had the same speed). 
Only 1km from finish, driver A ran out of gas and decelerated thereafter at a rate proportional to the square of his 
remaining speed. Half a kilometer later, driver A’s speed was exactly halved. If driver B’s speed remained constant, 
who won the race? 
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. For a positive number a > 0, find the solution to the initial value problem y’ = zy°, y(0) = a, and then determine the 
domain of its solution. Show that as a approaches zero the domain approaches the entire real line (—oo, 00) and as a 
approaches +oco the domain shrinks to a single point x = 0. 


For each of the following autonomous differential equations, find all equilibrium solutions and determine their stability 
or instability. 

(a) ¢=2°-4¢+3; (b) #@=2?-6r+5; (c) #= 427-1; 

(d) #=22?-a2-3; (e) &=(a—-2)?; (f) #&=(14+2)(a— 2). 
For each of the following initial value problems, determine (at least approximately) the interval in which the solution 
is valid and positive, identifying all singular points. Hint: Solve the equation and plot its solution. 

(a) y' =(2+3a7)/(5y*—4y), yO)=1 (b) x =(-y*)/(@? +1), 0) =0; 

(c) y' =(Qr—3)/(y?-1), y(1) =0; (d) y’=sinh2/(1+2y), (0) =1; 

(e) y' =9 cos(3t)/(2+ 6y), (0) = 1; (f) y= (8—e*)/(2+3y), y(0) =0. 
Solve the equation y’ = y?/x’, where p and q are positive integers. 
In each of the following equations, determine how the long-term behavior of the solution depends on the initial value 
y(0) =a. 

(a) gy=ty(3—y)/2+t; (b) y=(t+y)/(y? +3); (c) y= +t?)/(4y’ — 9); 

(d) y=y?—4¢?; (ec) gy=(1+t)(4+y*); (f) y=sint/(3+4y). 
Suppose that a particle moves along a straight line with velocity that is inversely proportional to the distance covered 
by the particle. At time t = 0 the particle was 10m from the origin and its velocity was vo = 20m/sec. Determine the 
distance and the velocity of the particle after 12 seconds. 


Show that the initial value problems, y = cost, y(0) = 0 and y = \/1 — y?, y(0) = 0, have solutions that coincide for 
O0<t<F. 

In positron emission tomography, a patient is given a radioactive isotope, usually technetium-99m, that emits positrons 
as it decays. When a positron meets an electron, both are annihilated and two gamma rays are given off. These are 
detected and a tomographic image (of a tumor or cancer) is created. 

Technetium-99m decays in accordance with the equation dy/dt = —ky, with k = 0.1155/hour. The short half-life of 
technetium-99m has the advantage that its radioactivity does not endanger the patient. A drawback is that the isotope 
must be manufactured in a cyclotron, which is usually located far away from the hospital. It therefore needs to be 
ordered in advance from medical suppliers. 

Suppose a dosage of 7 millicuries (mCi) of technetium-99m is to be administrated to a patient. Estimate the amount of 
the radionuclide that the manufacturer should produce 24 hours before its use in the hospital treatment room. 


Torricelli’s equation is an equation created by Evangelista Torricelli (1608-1647) to find the final velocity of an object 
moving with a constant acceleration without having a known time interval. It is based on expressing the velocity v = gt 
of a falling liquid drop in terms of distance: v = \/2gh, where g is acceleration due to gravity, and h is the distance. 
Suppose a tank has an outlet hole near the bottom. Let h = h(t) be the height of a fluid above the outlet at time t, 
and let a be the area of the outlet hole. For a tank with cross-sectional area A(h) at height h, the fluid volume V up 
to height h is given by an integral V(h) = is A(y) dy. This volume V(t), remaining after time t, satisfies Torricelli’s 
equation: 


x = —kav/2gh = A(h) & = —kav/2gh, 


where the flow constant k (0 < k < 1) depends on properties of the fluid such as viscosity and on the shape of the hole. 
A tank in the shape of a circular cone has radius rp = 0.3 meters and vertical height ho = 1.5 meters. Hence the radius 
r at height h satisfies r = h/5. A circular outlet has area a = 7/100. Find the time to empty the tank if k ~ 0.7. 


An upright hemispherical bowl of radius 1 meter (m) has an outlet hole near the bottom of radius r = 0.05m. Assuming 
flow constant k = 0.5, about how long would it take for the full bowl to empty under the influence of gravity? 


Stefan’s Law of radiation states that the radiation energy of a body is proportional to the fourth power of the absolute 
temperature T (in the Kelvin scale) of a body. The rate of change of this energy in a surrounding medium of absolute 
temperature M is thus 
4 4 

—=a0(M -T 

dt ( ) ? 
where o is a positive constant when T > M. Find the general solution of Stefan’s equation assuming M to be a 
constant. 


The antioxidant activity (of crude hsian-tsao leaf gum extracted by sodium bicarbonate solutions and precipitated by 
70% ethanol) A(c) depending on its concentration c can be modeled by the following initial value problem?! that you 


aresed to solve: 
dA _ 


<< =h[A"- A] b>0, — A(0) =0. 


lV ih-Shiun Lai et al., Journal of Agricultural and Food Chemistry, 49, 963-968, 2001. 
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2.2 Equations Reducible to Separable Equations 


One of the best techniques used to solve ordinary differential equations is to change one or both of the variables. 
Many differential equations can be reduced to separable equations by substitution. We start with a nonlinear 
equation of the form 

y = F(ar+ by+o), b#0, (2.2.1) 


where F(v) is a given continuous function of a variable v, and a, b, c are some constants. Equation (2.2.1) can 
be transformed into a separable differential equation by means of the substitution v = ax + by+c, b #4 0. Then 
uv’ =a+ by’, and we obtain from y/ = F'(v) that 
v =atby =a+bFi(v) or ay 
a+ bF(v) , 
which is a separable differential equation. 
For example, the differential equation y’ = (z+ y+1)? is not a separable one. By setting v = x+y+1, we make 


it separable: v’ = 1+y’+0=1+4v?. Separation of variables yields 
du 

—, =dz => arctan(v) =«#+C or v = tan(#+C). 
1+v? 


Therefore, the general solution of the given equation is arctan(x + y+1) = «2+ C, where C is an arbitrary constant. 
Equation (2.2.1) is a particular case of more general equations discussed in §2.2.3. oO 


There is another class of nonlinear equations that 
can be reduced to separable equations. Let us con- 
sider the differential equation 


ey =y F (xy), (2.2.2) 


where F'(v) is a given function. By setting v = zy, 
we reduce Eq. (2.2.2) to the following one: 


v 
v =y+yFO) =" 1+ Fo) 
since vo’ = y+ ay’ and y = v/x. Separation of vari- 


ables and integration yields the general solution in 
implicit form 


3.0 


dy a Figure 2.9: Example 2.2.1. Solutions for dif- 
soo SH et GG, v= ry. i es 
i v(1 + F(v)) y ia values of C, plotted with Mathemat 


Example 2.2.1: Solve the differential equation 
cy =2*y? — y. 


Solution. Since the equation is of the form (2.2.2), with F(v) = v? — 1, we use substitution v = xy, which yields 


vaytay =ytary?—yaay’ = v/a. 


We separate variables v and x to obtain 


dv da 1 
a or — 5,3 —lnlal +e. 
Since 
2 1 1 
UV =-_l OO _- SOC = 
—2(In|j2z|+C) C,-In2? 
where C; = —2C is an arbitrary constant (we denote it again by C), we get the general solution 
1 


Ine’? <C. 


— aVC —Inx2’ 
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To find the general solution in Mathematica, use the command DSolve. However, there are two options in its 
application: 
DSolve ly’ [x] == x*(y[x])3-y[x]/x,y [x] ,x] or 
DSolve ly’ [x] == x*(y[x])?-y[x]/x,y,x] 
The former does not give values for y’ [x] or even y[1]. The latter will return y as a pure function, so you can 
evaluate it at any point. Say, to find the value at x = 1, you need to execute the command y[1]/.% 
To plot solutions in Mathematica (see Fig. 2.9 on page 53), we type: 
solution = DSolvely’ [x] == x*(y[x])°3 - y[x]/x, ylx], x] 
g[x_] = yl[x] /. solution[[1]] 
t[x_] = Table[glx] /. C[1] -> j, fj, 1, 63] 
Plot([t[x], {x, 0.2, 3}] 


However, since the general solution is known explicitly, we can plot the family of streamlines with one Mathematica 


command: 
Plot[1/(x Sqrt [#1 - 2 Log[x]] &) /@ {1, 2, 3, 4}, {x, .3, 10} ] 


2.2.1 Equations with Homogeneous Coefficients 


Definition 2.6: Let r be a real number. A function g(x,y) is called homogeneous (with accent on syllable 


“ge”) of degree r if g(Ax, Ay) = A" g(a, y) for any nonzero constant . 


For example, the functions x/y, «—y, 27+ ./74+ y4, 22 +y? + xy? are homogeneous of degree zero, one, two, and 
three, respectively. Usually homogeneous functions of zero degree are referred to as homogeneous. Obviously, if f is 
a function of the ratio y/z, ice., 


f(e,y) = F (2), x #0, 


then f is a homogeneous function”? (of degree 0). We avoid the trivial case when f(x,y) = y/x. Let us consider the 
differential equation with homogeneous right-hand side function: 


oF ag (2) , «#0. (2.2.3) 
To solve this equation, we set the new dependent variable v(x) = y(x)/x or y = vx. Then Eq. (2.2.3) becomes 


dy _ 


i F(v). 


Applying the product rule, we find the derivative dy/dax to be 


oF +0= Fo) or oF = Fv) 0. 
We now can separate the variables v and z, finding 
at ~ = Bee 
Integration produces a logarithm of x on the right-hand side: 
Im =nC+4+lIn|2|=mMClz|=nCr (v=y/z), 
F(v)—v 


?2Because the ratio y/x is the tangent of the argument in the polar system of coordinates, the function f(y/x) is sometimes called 
“homogeneous-polar.” 


2.2. Equations Reducible to Separable Equations 55 


where C’ is an arbitrary constant so that the product Cx is positive. Thus, the last expression defines the general 
solution of Eq. (2.2.3) in implicit form. 
Sometimes it is more convenient to consider x as a new dependent variable. Since 24 = 1/(dx/dy), we can 


da 
rewrite Eq. (2.2.3) as 
dz _ 1 
dy = F(y/x)’ 
Using substitution x = uy and separation of variables, we find 
dx du es 1 = incurs / du 
dy” dy Flu) a LE Ih | a 
Example 2.2.2: Solve the equation 
d 2 
BLT TY «#0, y #0. 
daz xy 


Solution. Ruling out the singular lines x = 0 and y = 0, we make substitution y = vx. This leads to a new 
equation in v: 
dy DD 2 mc | 1 
pepe Ee Ee eg eed 
da LUX Uv Uv 


d 1 
Canceling v from both sides, we get 7 = —. The equation can be solved by separation of variables: 
L xv 


dx 


udv = — vy’ =In|z|+C, 
or 


Nle 


where C is a constant of integration. If we denote C = InC1, then the right-hand side can be written as In|z|-+C = 
In|z| + nC, = InCj|2|. We can drop the index of C; and write down this expression as In C|z| = InCz, where 
arbitrary constant C' takes care of the product Cx to guarantee its positiveness. That is, C > 0 when z is positive 
and C < 0 when z is negative. The last step is to restore the original variables by reversing the substitution v = y/a: 


2 2 
J =z =InCz or J = 2nCla| =n’ 2’, 
Qu? x 
because C? is again an arbitrary positive constant and we can denote it by the same letter. The general solution 
can be written explicitly as y = +aVln Cx? with a positive constant C. 
Now we rewrite the given equation in a reciprocal form: 


da ry 


dy 22 +42’ r#0, yO. 


Setting x = uy and separating variables, we obtain 


2 
1 

Integration yields u? = - = how so y = taVInCx?. To check our solution, we use the following Maple 

y nCz 


commands: 

dsolve(D(y) (x)=(x*xty (x) *y (x) ) /x*y (x) ,y(x)) 

To plot a particular solution we write 

p:=dsolve({D(y) (x) =(x*xty (x) ty (x) )/x*y (x) ,y(1)=3}, numeric) 

odeplot (p,0.4..5) 

We can reduce the given equation to a separable one by typing in Mathematica: 
ode[x_, y_] = (x°2 + y°2)*dx == x*y*dy 

ode[x, v*x] /. {dy -> x*dv + v*dx} 

Map[Cancel, Map[Functionlq, q/x*2], %]] 

Map[Function[u, Collect[u, {dx, dv}]], %] 
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Now suppose that M(a,y) and N(x, y) are both homogeneous functions of the same degree a, and consider the 
differential equation written in the differential form: 


M(a,y) dx + N(x, y) dy = 0. (2.2.4) 
Equation (2.2.4) can be converted into a separable equation by substitution y = vx. This leads to 
M(a, vx) daz + N(x, vx) (vdx + adv) = 0, 
which, because of the homogeneity of M and N, can be rewritten as 
x* M(1,v) da + 2° N(1, v) (vdx + adv) = 0 


or 
My (v) dx + Ni(v) (vdx + xdv) = 0, 


where M1(v) = M(1,v) and Ni(v) = N(1,v). We separate variables to get 


= dx Ni(v) iii 
[Mi (v) + vu Ni(v)] da + Ni(v)a du = 0 or ; © Tua uN” =0. 


Since the latter is already separated, we integrate both sides and obtain a solution of the given differential equation 
in implicit form. 


Example 2.2.3: Solve the differential equation 


d Ea 
gov z#0 and 27? >y?. 
x x 


Solution. When an equation contains a square root, it is assumed that the value of the root is always positive 
(positive branch of the root). We consider first the case when 2 > 0. Let us substitute y = vax into the given 


equation; then it becomes 
ev +u=v04+V1—-v? or ev =V1—v?. 


Separation of variables and integration yield arcsinv = In|z|+C1, or 


v(x) = sin (In |z|+ Ci) =sin(InCz), |nCa| < 


wlA 


9 


where C; = InC is a constant of integration. The constant functions, v = 1 and v = —1, are equilibrium solutions 
of this differential equation that cannot be obtained from the general one for any choice of C’. These two critical 
points are singular solutions because they are touched by integral curves corresponding to the general solution 
(arcsin (+1) = +4). Moreover, the constants v = +1 are envelopes of the one-parameter family of integral curves 
corresponding to the general solution. Since v = y/z, we obtain the one-parameter family of solutions 


y=axsin(InCz), |nCa|< 


re) 


Now let us consider the case where x < 0. Setting x = —t, t > 0, we get 


dy dy dt dy d Yaa a yea 
dx dt ded ™ ——s =<f — 
Equating these two expressions, we obtain the same differential equation. Therefore, the solutions are symmetrical 
with respect to the vertical y-axis, and the function y = y(t) = y(—a) = ¢ sin(InCt) is its general solution. The 
given differential equation has two equilibrium solutions that correspond to v = +1. One of them, y = 2, is stable, 
and the other one, y = —2, is unstable (see Fig. 2.11 on page 57). 
The given differential equation can be rewritten in the following equivalent form: 


wy! —y = fa? — 4. 


2.2. Equations Reducible to Separable Equations 57 


By squaring both sides, we get the nonlinear equation without the radical: 
x? (y')? — Qeyy’ + 2y? — 2? = 0. (2.2.5) 


As usual, raising both sides of an equation to the second power may lead to an equation with an additional solution. 
In our case, Eq. (2.2.5) combines two equations: xy’ = y + \/a? — y?. This additional solution, 


y =—x sin(InCz), 


is the general solution to the equation xy’ = y — \/x? — y?. To plot the direction field, we ask Maple: 
DEplot (x*diff (y (x) ,x)=y(x)-sqrt (x*x-y (x) *y(x)),y(x), 

x=-10..10,y=-10. .10, [y(4)=1, y(12)=1, y(-4)=1, y(-12)=11, 

scaling=constrained, dirgrid=[30,30], color=blue, linecolor=black, 
title="x*dydx=y-sqrt (x*x-y*y)") ; 


dy/dx = (y + sqrt(x? - y’)) /x 


x*dydx=y-sqrt(x*x-y*y) 
105 


TERR RRNA 
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Figure 2.10: Example 2.2.3: direction field and so- Figure 2.11: Example 2.2.3: direction field and so- 
lutions of the equation ry’ = y — \/x? — y?, plotted lutions of the equation ry’ = y + \/x? — y?, plotted 
with Maple. with Mazima. 


2.2.2 Equations with Homogeneous Fractions 
Let us consider the behavior of the solutions of the differential equation 
M(a,y)dz + N(x,y) dy =0 


near a singular point (x«*,y*), where M(a*,y*) = N(a*,y*) = 0. To make our exposition as simple as possible, we 
assume that the singular point is (0,0), and consider the direction field near the origin (locally). When M(z, y) is a 
smooth function in the neighborhood of (0,0) such that (0,0) = 0, it has a Maclaurin series expansion of the form 


M(a,y) = ax+by+cx? + dry+ey? +---, 


and N(a,y) has a similar expansion. If we neglect terms of the second and higher degrees, which are small near the 
origin, we are led to an equation of the form 
dy  ax+by 


Such an operation, replacing the given differential equation with a simpler one (2.2.6), is called linearization (see 
§2.5). In what follows, it is assumed that aB 4 Ab; otherwise, the numerator az + by will be proportional to the 
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denominator: Ar + By = £ (ax + by). Setting y = xv (x 4 0), where v = v(z) is the function to be determined, we 
get 
: , atbv , atbuv—Av— Bo? 
=y+ = = = 2.2.7 
en = aoe By a A+ Bu ( ) 


The obtained differential equation for v may have critical points that are solutions of the quadratic equation 
Bu? + (A—b)v—a=0. (2.2.8) 


Since Eq. (2.2.7) is a separable one, we can integrate it to obtain 


A+ Bu dx 
— dy = — | — = —- Cr =r! 0). 
lesan 7 x ee ert) 

The explicit expression of the integral on the left-hand side depends on the roots of the quadratic equation (2.2.8). 
If b = —A, then the numerator is the derivative of the denominator (up to a factor 1/2), and we can apply the 

formula from calculus: 


£0) a) - 
v =In|f(v)|+ Ci =nCf(v), (2.2.9) 
f(v) 
where C; = InC is a constant of integration. This yields 
A+ Bu 1 f d(Bv?+2Av-—a) 1 5 4 
saa = 5 Of So "= 5 WB 2Av —a) =InCa™. 
io v >| 7 a 5 Int uv’ +2Av —a) =Car 


By exponentiating, we have Bu? + 2Av —a = Cx~?. Substituting v = y/z leads to the following general solution: 
By? + 2Azy — az? =C, 


which is a family of similar quadratic curves centered at the origin and having common axes. They are ellipses (or 
circles) if Ab—aB > 0, and hyperbolas together with their common asymptotes if Ab—aB < 0. The case b+ A = 0 
is also treated in §2.3. 

In general, the family of integral curves of Eq. (2.2.6) appears unchanged when the figure is magnified or shrunk. 
The behavior of the integral curves for Eq. (2.2.6) in the neighborhood of the origin is one of the following three 
types according to 

Case L (Ab)? + 4aB >0 (a) aB—Ab<0,  semi-stable, 
(b) aB—Ab>0, hyperbola. 
Case II: (A — 06)? + 4aB =0. 
Case III: (A-—b)?+4aB <0. 


When (A — b)? + 4aB > 0, the roots of the equation Bu? + (A — b)v — a = 0 are real and distinct, that is, 
Bu? + (A-—b)v —a= Biv — 1)(v — v2). Partial fraction decomposition then yields 


A+ Bu D E 
Bu? +2Av-—a " [|= + 7 si gta Nea] 


with some constants D and E (of different signs if aB < Ab). When (A — b)? + 4aB = 0, the equation Bu? + (A — 
b)v — a = 0 has one double root v1 = v2 = -41. Then 
A+ Bu A+ Bu A+ Buy 


> dv = | Se = Ff Inv - 0) |. 
Bu? +2Au—a_— Biv — v)? 7 Boa” ate 


When (A— 0)? + 4aB < 0, the roots of the equation Bu? + (A—b)v—a = 0 are complex conjugates, and the integral 


‘ wie dv can be expressed through the arctan function, as seen in the next example. 
v2t2Av—a ? 


Example 2.2.4: (Case III) Solve the equation 5adaz+2(a+y)dy=0, «+y#O0anda 40. 
Solution. Excluding the line x + y = 0 where the slope function is unbounded and setting y = va, we calculate 
the differential of y to be dy = xdu+vudz. Therefore, the given differential equation becomes: 


5a daz + 2(v%+x)(adu+vudz) =0 (vu #A-1 and « #0), 
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from which a factor 4 0 can be removed at once. That done, we have to solve 
5dx + 2(u+1)(adv + vdz) =0 or [5 + 2u(u + 1)] da + 2(u + l)adu =0. 


Since 2u(v + 1) +5 > 0, the differential equation in v has no equilibrium solution. Separation of variables yields 


2(v+1)du da 
[sees x BIE Ces 


where C is an arbitrary constant. Since d(2v? + 2v + 5) = 4u + 2, we have 
/ 2Zu+1)dv | ; | (4v + 4) du =; / (4v + 2) du +3/ 2du 
Qv24+2v4+5 2f QW24+2v45 2 Jf Q2+Q24+5 2 J Qv2+2v4+5 
[SS / dv 
Qu? + 2u+5 2u2 + 2u+5 


du 
In(2u? + 2v +5 —_ 
BU PaO) oor one 


Nle wele 


To find the antiderivative of the latter integral, we “complete the squares” in the denominator as follows: 
1 hy fi. 
2 
Q.-. 24) a2 = 
UE ob 5 vt ( ;) ( 5) + | 
4g ot 43 va - 9 
— _—_ — UV — — 
ae) ae a 2) "4 


du 1 U 
+ Te = Ts aretan— + C, 
ur +a a a 


/ dv -;/ dv ith. é (+) 
Wwe+Qv+5 2 Gsi7 sey 3 a a 


Substitution yields the general solution 


5 
2? +2045=2 (+043) =2 


Using the identity 


we obtain 


2v4+1 


1 1 
5 In(2u? + 20+5)-+ 5 aretan ( ) = —In|z|+C/2 


that is defined in two domains v > —1 and v < —1. Using properties of logarithms, we get 


2v4+1 


2 
In [a?(2v? + 2u + 5)] a arctan ( ) =C (vu #A-1 and «F0). 
To check the integration, we use Maxima: 
integrate ((2*(vt+1))/(2Q*v'2+2*v+5), v); 
Since v = y/x, we obtain the integral of the given differential equation to be 


2y+x 
30 


2 
C = In[2y? + 2ay + 5x7] + 3 arctan (yA -—ax and « #0). 


To draw the direction field and some solutions, we type in Maxima: 

drawdf (-(5*x)/(2*(xty)), 

makelist( implicit (C=log(2*y*2+2*x*y+5*x*2)+2/3*atan ((2Q*y+x) /(3*x)), 
x,-10,10, y,-10,10), C, [1, 3, 5, 10]))$ 


Example 2.2.5: (Case IIT) Consider the differential equation 


dy  —3x+ 8y 
— SS 3 —2 . 
dx 22 + 3y ’ Ure 
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dy/dx = -5x / 2(x+y) 


Figure 2.12: Example 2.2.4 (Case III): direction field 


and solutions to 52 dr+2(x+y) dy = 0, plotted with Figure 2.13: Example 2.2.5 (Case II): solutions to 
Mazima. y’ = (8y — 3x) /(22 + 3y), plotted with Mathematica. 


Excluding the line 3y = —2x where the slope function is unbounded, we make the substitution y = xv that results 
in 

ee pee => (25 +8 mjo—1] +3 nce) =0. 

(v — 1)? x v—1 = 
Intersections of the integral curves from the general solution with the line 2x + 3y = 0 give the points where tangent 
lines are vertical (see Fig. 2.13). The given differential equation has the singular solution y = x (not a member of the 
general solution). To plot the direction field and solutions for the equation, we use the following Maple commands: 
ode: =diff (y (x) ,x)=(a*x+b*y (x) ) /(A*x+By(x)) ; inc:=y(0)=1; 
Y:=unapply (rhs (dsolve ({eval (ode, {a=-3, b=8, A=2,B=3}), ics}, y(x))), x); 
plot(Y, 0 .. 3*Pi, numpoints = 1000, scaling=constrained) ; 
dfieldplot (eval (ode, {a=-3, b=8, A=2,B=3}), ics}, y(x), x=-9..9,y=-8..8) 
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Figure 2.15: Example 2.2.6 (Case Ib): direction field 
Figure 2.14: Case Ia: direction field and solutions to and solutions to y/ = (# — y)/(« + y), plotted with 
(x + 3y) dx = (2x + y) dy, plotted with Maple. Maxima. 
dy  «~-y 
Example 2.2.6: (Case I) Solve the equation — = yA#A—ax and «40. 


dr x+y’ 
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Solution. The slope function (# — y)/(a# + y) is a homogeneous function because 


sey 1—y/a 
r+y 1+y/x a Os 


Thus, we set y = vx (a # 0) and get 


dy du l-v dv l-v 
alin ms ih, 
ies “de l+v i et 


We simplify this expression algebraically and separate variables to obtain 


dy v2 +2u—1 1+ dx 


= —-—_——__ = dy = — 
dx lt+uv v2 +2u—1 . x 


or, after integration, 


1l+v dx 
Sane = —_—_— => —-1t . 
foe / e nCz (uF V2, « #0) 


The derivative of the denominator <4 (v? + 2v — 1) = 2u + 2 = 2(v + 1) equals twice the numerator. Thus, 


1l+v 1 fd(v*+2v-1) 1 9 
eee ee ae as ch eed Qu — 1). 
sree - a giant 3 ee 


From these relations, we get the general solution 


1 
5 Inv? + 20 1] =—-mC —-In|z| => In |v? + 2v — 1) + Inz? =n. 


Using the property of the logarithm function, In(ab) = Ina + In}, we rewrite the general solution in a simpler form: 
Inz?|v? + 2v —1] =InC. After substituting v = y/x (a 4 0) into the equation, we obtain 


|y? + 2yx — 27] =C 20 or y? + 2yr — 2? =c, 


where c is an arbitrary constant (not necessarily positive). This is the general solution (which is a family of 
hyperbolas) in an implicit form. We solve the equation with respect to y to obtain the general solution in the 


explicit form: 
y= xt V 2a? +0, 


which is valid for all real 2 where 2x7 + c > 0. The given differential equation, y’ = (x — y)/(x + y), also has two 
equilibrium solutions y = (—1 + V/2)a that are the asymptotes of the solutions (hyperbolas). In fact, integration 


yields 
l+vu udu y x+y 
[pci f og: wee Fand wets 


subject to v? + 2v — 140 (or in original variables, y? + 2xy — x? #0). Upon equating the quadratic function to 0, 
we obtain the critical points v = —1 + V2, which correspond to y = (—1+ V2). The line where the slope function 
is unbounded, y = —2, separates the direction field and all solutions into two parts. As Fig. 2.15 shows, one part of 
the equilibrium solutions is stable, and the other part is unstable. 


The following equation: 
+b 
dy _p (— y ) (2.2.10) 


is an example of a differential equation with a homogeneous forcing function, where a,b, A, and B are some constants. 
This means that the function Fw) on the right side of Eq. (2.2.10) remains unchanged if x is replaced by ax, and 
y by ay, with any constant a 4 0 because 


PF aax + bay _Pp ax + by 
Aaz + Bay Ax + By 


In what follows, we present an example for F(w) = //w. 
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Example 2.2.7: Solve the differential equation y’ = a _ = x > 0, under two sets of initial conditions: y(1) = 1 
xL 
and y(1) =3. 
Solution. Setting y = va in the given equation, we transform it to a separable equation v’ a +v = /(1+v)/2. 


For positive root branch, the slope function has the critical point: v = 1, which corresponds to the equilibrium 
solution: y = x. Upon separation of variables, we obtain 


dv dx 


: _ i du _ / dx 

(l+v)/2-v & v—V/(1+v)/2 x 
where absolute value sign in the right-hand side can be dropped because we assume that x > 0. In the left-hand 
side integral, we change the variable by setting v = 2u? — 1. Then 


—InC|z\, 


2_itv 


u and v—/(14+v)/2=2u?—u-1, dv=4udu. 


Hence, 
4u du 


7 on ees 
Partial fraction decomposition yields 
4u — 4/3 2/3 
2u2—u-1 u-1l u+i1/2’ 


u#l1, w#-1/2. 


Note that u cannot be negative because u = ,/(a + y)/2x > 0. Integrating, we have 


4 2 
g mlu-i[+5hn uct 


1 
ut5[=—mnce or 2Inju-—1|+In 


1 

; = : nC cz. 

Thus, 
In(u — 1)?|u + 1/2| = nC|a|-9/?. 


Exponentiation yields the general solution 


(u — 1)?|u + 1/2] = Cla|-3/?. 
Substituting back u = ,/(1+ v)/2 = (a+ y)/2z, we find the general solution of the given differential equation to 


be 
2 
y+ta 
—1 
22 | 


ytex 


eo VSR Cla|-3/2, G11) 


As usual, we assume that the square root is nonnegative. Setting « = 1 and y = 1 in Eq. (2.2.11), we get C = 0 
from the initial condition and 
yYro 1 


22 


y+a 1 


oe al 


It is known that the product of two terms is zero if and only if one of the multipliers is zero. Hence, 


ytnx yter 
=1 =>-— 
22 we Va 


Since the square root cannot be equal to a negative number (—1/2), we disregard the latter equation and get the 


required solution y+a2=2x or y=vz. To check our answer, we type in Maple: 

dsolve ({diff (y(x) ,x)=sqrt ((xty) /(2*x)) ,y(1)=1}, y(x))} 

We may use the general solution to determine an arbitrary constant C' in order to satisfy the initial condition, 
y(1) = 3. Namely, we set x = 1 and y = 3 in Eq. (2.2.11) to obtain C = 2/2 — 1/2. Thus, the solution (in the 


implicit form) of the given initial value problem is 
2 
1 1 
a “va (Si = (2v2-5) 


2 
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dy/dx = sqrt((x+y)/(2x) 


/ 
a 
7 
f- 
/, 
we 
# 
4 


Figure 2.17: Example 2.2.8: direction field and solu- 
Figure 2.16: Direction field and solutions in Example tions to y’ = («+ y+ 1)/(a — y — 3), plotted with 
2.2.7, plotted with Maxima. Maple. 


2.2.3. Equations with Linear Coefficients 


We are now able to handle differential equations of the form 


dy r (es). 


a eS ae 2.2.12 
daz Ag+ By+C ( ) 


where a,b,c, A, B, and C are constants. When c = C = 0, the right-hand side is a homogeneous function. Equation 
(2.2.12) coincides with Eq. (2.2.1) if A= B=OandC=1. 

Each of the equations ax + by+c=0 and Ax + By + C = 0 defines a straight line in the ry-plane. They may 
be either parallel or intersecting. If we wish to find the point of intersection, we have to solve simultaneously the 
system of algebraic equations 

az + by+c=0, 
Ar+ By+C=0. 


This system has a unique solution if and only if the determinant of the corresponding matrix, 
a b 
A B |’ 
does not vanish (consult §7.2.1); namely, aB — Ab 4 0. Otherwise (aB = Ab), these two lines are parallel. We 
consider below these two cases. 
Case 1. Suppose that aB — bA # 0. Then two lines az + by +c = 0 and Ax + By + C = 0 have a unique 


intersection. In this case, the constants c and C' can be eliminated from the coefficients by changing the variables. 
The right-hand side of Eq. (2.2.12) can be made into a homogeneous function by shifting variables: 


(2.2.13) 


xc=X+a and y=Y+8, 


with constants a and £ to be chosen appropriately. Since a derivative is the slope of a tangent line, its value is not 
changed by shifting the system of coordinates. Then 


and 


ax+by+c=aX+bY+(aa+b8+c), Ar+By+C=AX+ BY +(Aa+ BE+C). 


Now choose a@ and £ so that 
aa+bB+c=0, Aa+BB4+C=0. (2.2.14) 
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Since the determinant of this system of algebraic equations is equal to aB — bA # 0, there exist an a and a § that 
satisfy these equations. With these choices, we get the equation 


with a homogeneous slope function in X and Y. The resulting equation can be solved by the method previously 
discussed in §2.2.2 (using substitution Y = vX). Its solution in terms of X and Y gives a solution of the original 
equation (2.2.12) upon resetting X =x«—aand Y =y- 8. 

Case 2. Suppose that aB — bA = 0. The two lines az + by +c = 0 and Ax + By+C = 0 in Eq. (2.2.13) 
are parallel. In this case we can transfer the differential equation (2.2.12) into a separable one by changing the 
dependent variable y with a new one that is proportional to the linear combination ax + by. That is, the numerator 
ax + by is proportional to the denominator Ax + By, and the fraction (ax + by)/(Az + By) will depend only on this 
linear combination. Since b = aB/A, we have 


aB a 


a y a (Ax + By). 


ax + by = ax + 


Therefore, we set 
av =ax-+by (or we can let Av = Ax + By). 


Then 


_ dy _ 
oe = as 0st =a t0F ( 


and, after dividing both sides by a, we obtain 


dv b av+ec 
—=1+-F : 
x = (5) 


ax + by+c 
Az+ By+C 


This is a separable equation that can be solved according to the method described in §2.1. 
Example 2.2.8: (Case 1) Solve the differential equation (see its direction field in Fig. 2.17 on page 63) 
(ce —y—3)dy = (a#+y+4+1)dz, x—-y—-340. 
Solution. We rewrite the slope function in the form 


gey+l  (@=—1)+ +2) 
a-y—3 (@—1)—(y+2)' 


which suggests the substitution X =x—1 and Y =y+2. This leads to the following differential equation: 


dy xX+Y 


oe eee Y. 
dx X-Y’ get 


with the homogeneous right-hand side function (X + Y)/(X —Y). To solve the equation, we make substitution 
«-1=X=rcosé, y+2=Y =r sin9@, with r, 6 being new variables. Then 


dX =cos@dr—rsiné dé, dY =sinédr+rcosédé. 
Substituting these expressions into the given differential equation, we get, after some cancellations, that 
dr=rdé. 


Then its general solution is Inr = 6+ C, where C is a constant of integration. So, in terms of X = «—-1, Y = y+2, 
we have 


2 
In(X? + ¥?) — 2arctan(Y/X)=C or In[(a —1)? + (y+2)?] — 2 arctan 4 us (ae 
em 


The given differential equation has no equilibrium solution. 
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Example 2.2.9: (Case 2) Solve the equation 


dy x+2y—5 
= = — 2 4 7#0. 2.2.15 
dx 2xa+4y+7’ a a ( ) 


Solution. This differential equation is of the form (2.2.12), where a= 1, b=2, c=—5, A=2, B=4, C=7, 
and F(z) = z. In this case, aB — bA = 0, and we set v = x + 2y. Then the slope function becomes 


x+2y—5 v—5 
Qr+4y+7 2w+7’ ey 


This leads to 


d —5  4v- 
oe a 


dx da Qu+7 Ww+7 
Synthetic division yields the separable equation 


+7), (1,1. 17 7 
(Pet) avs ($45 x py) aw = ae, 


After integrating, we find the general solution 


or (2u + 7) dv = (4v — 3) da. 


1 
5 + In| —3| =2 +6, where v= 2+ 2y, 
with the potential function 
C = 8y+ 34417 In|4x + 8y — 3}. 


The differential equation (2u + 7) dv = (4v — 3) dz has the critical point v* = 3/4 and the singular point v = —7/2 
to which correspond lines y = (3 — 4x)/8 and y = —(7 + 2x)/4, respectively. The unstable equilibrium solution 
y = (3 — 4x)/8 is nonsingular. 

If we try to find the solution subject to the initial condition y(0) = 0 using a software package, we will obtain 


the formula expressed via a special function. For instance, MATLAB code 
solution=dsolve(? (2*x+4*y+7) *Dy=x+2*y-5’ ) 


gives the output expressed through the omega function: 

exp ((2*C3)/3 + (2*t)/3 + (2*x)/3 - 2/3)/(2*exp(wrightOmega((2*C3)/3 
+ (2%t)/3 + (2*x)/3 + log(2/3) - 2/3))) - x/2 + 1/2 

subs(solution, ’C3’, 0); 


Similar output gives Maple: 
deqi:=diff (y(x) ,x)=(x+2*y(x)-5) /(2*x+4*y(x)+7): 


Y:=rhs (dsolve({deqi,y(0)=0}, y(x))); 
which presents the solution via the Lambert function (another name for the omega function): 
1 31% 3 gk I(-812+3I+17m) 

a — — — ey 

But 3 + 3 Lambert W 7° 

Similarly, Mathematica’s output for 

DSolve[{y’ [x] == (x + 2*y[x] - 5)/(2*x + 4*y[x] + 7), yl0] == O}, 

ylx], x] 


i 3 : : 
is y|z] > 3 3 — 4x + 17 ProductLog ae ee 


On the other hand, we can plot the solution of the initial value problem subject to y(0) = 0 along with the 
direction field using Maple’s commands 
implicitplot (17*log (3) =8*y-4*x+17*log (abs (4*x+8*y-3)), x=0..2,y=-6.4..0); 


DEplot (deqi,y(x) ,x=0..1,y=-1..0,dirgrid=[16,16] , color=blue) 
An equation of the form 
dy 


a (4) g(x), 20, (2.2.16) 
dx « x 
can also be reduced to a separable one. If we set y = va, then 


2S +u=v4 f(v) g(x), or ot = f(v)g(z). 
The latter is a separable equation, and we obtain its solution in implicit form by integration 
du daz 
7 {OS 
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(a) 


Figure 2.18: Example 2.2.9: (a) the implicit solution of Eq. (2.2.15) subject to y(0) = 0; (b) the direction field along 
with three solutions, the equilibrium solution 8y = 5— 4a, and the singular line 27 + 4y+7 = 0, plotted with Maple. 
The nullcline x + 2y = 5 is not shown. 


Example 2.2.10: Solve the differential equation 


dy _, (_y+1_\" 
dx ~ \a+y-2/) ° 
Solution. This is not an equation with a homogeneous function, but it is of the form under consideration (2.2.12), 


with a=0, b=1, c=1, A=1, B=1, C=—2, and F(w) =w”. Since aB # bA, we set x= X+aandy=Y+ 
to get 


dy _, Y+6+1 ° 
dx X+at+tY+fB-2) 


We choose a and £ so that 
8+1=0 and a+8-2=0. 


Then 6 = —1 and a = 3. With this in hand, we reduce the given differential equation to 


a aed a yx 
aX “\X+Y/) ~\L4+Y/X/) ° 


Setting Y = y+1=vX = v(x — 3) gives us 


2 
d 2vu% —v(1 2 
xFp+e=2() o | x AY a Put ev +o)? 
l+vuvu 


dx 


Separation of variables yields 


(l+v)?dv dX 
v+v3 xX 


Using partial fraction decomposition, 


(l+v? 1. 2 
ee ae + 
vt+v3 vy 1l4+v? 


(v £0), 


we integrate to obtain In |v| + 2 arctanv +In|X| = nC, or C = vX e222”, Substitution of X =x-3,Y =yt+1, 
and v = (y+ 1)/(a# — 3) yields the general solution in an implicit form: 


1 
C=(y+1) exp {2 arctan Lo} 
ae 


The given differential equation has the (nonsingular) equilibrium solution, y = —1. 


2.2. Equations Reducible to Separable Equations 67 


More advanced material is presented next and may be omitted on the first reading. 
Example 2.2.11: Solve the initial value problem for x > 0: 


dy y | 4x>cos(x?) 


= 542, (Vm) = 1. 


dr x y , 
Solution. We set y = va. Then 
dv 4x? cos(a?) 
zr— +u =v+ ——_. 
dx v 


This equation can be simplified by multiplying both sides by v/2, yielding 
vu’ = 42.cos(x”). 
After integration, we obtain its solution in implicit form to be v?/2 = 2sin(x?) + C. Since v = y/z, this gives 
y=vn=t2r \/sin(x?) + C/2. 


From the initial condition, y(./7) = 1, it follows that we should choose sign “+,” which results in C/2 = 1/7. Hence, 


the answer is 
y = «v/4 sin(#?) + 1/7. 


Definition 2.7: A function g(x,y) is said to be quasi-homogeneous (or isobaric) with weights a and ( if 


g(A%a, N° y) = r7 g(x,y) (2.2.17) 


for some real numbers a, 8, and y, and every nonzero constant 4. 


A differential equation with a quasi-homogeneous slope function can be reduced to a separable one by setting 
y = 2°/¢ or y= ux/*, We demonstrate two examples of differential equations with quasi-homogeneous right-hand 
side functions. 


Example 2.2.12: Find the general solution of the differential equation 


dy 6x = 3y4 
aes — f(x,y), where fe; y) = = pig xL x 0, y f 0. 
x aty 
Solution. Calculations show that 
6828 — 314y! 2 3 y? 
A ta = = 6a-—4a—B ~_ _ \48-4a—B 2 y_ 
fA x, A” y) DET gy 3 aa = 


The function f(a, y) is a quasi-homogeneous one if and only if the numbers 


6a—4da—-—B=2a—6 and 46-—4a-— 6 =—4a+ 38 


are equal. This is valid only when 3a = 28. 

Therefore, for these values of a and £, the given differential equation can be reduced to a separable one by setting 
y =v2*/?, From the product rule, it follows that y! = v' «°/? + v32'/?. Substitution y = v.r3/? reduces the given 
differential equation into a separable one: 


3 3x? 3u329/2 
yo = al? ol + zu y =a 
or : 
sist =5- sure ) == [2 — (1 + 0?)] 
Integration gives 
i 2v du = dx 
2—v?(1+v?) xo 
Since 
2vu 2vu _ 2vu jean: 


2—w(1+v2)  3(v2+2) 3(v2—1)’ 


we have 
2 
2 
In~ = =9In|z|+mC =InCxr? 
v2—1 
a) 


v2—-1 
the general solution 
y? +a 


a, 9 


The given differential equation has two equilibrium 


solutions y = +a3/? (see Figure at right). 


= Cx. Back substitution v = y2~?/? leads to 
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Example 2.2.13: Under which values of p and q is the slope function of the differential equation 


y =dy/dz =az” + by! 


quasi-homogeneous? For appropriate values of p, solve the equation y’ = —6 x? + y?. 


Solution. The given equation is quasi-homogeneous if 


ar°P gP + bdAPT yt = NP-% (aa? + by?) 


for some constants @ and (6. This is true if and only if 


1 1 
pa=q3 =B-a, or —---—=1, 
q Pp 
Hence, the function —6 x? + y? is quasi-homogeneous if and only if p = —2. For this value of p, we substitute 
y=u-ax?/4 = 4-27! into the given differential equation to obtain 
gle! -ug?=-62 742774 cu =u? +u—6. 
This is a separable differential equation. The roots u = 2 and u = —3 of the quadratic equation u? + u— 6 = 0 are 
critical points of this differential equation, and we must preclude u = 2 (y = 2x7!) and u = —3 (y = —3a~') in the 


following steps. Other solutions can be found by separation of variables. Integration yields 


/ du _ 
wtu-6 | 


Since u? + u—6 = (u+ 3)(u— 2), we have 


ee 1) 


“tsu—2) 5 


Substituting u = xy, we get the general solution 


1 


5 


xy —2 
ty+3 


/ & =InCz, 
x 


1 1 
5 [ln Ju — 2] — Inju + 3]] = 5 In 7 


ry —2 
ry +3 


| =nCxz or 


u#-3, uF¥2. 


| = InC2’. 


Raising to the exponent, we obtain all solutions of the given differential equation: 


general: ara 


ry-2 


2 
and equilibrium: y=—, y=— 
x 


3 
- 
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dy/dx = sqrt(x+y’ 
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Figure 2.19: Direction field and solutions for Exam- Figure 2.20: Direction field and solutions in Example 
ple 2.2.13, plotted with Maple. 2.2.14, plotted with Maxima. 


Example 2.2.14: Solve the differential equation y’ = \/x + y. 

Solution. The given equation is of the form (2.2.12), page 63, witha = 1,b=1,c=0,A=0,B=0, 
C =1, and F(v) = Vv. Since aB = bA = 0, we might try a change of variable y to v given by v = x+y. Then 
v =1+y' =1+4 WV, which can be solved by separation of variables. We rewrite the differential equation as 


~ 


du Fi 

Gy 

Integration leads to 
/ es etC. 

1+ Vu 

In the left-hand side integral, we change the variable of integration by setting p? = v; therefore, 
du 2p dp / (p+1—1)dp 
[<> 1l+p l+p en apap) 


dp 
— _ — = 2n-21 +1 
2 f ap 2f ; 'p n(p ) 
=2/v—-2In(l+ Vv) =2Vr+y—-2mn(1+ Vrt+y). 


Thus, 
2/e+y—-—2ln(l+Ve+y)=24+C 


is the general solution in implicit form, which contains an arbitrary constant C. 


Example 2.2.15: (Swan—Solow economic model) It may be said that the neoclassical approach has played 
a central role in modern economic growth theory. Most of these models are extensions and generalizations of the 
pioneering works published in 1956 by T. W. Swan and R. Solow. In this growth”? model, the production is described 
by some smooth function 

Y = F(K,L), 


where Y is the output flow attainable with a given amount of capital, K, and labor, L. In the neoclassical approach, 
a production function F' is considered to be a homogeneous function of degree one: F(rK,rL) = rF(K,L), for all 
nonnegative r. In order to have a growing economy, it is assumed that a constant fraction s of the total output flow 
is saved and set aside to be added to the capital stock. Neglecting depreciation of the capital, we have 


K=sY =sF(K,L), K(0) > 0. 


23 The model was developed independently by an American economist Robert Merton Solow (born in 1924) and an Australian economist 
Trevor Winchester Swan (1918-1989) in 1956. 
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Introducing the ratio k = K/L, the above equation can be reduced to the differential equation 


dk | 
dt 
where f(k) = F(K,L)/L = F(k,1) and n is the coefficient of the labor growth. Recall that for the Malthusian 
model, n is a constant; however, for another model it may be a function (like the logistic equation). In the Solow 
model (2.2.18), once the capital per labor function is determined, all other variables, such as consumption, savings, 


wages, kK, and L can be calculated accordingly. For example, the wage rate w and the profit rate r are given, 
respectively, by w = f(k) —kf"(k) and r = f’(k). 


sf(k) — nk, (2.2.18) 


Example 2.2.16: Consider the differential equation 


y —-V1l+y/x-1=0. 
We demonstrate Sage code that could be used to find the general solution: 


y = function(’y’) (x) 

de = diff(y,x) -sqrt(1t+y/x)-1 

h = desolve(de, y); h 

ckx == e* (-2/3*log (sqrt ((x + y(x))/x) + 1) - 4/3*log(sqrt((x + y(x))/x) - 2)) 


[Problems] 


1. In each exercise, transform the given differential equation to a separable one and solve it. 


(a) y =e"t¥-? 1, (b) y'=(Qx+y-1)?-1; (c) y'=sin(z+y); 
(d) y=e**4/(a+y)-1; (e) yy’ +ay =a; (f) y = Set y. 


Hint: f = = tan ($ =) = TECTIA Jas = 2/u— 8In(4+ Ju). 


2. In each exercise, solve the given differential equation of the form xy’ = y F(xy) by using transformation v = zy. 


(a) yay; (b) sy’ =e%—y, (c) sy’ =y/(cy+1); 
(d) 27y' =cos"(zy)—ay; (e) zy’ = ye; (f) ty! = 2%y? + Qay’. 
3. In each exercise, determine whether or not the function is homogeneous. If it is homogeneous, state the degree of the 

function. 

(a) 5x? — 2ay + 3y”; (f) Va-y; (j) tan a 

2 2. 

Se i cia (g) 2° —ay ty’; (k) €*; 

c x? + xy — y?; , 

oe Caen ao (h) In|2|—In|y); Seu 
(d) x cos(y/x) — y sin(x/y); busy Q) @+y yr"; 

. “a £ LY 
(e) (2? +4) exp (8); ea hah aa 


4. In each exercise, determine whether or not the equation has a homogeneous right-hand side function. If it does, find 
the general solution of the equation. 


(a) of =4+2,/5; (b) (y3 — ya?) dar = 2ary? dy; 

(c) y= =e, (d) (a? +y") da + xydy = 0; 

(e) 2eyy’ =a? +y"; (f) (y+ V2? + y?) dx — xdy = 0; 

(g)  a?y' = 2y? + ay; (h) (4£+1) de4 ( 1) dy = 0; 

(i) ay’ =y? + 2xy—6x?;  (j)_ (w* + y*) da — 2ay? dy = 0; 

(6) 9 = GREE, uly/a— 1) de + a(e/y + 1) dy = 03 
(m) wy sy t3cy+a°; (n) (2y° + ay) da + 2ry + y") dy = 0; 

(0) 2#?y'+3ary?+2y?=0; (p) (2x sin(y/x) — y cos(y/x)) dx + x cos(y/x) dy = 0. 


5. Solve the given differential equation with a homogeneous right-hand side function. Then determine an arbitrary constant 
that satisfies the auxiliary condition. 


(a) xcyda+ 2(x? + 2y”) dy=0, y(1)=1; 
(b) (y+ V2? + y?)dxz—2xrdy=0, y(1) =0; 
(c) (w—y)dx+ (8a+y)dy=0, y(3) = —2; 
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6. 


(d) (1+ 2e/") da + 2e*/¥(1—a/y)dy=0, y((1) =1; 
(e) (y° +ay) da —30*dy=0, y(1) =1; 
(f) (y? + 7xy + 16x7)da+a?dy=0, y(1)=1; 


(g) (3a? — 2y?)y’ = 2ay, y(1) = 1; 

(h) y?da + (a? + 3ry+4y?)dy=0, y(2) =1; 

(i) y(3a+2y)dxa—2?dy=0, y(1)=1; 

(i) 2(y +2)? dy = (y? + 2ay? +272y+a°)de, y(1) = 2; 

(k) (2x — 3y)dy = (4y—a)dax, y(-1)=2; (1) (a? +y*) de — 22?ydy=0, y(1) =1/2. 


Solve the following equations with linear coefficients. 


(a) (8a+y-—1)dx=(y—a—-1)dy; (b) (a+ 2y +4 2) da + (2x + 3y + 2) dy = 0; 
(c) (404+ 2y—8)dx+(2e-—y)dy=0; (d) (#-—4y—9)da4 (4x + y— 2) dy =0; 
(e) (5y—10)da +4 (2x + 4) dy =0; (f) (2a+ y— 8) dx = (—2a 4+ Dy — 12) dy; 
(g) (@-1de—(3e-2y—5)dy=0; — (h) (2 —y—5) der = (w — 2y — 1) dy; 

(i) (a+y—2)dx+(x+1)dy =0; (j) (2ea+y-—2)dx—(x—y+4) dy =0; 
(k) (38a@4+2y—5)dx = (24 —-3y+1)dy; (QQ) (Qr-—y-—3)dxr+ (3a+y+4+3) dy =0; 
(m) (4y—5x¢—-1)dv=(2ea+y+3)dy; (mn) (et+y+1)dr+(e4+3y+11)dy=0 


7. Find the indicated particular solution for each of the following equations. 


(a) (7+ 5y +5) da = (7y —x +7) dy, y(1) = 1; 
(b) (c—y-—3)dx+ (8x +y—-1)dy=0, y(2) = 2; 
(c) Y-wtl)de=(x+y+3)dy, — y(0)=V3—-2; 
(d) (y+2)da -—(a#+y+ 2) dy =0, y(1) = -1; 


(e) (4x + 2y — 2) da = (2a — y — 3) dy, y(3/2) = —-1; 

(f) ye/¥ dx + (y—xe*/¥) dy =0, y(1) = 1; 

(g) (4c + 3y + 2) dx + (2a + y) dy = 0, y(0) = 0; 

(h) (et ye"/* + ye %/*) da = a(e*/* + e/*) dy, (1) = 0; 

(i) (4a+y)?da=aydy, y(1)=-1; (j) (e+ 1dr = (4+ 2y+3)dy, y(0)=1. 
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8. In order to get some practice in using these techniques when the variables are designated as something other than x 


10. 


al 


12. 


13. 


and y, solve the following equations. 
dp q+4p+5. dr 2r—0+2. dw t—wt+5. 


= b) Soe C2 ba Wa 

le) fag ayqoene 2) aa ep SO? “ae ee 
dw wt+ou—-4 dk 3k + 2s —2 dz 5u—3z+8 

dj, S@a Meese. Ci PESOS gy, SEs BUS ee ee: 
(d) dv 3u + 3w — 8’ (e) ds k+4s+1?’ (f) du ztu ? 
dt 38xa+t+4 dx x+3t+3 dz 4t + llz — 42 
Be, = ot lee i) Se Ses f\. ei = Eee 
Oe eee Pe Fag SO SE Get 


Solve the following equations. 


(a) fu _ (v4) (b) i _ (2-042)' ie ge 


dx 2x — 2y dx xt+1 sey ; 

6a + 38y — 5 »_ 2a —4y + 

d) y=(4 a ‘= f =e d 
(d) y= (4e+y); (e) y ar (f) ra Dy ti 


For any smooth functions f(z) and g(z), show that yf(xy) + xg(xy) y’ is isobaric, i.e., by setting Ax, A°y, and A*~1y’ 


instead of x, y, and y’, respectively, the equation remains the same up to the multiple of A. 


Solve the following equations with quasi-homogeneous right-hand side functions. 


(a) yi =2y/e+2%/y; (b) y=y?/2*; (ce) y =3y/e?4+a4/y; (a) yl =y?/s*. 


Find the general solution of the differential equation (x — y)y’ = «+ y. Use a software package to draw the solution 


curves. 
Use the indicated change of variables to solve the following problems. 


(a) x’yda =(a7?y-2y°)dy, ew =tfu, y= Vu. 
(b) (1+ y?) da + [(1+ y?)(e¥ + 2? e7¥) — a] dy = 0; z=te’. 
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(c) an + 2xty? — e” Y) da + #3(1 + xy) dy = 0; u=ay 
(d) (14 dec (Qa + 2y +1+ 267”) dy =0; u=at+y. 
(e) (2y+27y 24 24) dx + (2a + xy) dy = 0; uU= «ry 

(f) (y- 2eylnz) dx + xlnz dy = 0; u=ylne. 

(g) (2y - vI— #4") da + 2x dy = 0; y=u/c. 

(h) (3y + 3a y? — = eee) da + (a + ay) dy = 0; u= ye. 


(i) 
(i) 


(2x 4 
(a 


3y + 5; u=22e+y. 
x) dy = 0; uy’. 


y)y = 6x4 
y’) dx + y(1 
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14. In each exercise, find a transformation of variables that reduces the given differential equation to a separable one. Hint: 
The equation y’ = f(x, y) with an isobaric function f(x, y) can be reduced to a separable equation by changing variables 


15. 


16. 


17. 


18. 


19. 


20. 


u=y/e, 
(a) 2xyy! = y? + V/y* — 92; 
(c) (a@+a?y)y’+2y+3ry?=0; (dd) 2?y'+a2°y4+3=0; 
(e) 3a°y’ — 2a7y4+ y? =0; (f) a8 y =24 /44 6r2y. 
Find the general solution by making an appropriate substitution. 
(a) y =4y?+4(e+3)yt(e+3)?; (b) y= She; 
(c) 3ay?y' +y?+2° =0; (d) y’ =secyt+ xtany; 


(ce) (@? +y?+3)y' = 2x(Qy—27/y);  (f) y! = eee. 


(b)  32°y! = 27 y? — 3ay — 4; 


where m is determined upon substitution f(Av, Ay) = A" f (a, y). 


In each of the following initial value problems, find the specific solution and show that the given function y,(x) is a 


singular solution. 


(a) y =(8a+y—5)/4*-62, (1) =2; —-ys(w) = 5 — 322. 
(b) = (#2, y4)=2 lx) =2 

(c) 3y'=14+ /1—3y/e, — (3)= 1; ys(w) = §. 

(d) yo! = Va +2y)— 327, y)=-4; — ys(a) =-¥. 


Solve the initial value problems. 


(a) (Qe+yy'=y, yl) =2; (b) xy Sot ay y(l) =1; 
(c) (e+2y)y=ax+y, y(l)=0; (d) (Say? —32%)y' =a Brean 
(e) ay =ayty’, y(1)=-1; (f) ayy’ =2%+y°, y(1) = 
(g) w=e/*(ry'—y), y)=0; (bh) y! = 2ay/(x? -y”), ee 
(i) ya=aey-x)y’, yl)=6 (3) = (2xy + y”)/(327), 

(k) cyy’=2?+y?, y(e)=ev2; (1) (y’x-y)cos# =—x sin¥, 


Use the indicated change of variables to solve the following equations. 


(a) yo =2y/e+a°/y, y= ua. (b) 


(3+92+3y+ fe) dx + (1+ 3a + y) dy =0, 


_ ul) =1; 
s 1/2; 
y(1) = §. 


u= 3+ y. 


(c) (a) 
(e) yur", (£) 


y=yte yy, you's. (Qy+1—<27y 


2eyy! = y? + /yt — 422, 


?) dx + 2a dy = 0, 


Qa7y! = 14+ \/1 + 422 y, 


U= xy. 
ua ax’y. 


Let P(x) = ax? +br+cbea polynomial of the second degree. The separable differential equation 


(P(x) = ax? + bar +c, 


was first considered by L. Euler for a polynomial P(-) of the fourth degree. 


P(y) = ay? + by +c) 


Find the general solution of the above 


differential equation using substitution « = ku+h, for some constants k and h, assuming that b? — 4ac > 0. 


Suppose that a small (tennis) ball of mass m is thrown straight up from the ground into earth’s atmosphere with 


the initial velocity vo. 


It is known from elementary physics that the presence of air influences the ball’s motion: 


it 


experiences now two forces acting on it—the force of gravity mg and the air resistance force, which can be assumed to 
be proportional to the velocity, —k|v|. Here v denotes the ball’s velocity and g the acceleration due to gravity. Make a 
numerical experiment by choosing some numerical values for m, vo, and k < 1 to show that the time for traveling up 


is not equal to the time traveling down. 
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2.3. Exact Differential Equations 


In this section, it will be convenient to use the language of differential forms. A differential form with two variables 
x and y is an expression of the type 
M(z,y) de + N(x, y) dy, 


where M and N are functions of « and y. The simple forms dx and dy are differentials and their ratio dy/dz is 
a derivative. A differential equation y’ = f(x,y) can be written as dy = f(x,y)da. This is a particular case of a 
general equation 

M(a,y) dx + N(x, y) dy =0 (2.3.1) 


written in a differential form, which suppresses the distinction between independent and dependent variables. Note 
that the differential equation (2.3.1) can have a constant solution y = y* (called the critical point or equilibrium 
solution) if M(a,y*) = 0. 

Let w(x, y) be a continuous function of two independent variables, with continuous first partial derivatives in a 
simply connected region 2, whose boundary is a closed curve with no self-intersections; this means that the region 
Q has no hole in its interior. The level curves ~(z, y) = C of the surface z = w(x, y) can be considered as particular 
solutions to some differential equation. This point of view is very important when solutions to the differential 
equations are composed of two or more functions (see Fig. 2.18 on page 66). Taking the differential from both sides 
of the equation w(x, y) = C, we obtain 


dy Ye(e,y) _ _ dv/de 


dz hy(a,y) Ob /Oy” 


Thus, the equation w(x, y) = C defines the general solution of Eq. (2.3.2) in implicit form. Furthermore, the function 
w(a, y) is a constant along any solution y = ¢() of the differential equation (2.3.2), which means that (a, 6(a2)) = C 
for all x € Q. For example, let us consider a family of parabolas 7(2, y) = C for (x,y) = y — 27. Then 


dw = yz dz + wy dy = 0 or (2.3.2) 


dy = dy — 2a dz. 
Equating dw to zero, we have 
Ue 2 2a: 
dx 9 


The latter equation has the general solution y — 2? = C for some constant C. 


dy — 2xdxz = 0 or 


Definition 2.8: A differential equation M(x, y) dx + N(z, y) dy = 0 is called exact if there is a function 7(z, y) 
whose total differential is equal to M daz + N dy, namely, 


du(x,y) = M(x, y) da + N(2,y) dy, 

Oy 

Or 

The function w(x, y) is called a potential function of the differential equation (2.3.1). 


M(c,y), = = N(w,y). 


The exact differential equation (2.3.1) can be written as dz(xz,y) = 0. By integrating, we immediately obtain 
the general solution of (2.3.1) in the implicit form 


w(x, y) = constant. (2.3.4) 


An exact differential equation is equivalent to reconstructing a potential function (x,y) from its gradient Vw = 
(Wa, Wy). The potential function, ~(a,y), of the differential equation (2.3.1) is not unique because an arbitrary 
constant may be added to it. Some practical applications of exact equations are given in [14]. 


Example 2.3.1: The differential equation 


2 2 
(+4) av+ (Fav) ay=o 


74 Chapter 2. First Order Equations 


is exact because 7(x,y) = ry(x + y)/2 is its potential function having partial derivatives 


2 2 

Surely, for any constant c, w(x, y) + c is also a potential function for the given differential equation. Therefore, this 
equation has the general solution in an implicit form: wy(a + y) = C, where C is an arbitrary constant. To check 
our answer, we ask Mathematica: 
psi := x*y*(x + y)/2 
factored = Factor[Dt[psi]] /. {Dt[x] -> dx, Dtl[y] -> dy} 
Collect [factored[[2]], {dx, dy}] 

The beauty of exact equations is immediate—they are solvable by elementary integration methods. To see how 
we can solve exact equations, we need to continue our theoretical development. 


Theorem 2.3: Suppose that continuous functions M(a,y) and N(a,y) are defined and have continuous first 
partial derivatives in a rectangle R = [a,6] x [c,d] or any domain bounded by a closed curve without self- 
intersections. Then Eq. (2.3.1) is an exact differential equation in R if and only if 

OM(x,y) _ ON(x,y) 

—+ = 2.3.5 

Oy Ox ( ) 

at each point of R. That is, there exists a function w(x, y) such that (x,y) is a constant along solutions to 
Eq. (2.3.1) if and only if M(x, y) and N(a, y) satisfy the relation (2.3.5). 


Computing partial derivatives M, =0M/Oy and N, = ON/0x from Eqs. (2.3.3), we obtain 
OM(x,y) _ Ov(a,y) — ON(a,y) _ OY(a,y) 


Oy Oxdy Ox OyOx 
Since M, and N, are continuous, it follows that second derivatives of ~ are also continuous in the domain R. This 
guarantees** the equality Yay = yr, and Eq. (2.3.5) follows. 

To prove that condition (2.3.5) satisfies the requirements for exactness of Eq. (2.3.3), we should show that it 
implies existence of a potential function. Choose any point (xo, yo) in the domain R where the functions M, N, 
and their partial derivatives M, = 0M/dy, N, = ON/Oz are continuous. We try to restore (x, y) from its partial 
derivative w, = M(x, y): 


vay) = [ME wde + hy), 
xO 
where h(y) is an arbitrary function of y, and € is a dummy variable of integration. We see that knowing one partial 


derivative is not enough to determine w(x, y), and we have to use another partial derivative. Differentiating (2, y) 
with respect to y, we obtain 


3) - “ OM(E,y) 
2,y)=— | M(a,y)de +h -| SENSE! es Aly 
by (#9) ae (x,y) (y) OE E+h'(y) 
Setting %, = N(ax,y) and solving for h’(y) gives 


” OM (x,y) 


A’(y) = N(2,y) — By 


da. (2.3.6) 


xo 

The right-hand side of this equation does not depend on x because its derivative with respect to x is identically zero: 
N,(a, y) — My(z,y) =0 for all (x,y) in R. 

Now we can find h(y) from Eq. (2.3.6) by a single integration with respect to y, treating « as a constant. This yields 


ny = [wen - ee as| dy. 


(0) 


24This condition is necessary for multi-connected domains, whereas for simple connected domains it is also sufficient. 
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Note that an arbitrary point (xo, yo) serves as a constant of integration. Substitution for h(y) gives the potential 


function : : = 9M( 
y= | M(z,y)a New {| =a | ev 2.3.7 
ven =f Mewae+ [|New — [ATP axl ay (2.3.7) 


0 i) 


To verify that the function (2, y) is the integral of Eq. (2.3.1), it suffices to prove that 
M,=Nz implies that %,=M, wy =N. 


In fact, from Eq. (2.3.7), we have 


af” Ya f* OM(c, 
be(@y) = Maw+— | N(au)ay~ f ws | OMe) ”) ae 
v Syo Yo LJ x6 Y 
y y 
~ Moy) + [ My(x.y) dy — f M,(2,y) dy = M(z, y). 
yo yo 


Similarly, we can show that w, = N(z,y). 


Example 2.3.2: Given (x,y) = arctanx + In(1+ y’), find the exact differential equation dy(x, y) = 0. 
Solution. The derivative of (x,y) is (note that y is a function of x) 


d Op oy , 1 ayy" 
=. — x = — + —— = 
lt+a* 1+y? 


Thus the function (a, y) is the potential for the exact differential equation 


14+y? 
2y(1+ 2?) 


(l+y*)dr+2y(l+a*)dy=0 or y= 


The simplest way to define the function ~(2, y) is to take a line integral of M(x, y) dx + N(a, y) dy between some 
fixed point (xo, yo) and an arbitrary point (#,y) along any path: 


(x,y) 
W(a,y) = / M(a,y) da + N(x, y) dy. (2.3.8) 


X0,Yo) 


The value of this line integral does not depend on the path of integration, but only on the initial point (xo, yo) and 
terminal point (a, y). Therefore, we can choose a curve of integration as we wish. It is convenient to integrate along 
some piecewise linear curve, as indicated in Fig. 2.21. Integration along the horizontal line first and then along the 
vertical line yields 


y (x, y) 


Figure 2.21: Lines of integration, plotted with pstricks. 


(#,y) 
v= M(z,y) de + N(c.y) dy = [ 


x0,Yo) (xo0,Yo) 


(x,y) 
M(a, yo) da + / N(a, y) dy. (2.3.9) 
(x,Yo) 


(x,Yo) 
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Similarly, if we integrate along the vertical line from (xo, yo) to (wo, y) and then along the horizontal line from (xo, y) 
to (x,y), we obtain 


(x,y) 
N(wo.y)ay + | M (a, y) dz. (2.3.10) 
(x0,y) 


(x,y) (xo0,y) 


y= M(z,y) de + N(c.y) dy = [ 


(x0,Yo) (x0,Yo) 
If the straight line connecting the initial point (xo, yo) with an arbitrary point (x,y) belongs to the domain where 


functions M(x,y) and N(ax,y) are defined and integrable, the potential function can be obtained by integration 
along the line (for simplicity, we set 2p = yo = 0): 


W(a,y) = | [x M(at, yt) + y N (ct, yt)] dt. (2.3.11) 


We may further simplify the integration in Eq. (2.3.8) by choosing the point (20, yo) judiciously. The line integral 
representation (2.3.8) for (x,y) is especially convenient for dealing with an initial value problem when the initial 
condition y(ao) = yo is specified. Namely, to define the solution explicitly, the equation ~(x, y) = 0 has to be solved 
for y = y(a) as a function of xz, which is the solution of Eq. (2.3.1) satisfying the initial condition y(xo) = yo. If 
it is impossible to resolve the equation w(x, y) = 0 with respect to y, the equation w(x, y) = 0 defines the solution 
implicitly. 

Note that the point (xo, yo) in the line integral has to be chosen in agreement with the initial condition y(xo) = yo. 
Then the solution of the initial value problem 


M(x, y) dx + N(x, y) dy = 0, y(x0) = Yo 


becomes w(x, y) = 0, where the potential function w(a, y) is defined by the line integral (2.3.8). 

While line integration is useful for solving initial value problems, its well-known variation, the method of “inte- 
gration by parts,” is beneficial for a large class of exact differential equations. For example, let us take the exact 
equation 

(3x?y — 2y? — 1) dx + (x* — 4ry + 2y) dy = 0. 


Its integration yields 


[eey- a — ide + f 2" — Ary + 2y)dy=C, 


where C is an arbitrary constant. We integrate the “parts” terms having more than one variable. Since [ x?dy = 
xy — [ 3x?ydax and — f 2y? da = —2xy? + f day dz, it follows 


[3eude — 2ry? + f ary ax —atary— [3eude - jf ecvey +y=C. 
The remaining integrals cancel, giving the solution 
gy —2Qey7 —a+y? =C. 


For visualization, we use Mathematica’s command ContourPlot (see Fig. 2.22): 


ContourPlot [x*x*x*y - 2*x*y*y - x + y*y, {x, -3, Pi}, fy, -3, Pit] 
Solution. This is an exact equation because for M(xz,y) = 2a+y+1 and N(a,y) =x — 3y +4 we have 


O2Qe+y+1)  A(a—-3y? +4) 
= = I. 
Oy Ox 


Hence there exists a potential function ~(z, y) such that y, = M(z,y) = 2x+y+1 and Wy, = N(a,y) = 2—3y?+4. 
We integrate the former with respect to x, treating y as a constant to obtain 


v(a,y) =2* +ay+a+hly), 
where h(y) is an unknown function to be determined. Differentiation of (a, y) with respect to y yields 


Ov 


—=N(a,y)=ae+h'(y) = 2 —3y? +4, 
Oy 
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Example 2.3.3: Consider a separable equation ee eee 
KZ ij 

On 

y' = p(x)q(y). 1 


We rewrite this equation in the form M(a) dx + 
N(y) dy = 0, where M(x) = p(x) and N(y) = 


—q ‘(y). Since M, = 0 and N, = 0 for any func- “ 
tions M(x) and N(y), the relation (2.3.5) is valid : 
and this equation is an exact differential equation. - 


Therefore, any separable differential equation is an 
exact equation. 


Figure 2.22: Contour curves 


3 2 2 
vy —2ay* —a#+y =C. 
Example 2.3.4: Solve the equation . ¢ ¢ 


(Qe + y+ 1)da + (x — 3y? + 4)dy = 0. 


so we get 
h'(y) = —3y* +4 = A(y) = —y? + 4y +, 


where c is a constant of integration. Substituting h(y) back into 7(a, y) gives the exact expression for the potential 
function 
W(a,y) =a? +ayta-yt+4yte. 


Of course, we can drop the constant c in the above equation because in the general solution C = 7(2, y) the constant 
C can absorb c. Instead, we apply the formula (2.3.11) and choose the initial point of integration as (0,0). Then 
Eq. (2.3.11) defines the potential function w: 


1 
W(a,y) = | [z(2at + yt +1) + y(at — yt + 4)] dt = eo? +ay+a—y? + 4y. 
0 


If we choose another point, say (1,2), we can apply, for instance, the line integration along the horizontal line y = 2 
first and then along the vertical line: 


(x,2) (x,y) 
/ Qr+2+1)de+ [ (a — 8y? + 4) dy = 2? -14+32-—3+ay—2r—y? + 3-23 + 4y—8, 
(1,2) (x,2) 


which differs from the previously obtained potential function (zx, y) = «2? +xry+a—y? + 4y by the constant 12. In 
MATLAB, we plot the direction field along with the contour plot: 

[x,y] = meshgrid(-2:.1:2,-1:.1:1); 

Z=x.72+ x.¥y + x - y.73 + 4xy; 

[DX,DY] = gradient(z,.2,.2); 

quiver(x,y,DX,DY), hold on 

title(’Direction field and level curves of z=x"2t+xy+x-y~3+4y’) 

[c,h] = contour(x,y,z,10,’linewidth’ ,2); 

h.LevelList = round(h.LevelList,1); clabel(c,h), hold off 


(x,y) 
w(x, y) = (L+y? + vy”) dx + (a?y + y + 2xy) dy. 
(0,1) 


We choose the path of integration along the coordinate axes; first we integrate along the vertical straight line from 
(0,1) to (0, y) (along this line dz = 0 and a = 0) and then we integrate along the horizontal straight line from (0, y) 
to (x,y) (dy = 0 along this line); this gives us the function 


y © 
wen) = [ yay+ [ (1+y? + ay’) dar = 5 (y? -1) + (20 + 2ay? + oy?) 
1 0 


The potential function v(az, y) can be determined by choosing another path of integration, but the result will be the 
same because the line integral does not depend on the curve of integration. For example, we may integrate along 
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Example 2.3.5: Let us consider the initial value problem 
(L+y? +ay*) dx + (a®y+y+2ay)dy=0, y(0) =1. 


First, we can check the exactness with Mathematica: 
MM[x_, y_] = 1+ y°2 + xy°2 

NN[x_, y_] = yx°2 + y + 2 xy 

Simplify (DIMM[x, y], y] == DINN[x, yl], x]] 


The line integral (2.3.8) defines the potential function for Figure 2.23: Direction field for Example 
this differential equation: 2.3.5, plotted with Mathematica. 


the horizontal line first and then along the vertical line or use the straight line formula (2.3.11): 
x y 
w(a,y) = (1+ +2-1)ar+ f (2?y + y + 2ay) dy 
0 1 
1 
= i [x (1+ (a@+1)(1+ t(y—1))?) + (y¥—1)(1 + t(y — 1)? (wt? +: 1 + 2zt)] de. 
0 


To check our answer, we type in Mathematica: 
psilX_, Y_] = Integrate[MM[x, yO], {x, x0, X}] + Integrate([NN[X, y], fy, yO, Y}] 
psilX, Y] == 


From the equation ~(2, y) = 0, we find the solution of the given initial value problem to be y = $(a) = 
(1 — 2a)/(1+2)?. 
Equation (2.3.1) is a particular case of the more general equation (functions M, N, and f are given) 


M(a,y) da + N(a, y) dy = f(x) da, (2.3.12) 


called a nonhomogeneous equation. If the left-hand side is the differential of a potential function (x,y), we get 
the exact equation dw(a,y) = f(x)da. After integrating, we obtain the general solution of Eq. (2.3.12) in implicit 
form: 


view) = f fade. 


Remark 1. We can obtain the potential function for Eq. (2.3.1) from the second equation (2.3.3), that is, 
Wy (x,y) = N(a,y). Then integration with respect to y yields 


Man= [| N@v dy +ote); 


where g is an arbitrary function of x. Differentiating with respect to x and setting w, = M(a,y), we obtain the 
ordinary differential equation for g(x): 


y 


o'()=M(a,v) - [| N(eway. 


Yo 


Remark 2. Most exact differential equations cannot be solved explicitly for y as a function of x. While this 
may appear to be very disappointing, with the aid of a computer it is very simple to compute y(x) from the equation 
w(x, y) =C up to any desired accuracy. | 

If Eq. (2.3.1) is not exact, then we cannot restore the function h(y) from Eq. (2.3.6) as the following example 
shows. 


Example 2.3.6: The differential equation 


(1+ y”) dx + (x?y + y + 2ay) dy = 0 
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is not exact because M, = 2y 4 2ay + 2y = Nz. If we try the described procedure, then 
to =1ty? and (x,y) = 4+ ay" +hly), 


where h(y) is an arbitrary function of y only. Substitution ~(z, y) into the equation w, = N(x,y) = 2?y+y + 2xy 
yields 
Qeyth'(y)=a7y+y+2ey or h'(y) =a2?y+y. 


It is impossible to solve this equation for h(y) because the right-hand side function depends on « as well as on y. 


1. Given a potential function w(x, y), find the exact differential equation dw(x, y) = 0. 


(a) v(a,y)=2? +4"; (b)  ¥(2,y) = exp(xy”); 
(c) (x,y) = In(a*y"); (d)  ¥(e,y)=(@+y—2)*; 
(e) v(z,y)=tan(92*+y");  (f)  ¥(x,y) =sinh(x? — y’); 
(g) ¥(x,y) =2/y*; (h)  v(2,y) = sin(2*y); 
(i) Y(@,y) = 2?y + 2y"; (Gj) o(x,y) = 2+ 3ay” +2%y. 
2. Show that the following differential equations are exact and solve them 
(a) xy! +2yx = 0; (b)  y (e? +y) de +a (c™ + 2y) dy = 0; 
(c) (24+e"%)da + xe" dy = 0; (d) (2xy — 3x”) dx + (x? + y) dy = 0; 


(e) (y—1)dx+4 (x—- 2)dy =0; (f) (6xy + 2y? — 4) dx = (1 — 3a? — 4ry) dy; 
(g) (coty+#*)dx=a esc? ydy; (hb) Gy t ly? +2°(1—ay)~*] dy = 0; 


(i) 2aeydax + (x? + 4y) dy = 0; (j) (cosxy — sinxy) (yda + xdy) = 0; 
(k)  y® da + 3ay? dy = 0; (Ql) (y/a@+)da + (Inz —1)dy =0; 
(m) e °dr—re°dé=0; (n) 2a(14+ \/a? — y) da = \/a? — ydy; 
(0) 4a7*da + dy =0; (pP) (4-p)det (33 - 3)dy=0. 
3. Solve the following exact equations. 
(a) (2y + 3x7) da + 2(a — y) dy = 0; (b) 4dx+ (y? +Inz) dy = 0; 
i: (2 eo (S22 ee @. 2a a 
y 72 r y? y ad y2 ys y rag 
(ey iniyl sinhade + 2°" ay = 0; (f) (2494 *) de 4 (« + Qy + Ine =) dy = 0; 
ydxz+ady y 1 ) x 
2 =0; h ——_— —- ———— ] dx —- — dy = 0; 
(g) T+ ry? ; (h) (x — yp 2/lo a ; (y—ap y 
(i) (2 v) dx = (« ri 1) dy; (j) Gi - a dx + [30 err + 205) dy = 0. 


4. Are the following equations exact? Solve the initial value problems. 
(a) cos7x cos2ry dx = 2sin7ax sin2rydy, y(3/2) = 1/2; 
(b) 2xydy + (2? + y7) dx =0, y(3) =4; 
(c) (2ry — 4) dx + (2? + 4y —1) dy =0, y(1) = 2; 
(d) sin(wy) da + wa cos(wy) dy = 0, y(1) = 7/2w; 
(e) (cos — 2r cos? 6) dr + rsin@(2rcos@ —1)d0=0, r(m/4) = 1; 
(f) (2r + sin 0 — cos @)dr+r(sin@ + cos@)d@=0, r(7/4) = 1; 
(g) 2 -te7¥/* dy — x~?ye-¥/* dx = 0, y(—2) = —2; 
(h) (2x — 3y) dx + (2y — 3x) dy =0, y(0) = 1; 
(i) 3y(x? — 1) da + (a? + 8y — 3x) dy = 0, y(0) = 1; 
(j) (xy? + a — 2y +5) dx + (2?y + y? — 2x) dy = 0, y(1) = 3; 
(k) 2y? sin? «da — ysin2a dy = 0, y(m/2) = 1; 
(1) (3a? sin 2y — 2ay) dx + (2x7 cos2y —2”)dy=0, y(1) =0; 
(m) (xy? + y) da + (xy +2 — 3y”)dy=0, y(0) = —1; 
(n) (siny+ycosx) da + (sina +acosy)dy=0, y(m) = 2; 
(0) ( 


5. Solve an exact equation 27 [30 +y- ye | daz + [2 + 3y? + e*] dy = 0. 


sinzy + aycosxy) da +a? cosxydy=0, y(1) =1; 
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2.4 Simple Integrating Factors 


The idea of the integrating factor method is quite simple. Suppose that we are given a first order differential equation 
of the form 
M(a,y)da + N(x, y) dy = 0 (2.4.1) 


that is not exact, namely, MM, #¢ N,. (It is customary to use a subscript for the partial derivative, for example, 
M, = 0M/oy.) If Eq. (2.4.1) is not exact, we may be able to make the equation exact by multiplying it by a suitable 
function pu(a, y): 

u(x, y) M(x, y)da + p(x, y) N(x, y)dy = 0. (2.4.2) 


Such a function (other than zero) is called an integrating factor of Eq. (2.4.1) if Eq. (2.4.2) is exact. The integrating 
factor method was introduced by the French mathematician Alexis Clairaut (1713-1765). If an integrating factor 
exists, then it is not unique and there exist infinitely many functions that might be used for this purpose. Indeed, 
let p(x, y) be an integrating factor for the first order differential equation (2.4.1) and w(a,y) be the corresponding 
potential function, that is, dW = pMdx+uNdy. Then any integrating factor for Eq. (2.4.1) is expressed by 
m(a,y) = u(x, y)o(w(a, y)) for an appropriate function ¢(z) having continuous derivatives. 

Using this claim, we can sometimes construct an integrating factor in the following way. Suppose that the 
differential equation (2.4.1) can be broken into two parts: 


Mi (a, y) da + Ni(a, y) dy + Mo(a, y) da + No(x, y) dy = 0 (2.4.3) 


and suppose that jui(2, y), vi(a, y) and po(a, y), W2(a, y) are integrating factors and corresponding potential functions 
to Mi(a,y) da + Ni(a,y) dy = 0 and Mo(a, y) dx + No(x, y) dy = 0, respectively. Then all integrating factors for 
equations M;, dx + N;,dy = 0 (k = 1,2) are expressed via the formula p(w, y) = ur(x, y)bn(We(a,y)) (K = 1,2), 
where ¢; and ¢2 are arbitrary smooth functions. If we can choose them so that j11(@, y)d1(Y1(a, y)) = Me(a, y) Xx 
$2(Wo2(x,y)), then p(x, y) = u(x, y)o1(Y1(a, y)) is an integrating factor. 

It should be noted that the differential equation is changed when multiplied by a function. Therefore, multi- 
plication of Eq. (2.4.1) by an integrating factor may introduce new discontinuities in the coefficients or may also 
introduce extraneous solutions (curves along which the integrating factor is zero). It may result in either the loss of 
one or more solutions of Eq. (2.4.1), or the gain of one or more solutions, or both of these phenomena. Generally 
speaking, careful analysis is needed to ensure that no solutions are lost or gained in the process. 


Example 2.4.1: The equation 
2dx+ («v+ =) dy =0 
y 


is not exact because 


_ OM 
=a 


for M =2 and N = 2°y+ay7'. If we multiply the given equation by p(x, y) =a °y~' (a £0, y £0), we obtain 


ON _ 
#5 = Say ty : 


2a 3y-1 da + (1 + a id dy = 0, 
which is now exact since 


ON 0 
OM oo Aha ee ON 
(Qa-"y"*) Qa °y an on 


2-2 
By By (l+a7*y ye 


Therefore, the potential function is w(x, y) = y — «~?y~! and the general solution becomes 7(x, y) = C, for some 


constant C. 


If an integrating factor exists, then it should satisfy the partial first order differential equation 
(uM)y=(UN)2 or = Mpy— Nutz + (My — Nz) =0 (2.4.4) 


by Theorem 2.3, page 74. In general, we don’t know how to solve the partial differential equation (2.4.4) other than 
transfer it to the ordinary differential equation (2.4.1). In principle, the integrating factor method is a powerful tool 
for solving differential equations, but in practice, it may be very difficult, perhaps impossible, to find an integrating 
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factor. Usually, an integrating factor can be found only under several restrictions imposed on M and N. In this 
section, we will consider two simple classes when js is a function of one variable only. 

Case 1. Assume that an integrating factor is a function of x alone, u = u(x). Then py = 0 and, instead of 
Eq. (2.4.4), we have 
du M,—N,z 
dc N © 
If (M, — N,)/N is a function of x only, we can can solve Eq. (2.4.5) because it is a separable differential equation: 


(uN), = UMy, or (2.4.5) 


ay = Mie Ne — Ne dx. 
Lb N 
Therefore, 
M, — Nz 
jo(“) = exp {| a ar} : (2.4.6) 


Case 2. A similar procedure can be used to determine an integrating factor when yp is a function of y only. Then 
Lz = 0 and we have 


dy M 
If (M, — N,)/M is a function of y alone, we can find the integrating factor explicitly: 
M,, — Nz 
u(y) = exp \-/ —— ay} : (2.4.7) 


Example 2.4.2: Consider the equation sinh x dx + cosh dy = 0, where 


1 1 | 
sinha = 5e?— Se", cosht= eh +56" 21. 


This is an equation of the form (2.4.1) with M(a,y) = sinha and N(z,y) = oan Since 


sinh 
M,=0 and N,=——, 
y 


this equation is not exact. The ratio 


M, — Nz (sinha)y sinha 


N  -ycosha2~—s cosh 


is a function on x only. Therefore, we can find an integrating factor as a function of «: 


2 My, — Nz a sinh x 
p(x) = exp f a ae} = exp{- | ae ar} 


We don’t need to find all possible antiderivatives but only one; thus, we can drop the arbitrary constant to obtain 


/ sinh x de = | d(cosh x) = In(cosh 2). 


cosh x cosh x 


Substitution yields 
1 


u(x) = exp {—Incosh x} = exp {In(coshx)~'} = =e 
x 


Now we multiply the given equation by p(2) to obtain the exact equation 


sinh x 


1 
dz +—-—dy =0. 
cosh x y 


O /sinha 0 f1 
By (==) =0 and ae (=) = 0. 


This equation is exact because 
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Since this differential equation is exact, there exists a function ~(a,y) such that 


Ow sinh x q Ow 7 1 
— = vy = — and —=,=-. 
Ox cosh x Oy og 
We are free to integrate either equation, depending on which we deem easier. For example, integrating the latter 
leads to 
v(x, y) = In|y| + k(a), 


where k(x) is an unknown function of x only and it should be determined later. We differentiate the equation 
v(a,y) = In|y|+ k(x) with respect to x and get 


sinh x 


Ye = ki (a) = 


cosh x 


Integration leads to k(x) = Incosha + C; with some constant C) and the potential function becomes 
w(a,y) = In|y| + Incoshz = In(|y| cosh z). 


We drop the constant Ci because a potential function is not unique and it is defined up to an arbitrary constant. 
Thus, the given differential equation has the general solution in implicit form: In(|y| cosh x) = C for some constant 
C. 

Another way to approach this problem is to look at the ratio 


ie) e dO. { coshaz 
M,—N, _ 3, (Sinh«) — 3, (sms ) sinha 1 


M sinh x 4 y sinha y 


which is a function of y only. Therefore, there exists another integrating factor, 


uy) = exp {— | “tu ay} =e [ “4h = exp (n|ul} = 


as a function of y. Multiplication by u(y) = y yields the exact equation y sinhadx + coshady = 0 for which a 
potential function ~(a, y) exists. Since 


dj =ysinhadx+coshady => oY = y sinha and oY  coshe. 
w y 


Integration of ~, = y sinh x yields 
(vy) = y cosha + h(y), 


where h(y) is a function to be determined. We differentiate 7(x,y) with respect to y and equate the result to 
coshz. This leads to h’(y) = 0 and h(y) is a constant, which can be chosen as zero. So the potential function is 
w(x,y) = y cosha, and we obtain the general solution in implicit form: 


y cosha =C, 


where C is an arbitrary constant. This equation can be solved with respect to y and we get the explicit solution 


C 


= = he. 
cosh x Cone 


y(z) 


Example 2.4.3: Find the general solution to the following differential equation 
(y + 2a) dx = (x + 1) dy. 
Solution. We begin by looking for an integrating factor as a function of only one variable. The following equations 


0 ane Oly + 2x) 0 ae O(—a — 1) 
—_—_ V — M —— 1 d — N — Ve SS —— —l, 
Oy (x; y) y Oy an Ox (a, y) Ox 
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show that the given differential equation is not exact. However, the ratio 


M,N, 2 


N  — ~(a%+1) 


is a function of x alone. Therefore, there exists an integrating factor: 


u(x) = exp a z : 1 a} = exp {—2In |x + 1|} = exp {In(w + 1)~*} = ae 


Clearly, multiplication by (a) makes the given differential equation exact: 


y + 2x 1 _ 


Hence, there exists a potential function ~(x, y) such that 


di(a,y) =U2da+ydy, with w= aaa 


Integrating the latter equation with respect to y, we obtain 


W(a, y) — 


for some function k(x). By taking the partial derivative 
with respect to x of both sides, we get 


Figure 2.24: Direction field for Example 


42 
Ve = a +k (2) = rit 2.4.3, plotted with Mathematica. 
Therefore, k’(x) = 2x(x + 1)~. Its integration yields the general solution 
2-y 2 
C= 1 1)’. 
Pay ee ee 


With the integrating factor (2) = (a + 1)~?, the vertical line solution x = —1 has been lost. 


Example 2.4.4: We consider the equation (xy — y? — y) dx + (a? — xy? + x) dy = 0, which we break into the form 
(2.4.3), where 


My(2,y) = (2? —y"), Ni (x,y) =a" =4"), Mo(a,y) ==; No(x, y) = @. 


First, we exclude from our consideration the equilibrium solutions « = 0 and y = 0. Using an integrating factor 
bi = (a? — y?)~1, we reduce the equation Mj(x,y) dx + Ni(x,y)dy = 0 to an exact equation with the potential 
function W(x, y) = xy. Therefore, all its integrating factors are of the form j11 (x, y) = (x? —y?)~1d1(xy), where ¢ is 
an arbitrary smooth function. The second equation, Mz dx + No dy = 0, has an integrating factor po(x, y) = (xy)7?; 
hence, its potential function is W2(x,y) = y/a. All integrating factors for the equation «dy — ydx = 0 are of the 
form [12(ax, y) = (xy)~*d2(y/x), where ¢2(-) is an arbitrary function. 

Now we choose these functions, ¢, and ¢2, so that the following equation holds: 


1 
ar — y2 


oi(zy) = = 2 (7) ; 


If we take @1(z) = 1 and $2(z) = z~1(1 — z7)71, the above equation is valid and the required integrating factor for 
the given differential equation becomes p(x, y) = u(x, y) = (x? — y?)~1. After multiplication by p(x, y), we get the 
exact equation with the potential function 


1 
w(x, y) = zy — 5 In 


1 (y #42). 
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The next examples demonstrate the integrating factor method for solving initial value problems. 
Example 2.4.5: Solve the initial value problem 
dxi+(a+y+1)dy=0, y(0)=1. 
Solution. This equation is not exact because M, # Nz, where M(a,y) =1 and N(z,y) =x+y+1. Although 
OM ON 1 |OM ON 
Si ll ee) ed 
Oy Ox 


——— = es 
Oy Ox M 
is independent of x. Therefore, from Eq. (2.4.7), we have u(y) = e¥. Multiplying both sides of the given differential 
equation by the integrating factor u(y) = e”, we obtain the exact equation: 


eYda+e¥(at+yt+1)dy=0. 


To find the potential function w(x, y), we use the equations w, = e¥ and y, = e” (x +y+1). Hence, integrating the 
former yields (x, y) = xe” + h(y). The partial derivative of w with respect to y is 


wy =xe¥ +h'(y) = («@+y4+ le’. 


From this equation, we get 

Wiy)=(yt Vey = h(y) = ye® + Ci, 
where C{ is a constant. Substituting back, we have w(x, y) = (w+y)e4¥+C\. We may drop C\ because the potential 
function is defined up to an arbitrary constant and therefore is not unique. Recall that w(x, y) = C for some constant 
C defines the general solution in implicit form. Hence, adding a constant C; does not change the general form of 
a solution. This allows us to find the potential function explicitly: q(x,y) = («+ y)e¥ and the general solution 
becomes 

(x +y)je% =C. 

From the initial condition y(0) = 1, we get the value of the constant C’, namely, C = e. So (x + y)e¥ = e€ is the 
solution of the given differential equation in implicit form. 


We can use line integral method instead since we know the differential of the potential function 
div(x,y) =e% da +e¥(a+y+4+1) dy. 


We choose the path of integration from the initial point (0,1) to an arbitrary point (x,y) on the plane along the 
coordinate axis. So - r 
wen) = f cde f ey (x +yt+1)dy= (r+ y)e¥ —e. 
0 1 
To find the solution of the given initial value problem, we just equate (x, y) to zero to obtain e = (a + y)e¥. 


There is another approach to solving the given initial value problem using substitution (see §2.3). To do this, we 
rewrite the differential equation as y’ = —(x+y+1)~!. We change the dependent variable by setting v= «+y+1. 
Then v’ =1+~y’. Since y’ = —v—1, we have 

1 v-1l 


v =(atytly/=1--= 
v v 


Separation of variables and integration yields 


i kd a= f (1+ ! Jav= far=2+e 
v—1 v—1 


v+Inju-lj=a2+C. 
We substitute v = x +y+1 back into the above equation to obtain 


or 


etytil+in|jetytl-lj=2+C or ytl+hn|z+y/=C. 


Now we can determine the value of an arbitrary constant C’ from the initial condition y(0) = 1. We set « = 0 and 
y = 1 in the equation y + 1+I]n|2 + y| = C to obtain C = 2, and, hence, 


y+inet+yl=1 


is the solution of the given initial value problem in implicit form. 
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Example 2.4.6: Solve the initial value problem 
3ydx+2xedy=0, y(—1) =2. 


Solution. For M = 3y and N = 22, we find out that M, #¢ Nz and the equation is not exact. Since M, — Nz = 1, 
the expression (M, — N,)/M = 1/(3y) is a function of y alone and the ratio (M, — N,)/N = 1/(2z) is a function 
of z only. Thus, the given differential equation can be reduced to an exact equation by multiplying it by a suitable 
integrating factor. We can choose the integrating factor either as a function of x or as a function of y alone. Let us 
consider the case when the integrating factor is w = p(y). 

Integrating (M, — N;)/M with respect to y, we obtain 


1 
u(y) = exp {-/ zw} = exp {in|yl-¥3} Sire 


Multiplication by p(y) yields 
3yly|— 1/3 dx + 2a |y|- 1/3 dy = 0. 


This is an exact equation with the potential function 
b(2,y) = 3ayly|"/? = 3ary?/? 


since the variable y is positive in the neighborhood of 2 (see the initial condition y(—1) = 2). Therefore, the general 
solution of the differential equation is 


sey/8 =C or = =y=C|a|-9/?. 
From the initial condition, we find the value of the constant C' = 2. Hence, the solution of the initial value problem 


is y = 2(—2)~9/? for x <0 since 


lal= x, for positive a, 
7 x, for negative x. 


On the other hand, the integrating factor as a function of x is 


= da | _ 1/2\ _ ),.)1/2 
pla) = exp f So} exp {in \ = |e ; 


With this in hand, we get the exact equation 
3y|a|/? da + 2ax|x|1/? dy = 0. 
For negative x, we have 
3y(—2)/? dx + 2a(—x)*/? dy = 0. 


The potential function is o)(x, y) = 2ay(—«)'/? and the general solution (in implicit form) becomes 2xy(—2) 
The constant c is determined by the initial condition to be c = —4. 


1. Show that the given equations are not exact, but become exact when multiplied by the corresponding integrating factor. 
Find an integrating factor as a function of x only and determine a potential function for the given differential equations 
(a and b are constants). 


nee ile 


(a) y/+y(1+2) =0; (b) 2° y!=2y+2"; 

(c) aydax + brdy =0; (d) 5dx—e’* dy =0; 

(e) (ya%e*Y — 2y?) da + (w*e™4 + 3ay?)dy=0; (f) gy?da + (e* —y)dy =0; 
(g) (ay? — y)da + (2a°y + x) dy = 0; (h) xy" dy — (x? + y*) da = 0; 
(i) (a? —y? + y) dx + x(2y — 1) dy = 0; (Gj) (5 -6y + 2°”) dx = dy; 
(k) (a? — 3y)da+ady=0; (1) (e?” +. 3y — 5) da = dy. 


2. Find an integrating factor as a function of y only and determine the general solution for the given differential equations 
(a and b are constants). 


(a) (y+ 1)dx—(x—y)dy=0; (b) (#-1) ders (20 Lal =) dy = 0; 
(c) aydx+brdy =0; (d) y(a@+yt+1)dr+ a(x + 3y 4+ 2) dy = 0; 
(e) (Qry?+y)da —ady =0; (f) 2(a+y) da — x(a + 2y — 2) dy = 0; 

(g) ty?da—(a?+y")dy=0, (bh) (w@t+ay)dr + (2*+y" —3)dy =0, 

(i) (y+1)da = (2x +5) dy; (j)  4y(a® — y°) dx = (3a* + Bary?) dy. 
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2.5 First-Order Linear Differential Equations 


Differential equations are among the main tools that scientists use to develop mathematical models of real world 
phenomena. Most models involve nonlinear differential equations, which are often difficult to solve. It is unreasonable 
to expect a breakthrough in solving and analyzing nonlinear problems. The transition of a nonlinear problem to a 
linear problem is called linearization. To be more specific, the differential equation y’ = f(x,y) is linearized by 
replacing the slope function f(x,y) with its first-order approximation 


y’ = f(z, yo) + fy(, yo) (y — Yo), (2.5.1) 


where y is restricted to some small interval containing a value yo of particular interest. In practice, yo is usually 
a critical point of the slope function where f(z, yo) = 0. Before computers and corresponding software were read- 
ily available (circa 1960), engineers spent much of their time linearizing their problems so they could be solved 
analytically by hand. 

For instance, consider a nonlinear equation y’ = 2y — y? — sin(t). Its linear version in a proximity of the origin 
is obtained by dropping the square y? and substituting t instead of sint when t is small. We plot in Maple the 
corresponding direction fields and some solutions (see the figures on page 39): 


ode6:=diff (y(t) ,t)=2*y(t)-y(t) *y(t)-sin(t) ; 
inc:=[y(0)=0, y(0)=0.3, y(0)=0.298177, y(0)=0.298174] ; 
DEplot (ode6, y(t) ,inc,t=-1..8,y=-2..2, color=black, linecolor=blue, dirgrid=[25,25]); 


A first-order linear differential equation is an equation of the form 


a(x) y’(x) + ao(x)y(x) = g(a), (2.5.2) 


where ao(x), a1(x), and g(x) are given continuous functions of 7 on some interval. Values of x for which a;(x) = 0 
are called singular points of the differential equation. Equation (2.5.2) is said to be homogeneous (with accent 
on “ge”) if g(a) = 0; otherwise it is nonhomogeneous (also called inhomogeneous or driven). 

A homogeneous (undriven) linear differential equation a; (x)y'(x)+ao(x)y(x) = 0 is separable, so it can be solved 
implicitly. In an interval where ai(”) 4 0, we divide both sides of Eq. (2.5.2) by the leading coefficient a;(a) to 
reduce it to a more useful form, usually called the standard form of a linear equation: 


y' +a(x)y = f(a), (2.5.3) 


where a(x) = ao(x)/ai(x) and f(x) = g(x)/a1(x) are given continuous functions of x in some interval. As we saw 
in §1.6 (Theorem 1.1, page 22), the initial value problem for Eq. (2.5.3) always has a unique solution in the interval 
where both functions a(x) and f(x) are continuous. Moreover, a linear differential equation has no singular solution. 

The solutions to Eq. (2.5.3) have the property that they can be written as the sum of two functions: y = yn +Yp, 
where yp, is the general solution of the associated homogeneous equation y’+a(x)y = 0 and yp is a particular solution 
of the nonhomogeneous equation y’ + a(x)y = f(x). The function yp, is usually referred to as the complementary 
function (which contains an arbitrary constant) to Eq. (2.5.3). 

In engineering applications, the independent variable often represents time and conventionally is denoted by t. 
Then Eq. (2.5.2) can also be rewritten in the form with isolated y: 


p(t) = +y = Fit) or pithy ty = F(t), (2.5.4) 
where the term F'(t) is called input, and the dependent variable y(t) corresponds to a measure of physical quantity; 
in particular applications, y may represent a measure of temperature, current, charge, velocity, displacement, or 
mass. In such circumstances, a particular solution of Eq. (2.5.4) is referred to as output or response. 

We shall generally assume that p(t) 4 0, so that the equations (2.5.3) and (2.5.4) are equivalent. If p(t) were 0, 
then y would be exactly F(t). Therefore, the term p(t)y describes an obstacle that prevents y from equating F(t). 
For instance, consider an experiment of reading a thermometer when it is brought outside from a heated house. The 
output will not agree with the input (outside temperature) until some transient period has elapsed. 


Example 2.5.1: The differential equation 
y +y=5 
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is a particular example of a linear equation (2.5.4) or (2.5.3) in which a(x) = 1 and f(x) = 5. Of course, this is a 
separable differential equation, so 


ep Y= fae. 
y—5 y—5 


Then after integration, we obtain 
In|y — 5| = —a+InC, 


where C is a positive arbitrary constant. The general solution of the given equation becomes 


ly—5|=Ce* or y—-5=+Ce”. 


We can denote the constant +C' again as C' and get the general solution to be y=5+Ce™”. 


There is another way to determine the general solution. Rewriting the given equation in the form M(y) da+N dy 
= 0 with M(y) = y —5 and N(x) = 1, we can find an integrating factor as a function of x: 


pep a rae alae 


Hence, after multiplication by u(x) = e*, we get the exact equation 
e*(y — 5) dx +e* dy =0 


with the potential function w(a,y) = e*(y —5). Therefore, the general solution is e”(y—5) = C (in implicit form). 


Generally speaking, Eq. (2.5.3) or Eq. (2.5.4) is neither separable nor exact, but can always be reduced to an 
exact equation with an integrating factor u = u(x“). There are two methods to solve the linear differential equation: 
the Bernoulli method and the integrating factor method. We start with the latter. 

Integrating Factor Method. In order to reduce the given linear differential equation (2.5.3) to an exact one, 
we need to find an integrating factor. If in some domain an integrating factor is not zero, the reduced equation will 
be equivalent to the original one—no solution is lost and no solution is added. Multiplying both sides of Eq. (2.5.3) 
by a nonzero function p(x), we get 


u(a)y'(@) + w(a)a(x)y(a) = w(x) f(a). 
Adding and then subtracting the same value p’(a)y(x), we obtain 
w(ayy'(a) + pw (a)yy(a) — w'(a)y(@) + wla)a(a)y(@) = w(x) f(2). 
By regrouping terms, we can rewrite the equation in the equivalent form 


d(py) 


HY = (wl (e) = ale) u(@)] yl) + w(x) (0) 


since (yy)/ = wy’ + p’y according to the product rule. If we can find a function p(#) such that the first term in the 
right-hand side is equal to zero, that is, 


u(x) — a(x)p(x) = 0, (2.5.5) 


then we will reduce Eq. (2.5.3) to an exact equation: 


< (u(x)y(a)] = ala) f(a). 


Excluding singular points where p(x) = 0, we obtain after integration 


y(ae)y(a) = / u(x) f(«) de + C, 
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where C is a constant of integration and the function s(x) is the solution of Eq. (2.5.5): u(x) = exp {f a(x) da}. It 
is obvious that j(a) is positive for all values of x where a(x) is continuous, so (a) # 0. Hence, the general solution 
of the nonhomogeneous linear differential equation (2.5.3) is 


y(“) = ra + aw / p(x) f(a)da, p(x) = exp {| a(x) ac} ; (2.5.6) 


Once we know an integrating factor, we can solve Eq. (2.5.3) by changing the dependent variable 


w(e) = v(e)n(e) =u(a) exp f ate) ac}. 


This transforms Eq. (2.5.3) into a separable differential equation 


because 


Sef / ates ar} =) exp { / ee) ar} 


From Eq. (2.5.3), it follows that y’ + ay = f; therefore, w’(x) = f(«)u(x). This is a separable differential equation 
having the general solution 


w(t) =C+ / f(a)ule) de, 


which leads to Eq. (2.5.6) for y(x). | 
Bernoulli’s”? Method. We are looking for a solution of Eq. (2.5.3) in the form of the product of two functions: 
y(x) = u(#)v(x). Substitution into Eq. (2.5.3) yields 


du du 
S v(x) + u(x) F + a(e) u(ao(a) = (2). 


If we choose u(a) so that it is a solution of the homogeneous (also separable) equation u’ + au = 0, then the first 
and third terms on the left-hand side drop out leaving an equation easily seen to be separable with respect to v: 
u(x) v’ = f(x). After division by u, it can be integrated to give u(x) = f f(x) u7'(x) dz. All you have to do is to 
multiply u(a#) and v(x) together to get the solution. 

Let us perform all the steps in detail. First, we need to find a solution of the homogeneous linear differential 
equation (which is actually a separable equation) 


u' (x) + a(x) u(x) = 0. (2.5.7) 
Since we need just one solution, we may pick u(x) as 
u(x) = eW fale) de, (2.5.8) 
Then v(x) is a solution of the differential equation 


wey a fe) or Ba LDA payelorer 


Integrating, we obtain 


u(x) = c+ | f(z) ef 0) 4 ay 


with an arbitrary constant C. Multiplication of u and v yields the general solution (2.5.6). o 
If coefficients in Eq. (2.5.2) are constants, then the equation a ,y’ + aoy = g(x) has an explicit solution 


1 
y(x) = — gamelan [oe grr de eC ewe, (2.5.9) 
ay 


251t was first formalized by Johannes/Johann/John Bernoulli (1667-1748) in 1697. 
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Example 2.5.2: Find the general solution to 


yf +t=aa’, 0<4<o. 
x 


Solution. Integrating Factor Method. An appropriate integrating factor u(«) for this differential equation 
is a solution of xy’ = ys. Separation of variables yields 
du dx 
Lo 
and after integration we get u(x) = x. Multiplying by p(x) both sides of the given differential equation, we obtain 


d 
zy ty=2° or ay (ty) = 
Integration yields (with a constant C) 
ree © 


Bernoulli’s Method. Let u(x) = 1/x be a solution of the homogeneous differential equation u’ + u/x = 0. 
Setting y(x) = u(x)v(x) = v(x)/x, we obtain 


y =v" (x) /x — v(x) /x? + u(x) /2? = 2? or =v (x) = 2. 


Simple integration gives us 
4 
£ 


v(2) _ va +C, 


where C' is a genuine constant. Thus, from the relation y(«) = u(x)/ax, we get the general solution 


Cx 
ye)= +5. 
x 4 


Example 2.5.3: Consider the initial value problem for a linear differential equation with a discontinuous coefficient 


y+p(x)y=0, y(0) = yo, 
where 


wf 2 #0<e<1, 
PTI) 0, ife>1. 


Solution. The problem can be solved in two steps by combining together two solutions on the intervals [0,1] and 
[1,co). First, we solve the equation y’ + 2y = 0 subject to the initial condition y(0) = yo in the interval [0,1]. Its 
solution is y(a) = yo e~?”. Then we find the general solution of the equation y’ = 0 for x > 1: y=C. 

Now we glue these two solutions together in such a way that the resulting function becomes continuous. We 
choose the constant C from the condition yy e~? = C. Hence, the continuous solution of the given problem is 


2 


ye, if0<a<l; 
y(x) = : 
yoe ~, ifl<x%<o. 


Example 2.5.4: Let us consider the initial value problem with a piecewise continuous forcing function: 


4z, «x 50, 


y-w=sa)={ ¢ y(0) = lim y(z) =0. 


x<0, z—>+0 
Solution. The homogeneous equation y’ — 2y = 0 has the general solution 
y(x) = Ce?” (—0o < & < ov), 


where C' is an arbitrary constant. Now we consider the given equation for positive values of the independent variable 
a: 


¢! —2¢ = 42, x>0. 
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—2x 


Solving the differential equation for an integrating factor, yu’ + 2u = 0, we get p(x) = e~**. Multiplication by u(x 
g 


reduces the given equation to an exact equation: 


wp —2Wbp=4eu or (up)' = Ave **. 
Integration yields 
p(x)b(a) = 4 fre dzi+ kK = a/ ze “det K, 


xo 
where K is a constant of integration. Here zg is a fixed positive number, which can be chosen to be zero without 
any loss of generality. Indeed, 


x 0 me 
/ re ** dx = get act [ mes oe av= | ee de + Cy; 
xo xo 0 0 


where C, = he xe?” dx can be added to an arbitrary constant K: 


u(x)b(x) = e~** d(x) = af ze "dx + K. 


0 


To perform integration in the right-hand side (but avoid integration by parts), we consider the auxiliary integral 


= 1 1 
F®) = | ek® dx = — ek® = 
. k k 


for an arbitrary parameter k. Differentiation with respect to k leads to the relation: 


d fl 1 1 i d oe 
P= 5 (Ge -g)-e-Betee =! ef da. 
0 


Changing the order of integration and differentiation, we get 


u 1 ka Uke F d ka ‘ ka 
Bp Re Reni -| qa” dz = ne dz. 
Setting in the latter equation k = —2 to fit our problem, we obtain 


1—e7?* — Qn ee? = af ve 2” de. 
0 


Using this value of the definite integral, we get the solution 
e ** d(z) =1-e * —-2Qne ** + K =(K4+1)-e°™* -2re-™". 
Hence, the general solution of the given differential equation becomes 


(2) (K+1)e2"-1-22, x>0, 
4 on 
Ce, x <0. 


This function is continuous when the limit from the right, lim,,+9 y(a) = K +1—1= K, is equal to the limit from 
the left, limz.-o y(z) = C. Therefore C = K = 0 in order to satisfy the initial condition y(0) = 0. This yields the 


solution 
(x) e* _1 I, 2«>0, 
zt) = 
os 0, a <0. 


To check our calculation, we use Maple: 
f :=x->piecewise(x>=0,4*x) ; 
dsolve ({diff (y (x) ,x)-2*x=f (x) ,y(0)=0}, y(x)) 
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Example 2.5.5: Find a solution of the differential equation 


2Qsin? x 


d 
(sin x) 2 — y cosz = — (ec Aka, k=0,+1,+2,...) 


dx 


oo 
such that y(a) > 0 as > co. 
Solution. Using Bernoulli’s method, we seek a solution as the product of two functions: y(x) = u(x) v(2). 

Substitution into the given equation yields 

du du 2sin? x 

— v(x) sinx + — u(x) sina — u(x)v(x2) cosa = — 

dx dx 
If u(x) is a solution of homogeneous equation u’ sina — u cosx = 0, say u(x) = sina, then v(x) must satisfy the 
following equation: 


ye 


dv . 95 2sin? x du 2 
— sin’ x =— or —=-—. 
dz x3 dx x3 


The function v(x) is not hard to determine by simple integration. Thus, v(7) = 2-2 +C and 


sin x 


y(x) = u(x) v(~) = C singe + = 


The first term on the right-hand side does not approach zero as x — oo; therefore, we have to set C = 0 to obtain 
y(x) =sinx/x? (see Fig. 2.25). 


Example 2.5.6: Let the function f(x) approach zero as x — co in the linear differential equation y’ + a(x) y(x) = 
f(x) with a(x) > c > 0 for some positive constant c. Show that every solution of this equation goes to zero as 
zr ow. 

Solution. Multiplying both sides of this equation by an integrating factor yields 


< (u(a)y(a)] = ula) f(), 


where In p(x) = a(x). Hence, 


ue) exo a(t) ar} pig [ exp a(t) ar} One 
u(e) =en{- fo a(t) a} lwo + een if a(t) a} f(r) dr). 


The first term vanishes as 7 — oo because 


exp {— fo a(t) ar} < exp - [ car} =exp{—c(x—29)} for all z. 


We can estimate the second term as follows. 


po ed a(t at} [ exp | a(t) at} Pas 
= [ exp{— fo anaes | a(t ae} fae 


. [ exp {— [ a(tar} f@)arl'< ‘| * exp {ew — 1} FI dr 


x/2 : 
/ exp {oe} far + f exp {—c(x — 7) } |f(7)| dr 
ea x/2 


(0) 


and 


1 T=2 1 gay 
< ae If(r)| = exp {—e(x — 7) } hee + If(r)|= exp {—e(a — 7)} ear 
1 1 
= = p—@/2 [1 = ee) = [1 -_ eal 
sos f(r) <e € eee If(7)| € 0 
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Figure 2.25: Example 2.5.5. A solution plotted with Figure 2.26: Example 2.5.7. The solution of 

Mathematica. Eq. (2.5.10) when a = 0.05, A = 3, and p(0) = 1. 

as © — oo since [1 ag OR -*8) | and Max,,<r<2/2 |f(T)| are bounded and ee |f(7)| > 0. Thus, both 
u/2<tcu 


terms approach zero as x — oo and therefore y(x) also approaches zero. 


Example 2.5.7: (Pollution model) Let us consider a large lake formed by damming a river that initially holds 
200 million (2 x 108) liters of water. Because a nearby chemical plant uses the lake’s water to clean its reservoirs, 
1,000 liters of brine, each containing 100(1 + cost) kilograms of dissolved pollution, run into the lake every hour. 
Let’s make the simplifying assumption that the mixture, kept uniform by stirring, runs out at the rate of 1,000 
liters per hour and no additional spraying causes the lake to become even more contaminated. Find the amount of 
pollution p(t) in the lake at any time ¢. 

Solution. The time rate of pollution change p(t) equals the inflow rate minus the outflow rate. The inflow 
of pollution is given, which is 100(1 + cost). We determine the outflow, which is equal to the product of the 
concentration and the output rate of mixture. The lake always contains 200 x 10° = 2 x 108 liters of brine. Hence, 
p(t)/(2 x 108) is the pollution content per liter (concentration), and 1000 p(t)/(2 x 10°) is the pollution content in 
the outflow per hour. The time rate of change p(t) is the balance: 


p(t) = 100(1 + cost) — 1000 p(t) /(2 x 10°) 


or 
p(t) + ap(t) = A(1 + cost), (2.5.10) 


where a = 5 x 10-° and A = 100. This is a nonhomogeneous linear differential equation. With the aid of the 
integrating factor y(t) = e**, we find the general solution of Eq. (2.5.10) to be 


a cost + sint 1+a? 2a? +1 
t) = p(0)e%* + A —<——_ 5-_ A ————_ — 4 
PIE) = BM) a2 +1 bs a(a? + 1) a(a? + 1) : 


—at 


Example 2.5.8: (Iodine radiation) Measurements showed the ambient radiation level of radioactive iodine in 
the Chernobyl area (in Russia) after the nuclear disaster to be five times the maximum acceptable limit. These 
radionuclides tend to decompose into atoms of a more stable substance at a rate proportional to the amount of 
radioactive iodine present. The proportionality coefficient, called the decay constant, for radioactive iodine is 
about 0.004. How long will it take for the site to reach an acceptable level of radiation? 

Solution. Let Q(t) be the amount of radioactive iodine present at any time t and Qo be the initial amount after 
the nuclear disaster. Thus, @ satisfies the initial value problem 
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Since this equation is a linear homogeneous differential equation and also separable, its general solution is Q(t) = 
C e—9-004t | where C is an arbitrary constant. The initial condition requires that C = Qo. 
We are looking for the period of time ¢ when Q(t) = Qo/5. Hence, we obtain the equation 


1 
Qo e70-004t _ = 


—0.004¢ __ 
Qoe ~ "5 5 


Its solution is t = 250 In5 402 years. 


1. 


10. 


11. 


12. 


Find the general solution of the following linear differential equations with constant coefficients. (Recall that sinh x = 
0.5e" —0.5e * and cosha = 0.5e" +0.5e *). 


(a) y'+4y=17sing; (b) y'+4y=2e%; (ce) y’+4y=e*; 
(d) y'-2y=4; (e) yi -2y=2+42; (f) y'-2y=3e*; 
(g) y —2y=e7; (h) y’-2y=5sine; (i) y'+2y=4; 

(i) y' +2y=4e"*; (k)  y' +2y =e"; (1) y' + 2y =3 cosha; 
(m) y/+2y=3sinhz; (n) y’—y=4sinhz; (0) y’—y=4coshz; 


(p) yy! =2yt+a7+3. 


Find the general solution of the following linear differential equations with variable coefficients. 


(a) y/+ay=a; (b) sy’ + (Ba+ly=e*; (c) ay’ + ay =1; 

(d) zy'+(Qr+1)y=4e; (e) (1—2)y’—ayta(l1—a’)=0; (f) y+ (cotz)y=a; 

(g) «mary’+y=9e% Ine; (h) (2—ysinax) dx =cosxdy; (i) (cosx)? y’ +y = tan. 
Use the Bernoulli method to solve the given differential equations. 

(a) sy’ =y+a7e*; (b) y+ 2ry = 42; 

(c) y'=(y—l)tanz; (d) (1+a)y'=ay+2°, 

(e) awlnzy’+y=a; (f) (l+a?)y’+(1—2?)y—2xre7* =0; 


(g) y =y/e+4e* Inez; (bh) y’ + 2ey/(1—2*) +42 =0 (|a| <1). 


. Solve the given initial value problems. 


(a) y'+2y=10, y(0)=8; (b) 27y'+2ey-2+1=0, y(1)=0; 
(c) y'=y+6a?, y(0)=-2; (d) 27 y'+4ay=2e, y(1)=1; 
(e) wy’ =yt2n?, y(5)=5; (f) wy’ +(e+2)y=2sinz, y(r)=1; 


(g) wy +2y=a2, y(3)=1; (h) ay’+2y=2sinz, y(r) =1; 
(i) y’ty=e", y(0)=1; (j) a? y'- dey = 4p, (1) 13 
(k) yty=e*, y(0)=1; (Q) yi t+y/(2°-1)=1+2, y(0)=1 


Find a continuous solution of the following initial value problems. 


(a) y'+2y=fe(x), y(0)=0; (b) yi t+y=fr(x), y(0) = 0; 
(c) y-y=felz), yO=1; (d) y'-2y=fe(z), y(0) =1; 
for each of the following functions (& = 1, 2): 


1, «<3, _ 1, z<il, 
f(e)={ 0, «>3; fate) = { 2-a, «>. 


Show that the integrating factor u(r) = K eS (2) de where K # 0 is an arbitrary constant, produces the same general 
solution. 


Find a general solution of a linear differential equation of the first-order when two solutions of this equation are given. 
Which nonhomogeneous linear ordinary differential equations of first-order are separable? 


One of the main contaminants of the nuclear accident at Chernobyl is strontium-90, which decays at a constant rate of 
approximately 2.47% per year. What percent of the original strontium-90 will still remain after 100 years? 


In a certain RL-series circuit, L = 1+? henries, R = 1 —t ohms, V(t) = t volts, and J(0) = 1 ampere. Compute the 
value of the current at any time. 


In a certain RL-series circuit, L = t henries, R = 2t + 1 ohms, V(t) = 4¢ volts, and J(1) = 2 amperes. Compute the 
value of the current at any time. 


Find the charge, q(t), t > 1, in a simple RC-series circuit with electromotive force F(t) = t volts. It is given that R =t 
ohms, C = (1+ t)~' farads, and q(1) = 1 coulomb. 
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13. 


14. 


15. 


16. 


Tt: 


18. 


19. 


20. 


21. 


22. 


23. 
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An electric circuit, consisting of a capacitor, resistor, and an electromotive force (see [14] for details) can be modeled 


by the differential equation Rq@ + a = E(t), where R (resistance) and C (capacitance) are constants, and q(t) is the 
amount of charge on the capacitor at time t. If we introduce new variables gq = CQ and t = RC7, then the differential 
equation for Q(T) becomes Q’ + Q = E(r). Assuming that the initial charge on the capacitor is zero, solve the initial 
value problems for Q(7) with the following piecewise electromotive forces. 


1, O<7 <2, NF 0<7<i1, 
(a) Br) = | 0, 2<T<o; (b) Br) = | 1/t, 1<T <0; 

0, O<T<2, Jr O<7r<il, 
(c) Br) ={ 9 2<T<@; (d) Bo) ={ 7 1<T<o. 


Solve the initial value problems and estimate the initial value a for which the solution transitions from one type of 
behavior to another (such a value of a is called the critical value). 


(a) y(0) =a; (b) y(0) =a. 

Solve the initial value problem and determine the critical value of a for which the solution behaves differently as t > 0. 
(a) Py +tt-2)y+3e", y)=a; (b) (sint)y+(cost)y=e%/", y(F) =a; 
(c) tyt+ty=tcost, y(m)=a; (d) yt+ytant=cost/(t+1), y(0)=a. 


Find the value of a for which the solution of the initial value problem y’ — 2y = 10 sin(4z), 
as t > oo. 


y’ —y =10 sin(3z), y’ — By = 5e?*, 


y(0) = a remains finite 
Solve the following problems by considering x as a dependent variable. 


dy oy. dy _y-y" dy  ¥ 
(=) dx x+y?’ (B) det’ a) dx 2r+1° 


Solve the initial value problem: y = ayt fy y(t) dt, y(0) =c, where a, b, and c are constants. Hint: Set k = i y(t) dt, 
solve for y in terms of k, and then determine k. 


CAS Problem. Consider the initial value problem 


xy +2y =sina, y(m/2) = 1. 


(a) Find its solution with a computer algebra system. 


(b) Graph the solution on the intervals 0 < x < 2,1 < a < 10, and 10 < x < 100. Describe the behavior of the 
solution near x = 0 and for large values of z. 


A tank contains 100 liters of pure water. Brine with 30 grams of salt per liter flows in at the rate of 2 liters per minute. 
The thoroughly stirred mixture then flows out at the rate of 3 liters per minute. 


(a) Find the amount of salt in the tank when the brine in it has been reduced to 50 liters. 

(b) When is the amount of salt in the tank greatest? 
Consider the ordinary differential equation y’ = e?”/2(y — xe?”). None of the methods developed in the preceding 
discussion can be used for solving this equation. However, by considering x as a function of y, it can be rewritten in 
the form dz/dy = —2x + 2ye~*", which is a linear equation. Find its general solution. 


Suppose that the moisture loss of a wet sponge in open air is directly proportional to the moisture content; that is, 
the rate of change of the moisture M, with respect to the time t, is proportional to M. Write an ODE modeling this 
situation. Solve for M(t) as a function of t given that M = 1,000 for t = 0 and that M = 100 when t = 10 hours. 


According to an absorption law*® derived by Lambert, the rate of change of the amount of light in a thin transparent 
layer with respect to the thickness h of the layer is proportional to the amount of light on the layer. Formulate and 
solve a differential equation describing this law. 


In each of Problems 14 through 19, determine the long-term behavior as t +» +00 of solutions to the given linear differential 


equation. 
14. y+ 2y —e-* sint; 16. y+ 2y =e” sint; 18. y+ 2ty = 2t; 
15. y+ 2y = e* cost; 17. ty + 2y = 4¢; 19. y+ 2ty=t+e. 


26 Johann Heinrich Lambert (1728-1777) was a famous French/Prussian scientist who published this law in 1760, although Pierre 
Bouguer discovered it earlier. By the way, Lambert also made the first systematic development of hyperbolic functions. A few years 
earlier they had been studied by the Italian mathematician and physicist Vincenzo Riccati (1707-1775). 
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2.6 Special Classes of Equations 


This section contains more advanced material, which discusses the issue of reducing given differential equations to 
simpler ones. We present several cases when such simplification is possible. Other methods are given in §4.1. 

When the solution of a differential equation is expressed by a formula involving one or more integrals, it is said 
that the equation is solvable?” by quadrature, and the corresponding formula is called a closed-form solution. So 
far, we have discussed methods that allow us to find (explicitly or implicitly) solutions of differential equations in a 
closed form. Only a few types of differential equations can be solved by quadrature. 


2.6.1 The Bernoulli Equation 


Certain nonlinear differential equations can be reduced to linear or separable forms. The most famous of these is 
the Bernoulli”? equation: 
y + p(x)y = g(x)y*, a is a real number. (2.6.1) 


If a =0 or a = 1, the equation is linear. Otherwise, it is nonlinear and it always has the equilibrium solution y = 0. 
We present two methods to solve Eq. (2.6.1): substitution (discovered by Leibniz in 1695) and the Bernoulli method. 
We start with the former. 

1. (Leibniz substitution) The Bernoulli equation (2.6.1) can be reduced to a linear one by setting 


u(x) = [y(x)]*~*. (2.6.2) 


We differentiate both sides and substitute y’ = gy* — py from Eq. (2.6.1) to obtain 


= (1—a)y “(g(x)y* — p(x)y) 


uo = (l-a)y %y’ 
(1 — a) (g(x) — p(x)y*~*). 


“=u, we get the linear differential equation for u: 


Since y!~ 
u’ + (1—a)p(a)u = (1 — a)g(z). 


2. (Bernoulli method) This equation (2.6.1) can also be solved using the Johann’s Bernoulli method (1697). 
Suppose there exists a solution of the form 


y(x) = u(x) v(2), (2.6.3) 


where u(a) is a solution of the linear homogeneous equation (which is also separable) 


u(t) +p(x)u(z)=0 => u(z) =exp {=f v(a)ac} 


Substituting y = wv leads to the separable equation with respect to v(x): 


Integration yields 


Thus, 
1/(1-a) 


y(x) = exp {-/ p(a) da \ lc +(1l-a) / g(x) u*—* (a) a] : | 
Example 2.6.1: (Logistic equation) A particular Bernoulli equation of the form 


yi —Ay=—-By? or y'=y(A- By), 


27The term quadrature has its origin in plane geometry. 
28 This equation was proposed for solution in 1695 by Jakob (=Jacobi = Jacques = James) Bernoulli (1654-1705), a Swiss mathematician 
from Basel, where he was a professor of mathematics until his death. 
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where A and B are positive constants, is called the Verhulst equation or the logistic equation. This equation 
has many applications in population models, which are discussed in detail in [14]. This is a Bernoulli equation with 
a = 2; hence, we reduce the logistic equation to the linear equation by setting u = y~'. Then we obtain 


u' + Au= B. 
Its general solution is u = C, e~4* + B/A, where C; is an arbitrary constant, so 
g 


0) = Gao eee 
WN) = (BTA) + O,e-4*  B+Ce-42’ 


where C' = AC; is an arbitrary constant. On the other hand, using the Bernoulli method, we seek a solution of the 
logistic equation as the product y = uv, where wu is a solution of the linear equation u’! = Au (so u = e4*) and v is 
the solution of the separable equation v’ = —Buv?. 


A similar equation y+ a(t) y = b(t) y? occurs in various problems in solid mechanics, where it is found to describe 
a propagation of acceleration waves in nonlinear elastic materials [7]. 

We can plot some selected solutions with Mathematica. 
curves = Flatten[Table[{A/(B + C*Exp[-A*x])}, {C, 1/4, 5/2, 1/2}]] 
Plot [Evaluate[curves], {x, 0, 4}, PlotRange -> A11] 


Figure 2.27: (a) Example 2.6.2 and (b) its solution, plotted with MATLAB. 


Example 2.6.2: Let us consider the curve y = y(x) that goes through the origin (see Fig. 2.27). Let the middle 
point of the normal to an arbitrary point on the curve and horizontal line (abscissa) belong to parabola y? = az. 
Find the equation of the curve. 

Solution. The slope of the tangent line to the curve is y/(x) at any point z. The slope of the normal line is 
k = —1/y'(x). With this in hand, we find the equation of the normal y = kx + b that goes through an arbitrary 
point A(z, y) on the curve and crosses the abscissa at B(X,0). Since X belongs to the normal, we obtain 


b 
X= = from the equation 0=kX +5. 


Substituting y — kx instead of b yields 


ene, 
x ke cr+y:y 


because k = —1/y’. The middle point C' of the segment AB has coordinates (X1,Y1) that are mean values of the 
corresponding coordinates A(x, y) and B(x + yy’,0), that is, 
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The coordinates of the point C satisfy the equation of the parabola y? = ax; thus, 


y\2 dy y 2x 
wa(Qem  Bopa 
2 a 7 dz 2a y 


This is a Bernoulli equation (2.6.1) with 


_ 


Setting u(x) = y'~° = y?, we obtain the linear differential equation with constant coefficients 


/ 
ae pie 
u (x) at x 


From Eq. (2.5.9), page 88, it follows that 
y?(a) = u(x) = 4ax + 4a? + Ce*/?. 
Since the curve goes through the origin, we have 0 = 4a2+C. Therefore, the required equation of the curve becomes 
y" (x) = 4ax + 4a? (1 = ev?) i 


To check our answer, we ask Mathematica: 

BerEq(x_, y_] := Dly[x], x] == -2*x*y[x]7(-1) + y[x]/(2*a) 
alpha = -1 

ypRule = Flatten[Solve[u’ [x] == D[y[x]*(1 - alpha), x], y’ [x]]] 
yRule = fy[x] -> ulx]*(1/(1 - alpha))} 

BerEg(x, y] /. ypRule /. yRule // Simplify 

term = PowerExpand[%/] 

NewLHS = Simplify[term[[1]]/u[x]*(alpha/(1 - alpha))] 

uRule = DSolve[NewLHS == 0, ulx], x] 

yRule /.uRule // Flatten 


Example 2.6.3: Solve xy’ + y= y? (20): 

Solution. This equation is not linear because it contains y?. It is, however, a Bernoulli differential equation of 
the form in Eq. (2.6.1), with p(x) = g(x) = 1/x and a = 2. Instead of making the Leibniz substitution suggested by 
(2.6.2), we apply the Bernoulli method by setting y(a) = u(x)v(a), where u(x) is a solution of the linear differential 


equation ru’ + u = 0. This is a separable (and exact) equation du/u = —da/x; hence, we may take u(x) = 1/2. 
Therefore, 
/ 
ya)= "9 ma yy = 2-9 ng. 
x : 2 


Substituting these formulas into the given differential equation, we get v'’ = x~?v?. This is again a separable equation 


with the general solution 
x 1 
SS d = ee 
ag ee 


with a constant of integration C. We check our answer using Mathematica: 

Urule = Flatten[DSolve[x u’ [x] + ulx] == 0, ulx], x]] /. C[1] -> 1 
Vrule = Flatten[DSolve[x u[x] v’ [x] == u[x]*2 v[x]*2 /. Urule, vix], x]] 
yl[x_] = ulx] v[x] /. Urule /. Vrule 


Example 2.6.4: Solve xy’ + y = y? Inz. 
Solution. This is a Bernoulli equation (2.6.1) with a = 2. By making the substitution (2.6.2) u(x) = [y(z)|~+, 
we get 
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an 
T 


Figure 2.28: Family of solutions from Example 2.6.4, plotted with Mathematica. 


2 


A direct substitution of y= u~! and y’ = —u~?u’ into the given differential equation yields 


U é 1 Ine m me 1 
a ———— r —_ U= Ing. 
wa) | ule) wa) = a 


ao. (=) =Ing or ac (=) = uy 


l ] d l 1 
wa f Shar =f inca(et) ==" - a ee 


x a cL 2 
Therefore, u(#) = Inz +1+4 Cz, and the solution of the given differential equation becomes 


1 
~ Ina t+14+C2’ 


y(z) 


Example 2.6.5: Solve the Bernoulli equation 
y +ay = y" elt, 


Solution. We use the Bernoulli substitution (2.6.3) by setting y(#) = u(a#)u(x), where u(a) = e~% is a solu- 
tion of the homogeneous linear differential equation u’ + au = 0. Substitution of the derivative y/(a) = (uv)! = 
—ae~** v(x) + e~* v'(x) into the equation leads to 


eo at v' (x) _ oot eo 20x v2 (x) or v= v2 eo-a)a 


This is a separable differential equation, and we have 


Integrating, we obtain 
1 fore taxed, 
—“£+C ifa=b. 


Therefore, 


zy=<4 ote 
ul) 7 ifa=b. 


yee at 
ao =cesper ifad, 
C-2 
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2.6.2 The Riccati Equation 


Let us consider a first-order differential equation in the normal form: y’ = f(x,y). If we approximate f(z, y), while 
x is kept constant, we get 
f(x,y) = Pla) + yQ(x) +’ Ra) ++. 
If we stop at y? and drop higher powers of y, we will have 
y' + v(a)y = g(x)y? + h(a). (2.6.4) 


This equation is called the Riccati?® equation. It occurs frequently in dynamic programming, continuous control 
processes, the Kolmogorov—Wiener theory of linear prediction, and many physical applications. The reciprocal of 
any solution of Eq. (2.6.4), v = 1/y, is a solution of the associated Riccati equation 


uv! — p(x)v = —g(z) — A(x)v?. (2.6.5) 


When h(x) = 0, we have a Bernoulli equation. The Riccati equation has much in common with linear differential 
equations; for example, it has no singular solution. Except special cases that we will discuss later, the Riccati 
equation cannot be solved analytically using elementary functions or quadratures, and the most common way to 
obtain its solution is to represent it in series. Moreover, the Riccati equation can be reduced to the second order 
linear differential equation by substitution: 


ue) =exp{ = f aleu(a)de} or y(e) =- 
Indeed, the derivatives of u(x) are 


1) =O {- a(a)y(2) ach =n ayule), 


u(x) = —g' (x) y(a)u(a) — g(x) y"(@) ua) — g(a)y(@) u(x). 


Since 
u’=—gyu and y!=—py+gy’ +h, 
we have 
ul (x) = —g'(x) y(x)u(x) — g(x) [—p(x)y(2) + g(x)y?(a) + h(a)Ju(x) + 9°(x)y?(x)u(z) 
and 
u(x) = —g'(x) y(a)u(x) + g(x)y(@)p(x)u(a) — g(x)h(x)u(a). 
Eliminating the product yu = —u’/g from the above equation, we obtain 
u" + a(x) u'(x) + B(x) u(x) = 0, (2.6.6) 
where (a) 
g (x 
a(z) = - , d(x) = g(ax)h(x). 2.6.7 
(x) = p(x) ae) (x) = g(x)h(«) (2.6.7) 
This linear second order homogeneous differential equation (2.6.6) may sometimes be easier to solve than the original 
one (2.6.4). 


Conversely, every linear homogeneous differential equation (2.6.6) with variable coefficients can be reduced to 
the Riccati equation 
y +y" +a(z)y + d(x) = 0 


u(x) = exp f y(2) ar} 


It is sometimes possible to find a solution of a Riccati equation by guessing. One may try such functions as 


y =ax® or y = ae’*, with undetermined constants a and b, or some other functions that the differential equation 


suggests. 


by the substitution 


?°Tn honor of the Italian mathematician Jacopo Francesco (Count) Riccati (1676-1754), who studied equations of the form y’ + by? = 


on™, 
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Without knowing a solution of the nonlinear differential equation (2.6.4), there is absolutely no chance of finding 
its general solution explicitly. Later we will consider some special cases when the Riccati equations are solvable in 
quadratures. Indeed, if one knows a solution of a Riccati equation y(x), it could be reduced to a Bernoulli equation 
by setting w = y— y. Then for w(x) we have the equation 


w! + p(x) w = g(x) w? + 29(x) p(x) w. 


In practice, one may also try the substitution y = y + 1/v (or v = (y — y)~'), which leads to the linear differential 
equation 


v' + [2p(2) g(x) — p(x)]v = —g(2). 
Example 2.6.6: Solve the second order linear differential equation with variable coefficients: 


2 


cu" + (2-2? —1)u' —2?u=0, oO. 


Solution. To reduce the given differential equation to a first-order one, we make the substitution y = u’/u and 
divide by x to obtain 


1 
++ (1-0-2) y—-—x=0. 


This Riccati equation has a solution y = x (which can be determined by trial). By setting y = x + 1/v, we get the 
linear differential equation for v: 
aul +(1—2—2")u-—2=0. 


—ax—x? /2 


Multiplying by an integrating factor p(a) = e , we reduce the equation to the exact equation: 


< [-(@) get -@ 12) = get 2/2, 


dx 


Integration yields 


dt 
v(x) xe7** /? = f re? "Pant => vr) = = g@t0"/2 / ze® 2 dz+C 


x 


and 


1 -1 
y(z) =2+—~=a24+n20e7-* 7? [eotParse : 
v(x) 


To check our answer, we type in Mathematica: 

Ricc[x_, y_] = y’ [x] + y[x]*2 + (1 - x - 1/x)*y[x] - x 
Expand[v[x]*2 Ricc[x, Function[t, t + 1/v[t]]]] 
DSolve[% == 0, vi[x], x] 


Example 2.6.7: Let us consider the differential equation 
y _ y" 4 x. 
The substitution u’(x) = exp[— f y(x) da] reduces this equation to u” + xu = 0. By setting y = —u’/u, we get 


, —u" ut (u')? 
a 
u 


Therefore, from the equation y’ = y? + x”, it follows that 
—u"u+ (u')? 1\ 2 
culut (wu) = me + x. 


Multiplication by u? yields 


—u" ut (uw)? = (vw)? 4+ 2? u? or —y" = 27 u. 
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The main property of a Riccati equation is that it has no singular solution as it follows from Picard’s Theorem 
1.3 (see §1.6 for details). Therefore, the initial value problem for Eq. (2.6.4) has a unique solution. By a linear 


transformation, the Riccati equation can be reduced to the canonical form 


y = ty? + R(x). 


(2.6.8) 


Theorem 2.4: Let R(x) be a periodic function with period T, R(a +7) = R(x). If the Riccati equation 


y’ = y? + R(x) has two different periodic solutions with period T, say yi (x) and yo(zx), then 


Je 
i [yi (x) + yo(x)] da = 0. 


For two solutions y;(#) and y2(x), we have 
Ww=y+Ra) and yy=y2+ Ra). 
Subtracting one equation from another, we have 


d 
PP (yr — y2) = Yi — Yd = (yr + 2) (1 — Y2)- 


Let u = y1 — ye; then the above equation can be rewritten as u! = (y, +y2)u or u-tdu = (y1 + y2) dz. Its integration 


leads to ji 
yi (x) — y2(x) = exp {/ [yi(t) + ya(t)] at [yi (0) — y2(0)]. 
Setting « = T, we obtain 


T 
yi(T) — y2(T) = exp 1 [yi(t) + yo(t)] a [yi (0) — yo(0)]. 


The functions y; (a) and y2(x) are periodic with period T; therefore, 


a a 
exp 1 [yi(t) + ye(t)] a =1, which leads to | [yi(t) + yo(t)] dt = 0. 
0 0 


Example 2.6.8: Integrate the Riccati equation 


y + y(y — 2a) = 2. 


Solution. It is easy to verify that y(x) = 2 is a solution of this differential equation. We set y(x) = 2a + u(x); 


then y’ = 2+ u’(«#) and we obtain 


u'(x)+(Qe+u)u(z)=0 or wu 4+2ru=—u?. 


This is a Bernoulli equation, so we seek a solution of the above equation in the form u = vw, where w is a solution 


of the homogeneous equation w’ + 27w = 0. Separating variables yields 


d 
cS = 2x dx 
w 
and its solution becomes In w = —x? +1InC. Hence, w = Ce-*. We set C = 1 and get u(x) = v(x) e-*. Then the 
function v(x) satisfies the separable equation 
7? = 2 —2Qa? 5 v! _ x? 
u(aje"” = —v* (x) e or ia 
Integration yields Ta) =—f e-®” dx + C; hence, v(x) = (eee Thus, 
a? og? 
e e 
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Example 2.6.9: Solve the Riccati equation 
y = e** + (1—2e*) y+y’. 
Solution. If we try y = ae”, we find that y(x) = e” is a solution. Setting y = y(x) + 1/v, we obtain the linear 
differential equation for v to be v' +v+1=0. Its general solution becomes 


1 


=Ce*-1., Hence, =e? + ——__ 
v(x) € ence y(x) =e fea 


is the general solution of the given Riccati equation. 


Generally speaking, it is impossible to find solutions of the Riccati equation using elementary functions, though 
there are many cases when this can happen. We mention several®° of them here. Except for these and a few other 
rare elementary cases, the Riccati equation is not integrable by any systematic method other than that of power 
series. 


1. The Riccati equation 
2 


gyrate & +240, 
x 22 
is reduced to a separable equation by setting y = v./z. 
2. The Riccati equation 
j an) c 
y=ay +r-ytay 
x x 
can also be simplified by substituting y = u/a. 


3. The equation 
yf =? — 2a? +30 
has a particular solution ya(z) = ax — 1/2. 


4. The Riccati equation 
y’ =ay* + bx? 


has the general solution of the form 


_, 2aXr ax 2axr is 


where C is an arbitrary constant and ) is a root of the quadratic equation aA? + A+ b = 0. 
5. The Special Riccati Equation is the equation of the form 
y’ = ay? + br, 


and can be integrated in closed form?! if and only if°? 


a ‘| is an integer. 
The special Riccati equation can be reduced to a linear differential equation u” + aba® u = 0 by substituting 
y(a) = —u’/(au), which in turn has a solution expressed through Bessel functions J, and Y, (see §4.9): 
Ch Ji /(2¢) (2 x!) + C2 Y1/(24) (2 x") ; if ab > 0, 


u(x) = Jax (2.6.9) 


Cr ijag (SE 21) + C2 Kiya (G2 2"), if ad <0, 


where q = 1+ a/2 = (2+a)/2. 
°0The reader could find many solutions of differential equations and Riccati equations in [37]. 


31 Here closed form means that. all integrations can be explicitly performed in terms of elementary functions. 
32This result was proved by the French mathematician Joseph Liouville (1809-1882) in 1841. 
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Remark. The function u(x) contains two arbitrary constants since it is the general solution of a second order 
linear differential equation. The function y(#) = —u’/(aw), however, depends only on one arbitrary constant C = 
Cy /C2 (C2 #0), the ratio of arbitrary constants C, and C2. 


Example 2.6.10: (Example 2.6.7 revisited) For the equation 
yaa ty’, 


it is impossible to find an exact solution expressed with elementary functions since a = 2 and the fraction a/(2a+4) = 
1/4 is not an integer. This equation has the general solution y(x) = —u’/u, where 


u(x) = Vx [Ci Ji a(x? /2) + Co¥1/4(27/2)] - 


The function u(2) is expressed via Bessel functions, J, /4(z) and Y,/4(z), which are presented in §4.9. Similarly, the 
Riccati equation 
y’ = 7? = y? 


has the general solution y = u’/u, which is expressed through modified Bessel functions: u(x) = /% [Ci 11 /4(2?/2) 
+ C2K 1 4(2?/2)] . 


Example 2.6.11: Solve the Riccati equation (see its direction field in Fig. 2.29) 
y = ay? +2°y — 207 +1. 


Solution. By inspection, the function y;(«) = x is a solution of this equation. After the substitution y = x + 1/u, 
we have u’ + 32?u = —2, which leads to the general solution 


u(x) =e"* (c = / ze” ar) 


Example 2.6.12: Solve the differential equation y’ + y? = 6 x~? (see its direction field in Fig. 2.30). 
Solution. This Riccati equation has a particular solution of the form y(#) = k/x. Substituting this expression 
into the differential equation, we obtain 
k ke 6 
=z or k?-k=6. 


vw gg? 


The quadratic equation has two solutions, kj = 3 and kg = —2, and we can pick either of them. For example, 
choosing the latter one, we set y(x) = u(x) — 2/a”. Substitution leads to 
2 4 4 6 4 
u(x) + = +7 (2) + — -—— u(x) = — or = u(x) — — u(x) + u? (x) = 0. 
ae? x 8 x? x 


This is a Bernoulli differential equation of the form of (2.6.1), page 95, with p(x) = —4/x, g(x) = —1, anda = 2. 
According to the Bernoulli method, we make the substitution u(x) = v(x) v(x), where y(z) is a solution of its linear 
part: y!(x) — 4y(x)/x = 0, so v(x) = x*. Thus, u(x) = x* v(x) and we obtain the separable differential equation for 
v(a): 


d 
v (2) +a4v2(24)=0 or - > = o* dex. 

Integration yields 

1 2 4. C ; (x) 5 

-=—+— r v(x“) = 

v 5 5 r+C 
Thus, 

Sat 2 3a — 20 
y(x) = = 


~ 2 +C 2 2(a>+C)’ 


Similarly, we can choose k; = 3 and set y(x) = u(x) + 3/x. Then we obtain the solution y(x) = Ake, which is 


equivalent to the previously obtained one because kK = —1/C is an arbitrary constant. 
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Example 2.6.13: The equation 


y’ = y" 4 a4 
can be integrated in elementary functions since a = —4 and a/(2a+4) = 1. We make the substitution y = u/2? to 
obtain 
u uw 1 j u2 
z= +; or US SU SS oR 


ye A 


eo <a 


Setting u = v — in this equation, we get v’ = (v? + 1)/x?. Separation of variables yields 


dv dz du = | G=-z+e 
ve+t1 2 = v+t1 vt : 


Thus, the general solution becomes 


Cx“ —-1 Cx“—-1 
arctan v = — and x” y(x) = tan —o. 
x x 

2 4 4 Retry yyrys 
HVA 

at MA VAAN 

| AW 

Figure 2.29: Example 2.6.11 (Mathematica). Figure 2.30: Example 2.6.12 (Mathematica). 


Example 2.6.14: (A rod bombarded by neutrons) Suppose that a thin rod is bombarded from its right end 
by a beam of neutrons, which can be assumed without any loss of generality to be of unit intensity. Considering only 
a simple version of the problem, we assume that the particles only move in one direction and that the rod is also in 
one dimension. On average, in each unit length along the rod a certain fraction p of the neutrons in the beam will 
interact with the atoms of the rod. As a result of such an interaction, this neutron is replaced by two neutrons, one 
moving to the right and one moving to the left. 

The neutrons moving to the right constitute a reflection beam, whose intensity we denote by y(a), where x is the 
rod length. For small h (which is an abbreviation of common notation Ax), consider the portion of the rod between 
x —h and «x. Since every neutron that interacts with the rod in this interval is replaced by a neutron moving to 
the left, the right-hand end of the portion of the rod from 0 to z — h is struck by a beam of intensity one. This 
produces a reflected beam of intensity y(a — h). Since the neutrons interact within the interval [x, 7 — hl], it causes 
an additional reflected beam of intensity ph, proportional to its size. In addition, because of interaction that occurs 
between the neutrons in either of these reflected beams, we have a contribution to y(a#) of magnitude ph + y(a#—h). 

Moreover, since some of the neutrons in each of the reflected beams just described will interact with the rod, 
each beam will give rise to a secondary beam moving to the left, which in turn will produce additional contributions 
to the reflected beam when it strikes the right-hand end of the portion of the rod between 0 and « —h. The 
intensities of these secondary beams are, respectively, ph y(a — h) and (ph)?, and they will contribute ph y?(a — h) 
and (ph)? y(a — h) to the reflected beam. Repetitions of this argument yield a third and ternary contributions, and 
so forth. 

To derive the equation, we neglect terms containing h? because of its small size, which results in the equation: 


y(x) = ph + y(a — h) + phy? (a — A). 
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Assuming that the function y(«) is differentiable, we approximate further y(x—h) along the tangent line: y(a—h) = 
y(x) —hy'(a). This substitution produces the equation 


y(x) = ph + y(x) — hy'(x) + ph(y — hy')’, 
which again is simplified to the Riccati equation 
y (x) = p+ py*(2), 


because of neglecting terms with h?. Solving the equation using the initial condition y(0) = 0 yields y(x) = tan(pz). 


—- 


0 r-h «& 
Figure 2.31: Example 2.6.14. 


2.6.3. Equations with the Dependent or Independent Variable Missing 


If the second order differential equation is of the form y” = f(x,y’), then substituting p = y’, p' = y” leads to the 
first-order equation of the form p’ = f(x,p). If this first-order equation can be solved for p, then y can be obtained 
by integrating y’ = p. In general, equations of the form 


aah cag 0 
can be reduced to the equation of the (n — k)-th order using the substitution y“”) = p. In particular, the equation 
y”) = F(a,y-), 


containing only two consecutive derivatives, can be reduced to a first-order equation by means of the transformation 
p= yr), 
Example 2.6.15: Compute the general solution of the differential equation 
yl”! Ss y! = 0. 
x 


Solution. Setting p= y", the equation becomes 
1 
p'——p=0, 
x 
which is separable. The general solution is seen to be p(x) = Cx (C is an arbitrary constant). Thus, y” = Cz. 


Integrating with respect to 2, we obtain y’ = Cx?/2+ Cy. Another integration yields the general solution y = 
Ci 2° + Cox + C3 with arbitrary constants C; = C/6, C2, and C3. 


Definition 2.9: A differential equation in which the independent variable is not explicitly present is called 


autonomous. 


We consider a particular case of the second order autonomous equation 


y=fy.9), 


where dot stands for derivative with respect to t. A solution to such an equation is still a function of the independent 
variable (which we denote by t), but the equation itself does not contain t. Therefore, the rate function f(y, y) is 
independent of t, and shifting a solution graph right or left along the abscissa will always produce another solution 
graph. Identifying the “velocity” y with the letter v, we get the first-order equation for it: 7 = 0 = f(y,v). Since 
this equation contains three variables (¢, y, and v), we use the chain rule: 
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Replacing # by v (du/dy) enables us to eliminate explicit use of t in the velocity equation + = f(y,v) to obtain 


du 
"ay = f(y,v): 


If we are able to solve this equation implicitly, ®(y, v,C,) = 0, with some constant C), then we can use the definition 
of v as v = dy/dt, and try to solve the first-order equation ®(y, y,C1) = 0, which is called a first integral of the 
given equation 7 = f(y, y). In some lucky cases, the first integral can be integrated further to produce a solution of 
the given equation. The following examples clarify this procedure. 


Example 2.6.16: Solve the equation 
oe +\2 
yi —(y) =9. 
Solution. Setting y = v and y = v(dv/dy), we get the separable differential equation 


The general solution of the latter equation is v = Cy or dy/dt = Cyy. This is also a separable equation and we 
obtain the general solution by integration: In |y| = Cit + InC2 or y = C2 exp(Cit). 


Example 2.6.17: Solve the initial value problem 


y=2yy, y(O)=1, y(0) =2. 


Solution. Setting v = y, we get the first-order equation 
v— = 2yv => dv = 2ydy. 


Integration yields the first integral: 


you=ytt) or after separation of variables 


because v(0) = y(0)? + C; = 1+ C = 2. Integration gives 
arctany =t+C = y(t) = tan(t + C2). 


The value of the constant C2 is determined from the first initial condition y(0) = tan(C2) = 1, so Cp = 7/4, and we 
get the solution 


y(t) = tan (t + “) ; 


Another type of autonomous equations constitute second order equations where both the independent variable 
and velocity are missing: 


i= Ff): (2.6.10) 


If antiderivative F(y) of the function f(y) is known (see §9.4), so dF'/dy = f(y), then the above equation can be 
reduced to the first-order equation by multiplying both sides by y: 


1 def" 


i-9=fO)-9 5 = SFU. (2.6.11) 


Then the first integral becomes 


(yy =2FQy) tC, <= ga=tV2FQ)4G. 


Since the latter is an autonomous first-order equation, we separate variables and integrate: 


dy 
2 Fy) + C} 


+dt = +t+C. 


dy _ 
/ PFO. — 
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It should be noted that while the autonomous equation (2.6.10) may have a unique solution, Eq. (2.6.11) may not. 
For instance, the initial value problem 


y+y=O0, y(0)=0, y(0)=1 


can be reduced to (y)? + y* = 1, y(0) = 0. However, the latter has at least three distinct solutions: 


. _ sin t, forO<t<§, 
. sint, for0<t< > T 
yi = sint; y2 = A for 2 <tc y3 = 41, for 5 <t<T, 
; or = loon 
2* cos(t—T), for T <t<ov. 


Example 2.6.18: (Pendulum) Consider the pendulum equation (see also Example 9.4.1 on page 515) 
6+w sind = 0, 


where @ is the angle of deviation from the downward vertical position, 6=d70 /dt? , w? =g/¢, is the length of the 
pendulum, and g is acceleration due to gravity. Multiplying both sides of the pendulum equation by d@/dt, we get 


d79 d@ 4. .dé 1d/do\* 4d 
ae a” ea = —= sa() =w* — cos. 


This leads to the first integral: 


2 
(F) = 2w* cos6 + Cy — ay = 2/5 epee FOL, 


dt 


dé 
where the value of constant C; is determined by the initial conditions — = + /V2w?cosé + C1 ; 
t=0 


t=0 =o 
and the sign + depends to what side (left or right) the bob of the pendulum was moved initially. Next integration 
yields 


dé - ‘ i dr 
J2u2cosO0+C, dg, W/2w? cost + Cy 
because initially at t = 0 the pendulum was in position #9. Unfortunately, the integral in the right-hand side cannot 
be expressed through elementary functions, but can be reduced to the incomplete elliptic integral of the first kind: 


? dt 
F(y,k =} ee 
(ek) 0 1—k? sin? t 


2.6.4 Equations Homogeneous with Respect to Their Dependent Variable 
Suppose that in the differential equation 
F(z,y,y',y",---,y™) =0 (2.6.12) 


the function F(x, y,p,...,q) is homogeneous with respect to all variables except x, namely, F(x, ky,kp,...,kq) = 
k’ F(a,y,p,---,q). The order of Eq. (2.6.12) can then be reduced by the substitution 


y = exp { fo) ar} or u= 7 (2.6.13) 


where v is an unknown dependent variable. 
The same substitution reduces the order of the differential equation of the form 


y" = flayy) 


with homogeneous function f(x,y, y’) of degree one in y and y’, that is, f(x, ky, ky’) =kf(ax,y,y’). If the function 
F(a,y,y,y",.-.,y™) in Eq. (2.6.12) is homogeneous in the extended sense, namely, 


Py Oe gk ES Fata ae a), 
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then we make the substitution x = e’, y = ze”, where z = z(t) and ¢ are new dependent and independent variables, 
respectively. Such a substitution leads to a differential equation for z that does not contain an independent variable. 


Calculations show that 
, dy dydt dy 1 =dyl dy _, 


Yao at ds dt dt ef ae” 


With this in hand, we differentiate y = ze” with respect to t to obtain 


d?y d (dz 
yo Ee tN (se (m—1)t 
da? dt (F +mz) . 

d?z dz 2 et) 


Substituting the first and second derivatives into the second order differential equation 
PU, Y, y', y") > 0, we obtain 


F (c’, ze™, (z' + mz) gia de (2” + 2mz' +m?) elma) =0, 


or 
ec F (1, 2, (2’ +mz), (2” + 2mz' + m?)) =0. 


Example 2.6.19: Reduce the following nonlinear differential equation of the second order to an equation of the 
first order: 


a2 y" = (y’)? at x. 


Solution. To find the degree of homogeneity of the function F(x, y',y”) = x?y” — (y’)? — x7, we equate it to 


F(ka,k™~1ty’, k™~2y’"), which leads to the algebraic equation m = 2m — 2 = 2. Therefore, m = 2 and we make the 
substitution y = ze?! that leads to the equation 


zg" + A2! + de = (2! + 22)? +1. 


This is an equation without an independent variable, so we set p= z’ and z” = pp’. Thus, we obtain the first-order 
differential equation with respect to p: 


pp +4p4 4z = (p+ 2z)? +1. 


Example 2.6.20: Find the general solution of the differential equation 
yy” — (y')? = bay’. 


Solution. Computing the derivatives of the function (2.6.13), we have 


7 Hes { fo) a} .. iy eee { foe ax} Sep { fo) a} 


Substituting into the given equation, we obtain v’ = 6x. Hence v(x) = 3x? + C; and the general solution becomes 


y(x) = Cy exp { [130 +C] a} = Cy exp eg + Cia} ; 
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2.6.5 Equations Solvable for a Variable 


We present another method of attacking first-order equations when it is possible to solve for a variable. Suppose 
that the equation F(z, y, y’) = 0 is solved for y, it is then in the form y = ®(z, y’). Setting p = y’ and differentiating 
the equation y = ®(a, p) with respect to x, we obtain 
; O® dp O® 
Y =P> 5. a0 Tb an 
Op dx Ox 
which contains only y’ = p, x, and dp/da, and is a first-order differential equation in p = y’. If it is possible to 


solve this equation for p, then we can integrate it further to obtain a solution. We demonstrate this approach in the 
following example. 


Example 2.6.21: Consider the equation y’ = (x + y)?, which, when solved for y, gives 


To 23 dp 
y= Sy -t=\Vp-c => ae a tN 


Separating variables yields 


a dx y’ =p=tanh?(C +2). 


2,/p(p + 1) 


Since y' = p = (4 + y)?, we get the general solution to be (a + y)? = tanh?(C + 2). 


A similar method is possible by reducing the equation F(z, y, y’) = 0 to the form x = ®(z,y’). Differentiating, 
we obtain the equation 


O® O® dy’ 
l= —7/ 4+ — zee 
Oy Oy’ da 
which contains only the variables y and p = y’, and the derivative dy! and which may be written as dy! = sp = o. 


That is, the process above has replaced 2 with a new variable, p = y’. If it’s possible to solve it for p and then 
integrate, we may get the general solution. 


Example 2.6.22: Reconsider the equation y’ = (x + y)?, which, when solved for , gives 


1 d 
x= /p—y, or after differentiation, 1= =p a 


2 dy 


Since the integration by separation of variables is almost the same as in Example 2.6.21, we omit the details. 


1. Solve the following Bernoulli equations. 


(a) y! + 2y = dy"; (b) (1+a7)y'—2ay=2/y(1+27); (ce) y'—2y=y7?; 
(d) 42° y/ =y(y* +527); (e) y'ty=e*y’; (f)  2ay’+y=y7(2° +2"); 
(g) y' —4y = 2y’; (h) 2ry'’—yt+y? sinx =0; (i) 2? y' + ary? = 0; 
Gj) wy =ay+y?; (k) yo ty=ey’; (Il) y! +y/2 = (1 — 2a) y*/2; 
(m) yy’ =a-ay"; (n)  (w@—1)y! — 2y = Vy(a? — 1); (0) 2?y' —ay=y’. 
2. Find the particular solution required. 
(a2) y= SE y-2y°, yA) =1 (b) ay’ +y?=2y; y(1) =4; 
(c) yf +2y/e=27y?, y(l)=h (d) ay’ t+ty=aty*®, y(1)=-1; 
3 
(e-) y+#ootr=2., y(n/2)=1; (ff) ov -Zy= RB, y= 
3. Using the Riccati substitution u’ = yu, reduce each second order linear differential equation to a Bernoulli equation. 
(a) u’+9u=0; (b) uw’ +2u'’+u=0; (c) wu” —2u' +5u=0; 


(d) wu’ —4u'+4u=0; (e) u’—5u’+6u=0; (f) wu’ —2u’+u=0. 
4. Reduce the given Riccati equations to linear differential equations of the second order. 
(a) y' = y = («4 1)?y? } CE SUEE (b) wy’ =ycotat+ y? sin 7 + (sina)~?; 
(c) y= 248 yta?y? +a (d) y+ (244) y=y?/x4+ 5a; 
(e) y+ 2y=y? e+ 4c; (f) y+ (2-32)ysa?y?+a-%. 
5. By the use of a Riccati equation, find the general solution of the following differential equations with variable coefficients 
of the second order. 
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10. 


11. 


12. 


13. 


14. 
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(a) uw” — 2a? u! — 4cu = 0. Hint: y! + y? — 2a? y — 4a = 0, try y = 22”. 


(b) wu” — (24+. 4) w —27e**u=0. Hint: y’ +y? — (2+ 4) y—a’e** =0, try y= are 


(c) u” + (tana — 2x cosa) u! + (a? cos? « — cosa)u = 0. Hint: y’ + y? + (tana — 2x cos 2) y + (x? cos? a — cos x) = 0, 
try y= zx cosz. 


(d) u” — 3u’ — e®*u=0. Hint: y’ + y? — 3y — e&” = 0, try y = e** 
(e) wu’ —4u'-(14+27In?a)u=0. Hint: y+ y’ eae In? x) = 0, try y=aInz. 


(f) incu” —u'—acln?cu=0. Hint: y’ +y? — 4 —-In’? x =0, try y=Inz. 
(g) xu” +(1—sinz)u’ —coscu=0. Hint: y! +y? + -8*y — SS =0, try y= BE. 


(h) u’ —-4u'+4u=0. Hint: y +y? -Sy+4 =0, try y=20” 


Given a particular solution, solve each of the following Riccati ee 
(a) yf +ay=y? +2(1—2*), yi(e) = 20; (b) ay! — 3y ty? = de? + der, yn (a) = 
(c) yay P+ ae — at, yi(x) = 2; (d) yo =05y° 40.5077, y(z) = ie 
(e-) yty+#-3=0, wle)=2/e; (f) yt 2y=y?+2—-27, w(x) =(e+1). 
Find a particular solution in the form y = kz of the following Riccati equation and then determine the general solution. 
(a) yf =14324+4; (b) y+2yy—2)=1; (©) yo = 40% +y/e- cy’. 
Find a particular solution of each Riccati equation satisfying the given condition. 


(a) y'+y?=4, y(0) =6. (b) y= 4-24 +2, y(l)=1. Ty y= 20. 
(c) y'+y?=1, y(0) =-1. (4) y = 4 -y/2? —2/x?, y(3) =7/2. Try y= 2. 
(e) ty ty —2=0, y(0)=4. (£) ny cay (1-22) y-+2—1 y(1) =3. Try y=1-2/z. 


(g) syy’—y?+a°=0, y(1)=2. (h) yey. +y—e~*, y(0) =0. Try y=e™*. 
(i) Jat hos y1=0 Gj) Y="-By-Bate y2)=1. 
t), y(0) =a. Prove that |y(t)| is bounded for t > 0 if |a| and 


maxzso |g(t)| are sufficiently small. 


Prove that a linear fractional function in an arbitrary constant C, 


_ Cvr(z) + p2(x) 
a aor INES 


is the general solution of the Riccati equation, provided that yi(x), yo(x), #1(x), and 2(x) are some smooth functions. 


Let y1, y2, and ys be three particular solutions of the Riccati equation (2.6.4). Prove that the general solution of 
Eq. (2.6.4) is 
Ca¥ Br 
yYy-Y1 Y3— y2 


Solve the equations with the dependent variable missing. 


(a) ay" ty! =a; (b) zy" +y! = f(a); (c) ay! + 2y! = da; 
(d) y+ y!=4 sinha; (ec) wy" =(L4a*)(y'—1); (fF) oy” + aly’) = 05 
(g) vy" +aly’)?+y'=0; (ha) 9" + (2y')® = 2y'; (i) dy" — (y')? +4 = 05 
G) yy” —-y" =1; (k) yr — 2y" = 0; (1) 2a* y" + (y')” — 2ay’ = 0. 
olve the equations with the independent variable missing. 
Solve th h the ind d ie 
(a) 2y" + 3y? = 0; (b)  y” +w?y =0; (c)  y" —w*y =0; 
(d) yy" ty + yy’)? = 0; Ce) y”—2y")? = 0; (FY (+ yyy" = (897 + y's 
(g) yy + 2(y ')? = 0; (bh) y”+2y(y')?=0; (i) 4y +y=0; : 
G) yy” =y? + (y')?; (k) yy" = (y')?s Q) yy” +(y')” = 2y? (y’)’. 
Reduce the order of the differential equations with homogeneous functions and solve them. 
(a) yy” =(y')? + 28y? Va; ; (b) yy" — v= (2? + 1 yy’; 
c ip uw (a 4 a : d a : 
( ) (yy (y) ) ee a # ” is 1\2 21/3 
ie), oy =A) eee (f) yy = (yy + 28y" x 
(ge) syy'—s yy = ws (bh) vy = ay)" 
(i) ayy" aly’) = (2 + Dw Gi) yy" = WY 
(k) wyy"=(y 


= a (1) yy” = yy! + (y')? 2/2; 


(y— xy 
(m) y(zy"+y)=(y'P(@-1); (mn) ogy” = (nF ley’). 
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2.7 Qualitative Analysis 


In the previous sections, we concentrated our attention on developing tools that allow us to solve certain types of 
differential equations (§§2.1—2.6) and discussed existence and uniqueness theorems (§1.6). However, most nonlinear 
differential equations cannot be explicitly integrated; instead, they define functions as solutions to these equations. 
Even when solutions to differential equations are found (usually in implicit form), it is not easy to extract valuable 
information about their behavior. 

This conclusion pushes us in another direction: the development of qualitative methods that provide a general 
idea of how solutions to given equations behave, how solutions depend on initial conditions, whether there are 
curves that attract or repel solutions, whether there are vertical, horizontal, or oblique asymptotes, determination 
of periodic solutions, and any bounds. Realizing that qualitative methods fall into a special course on differential 
equations, we nevertheless motivate the reader to look deeper into this subject by exploring some tools and examples. 
More information can be found in [2, 24]. We start with a useful definition. 


Definition 2.10: The nullclines of the differential equation y’ = f(x,y) are curves of zero inclination that are 


defined as solutions of the equation f(x,y) = 0. 


Nullclines are usually not solutions of the differential equation y’ = f(x,y) unless they are constants (see Def- 
inition 2.1 on page 39). The curves of zero inclination divide the xy-plane into regions where solutions either rise 
or fall. The solution curves may cross nullclines only when their slope is zero. This means that critical points of 
solution curves are points of their intersection with nullclines. We clarify the properties of nullclines in the following 
four examples. 
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Figure 2.32: Example 2.7.2: Direction fields, nullclines, and streamlines for (a) y’ = y? — 2? and (b) y’ = y? —a, 
plotted with Maple. Points where solution curves intersect with nullclines are their critical points (local maxima or 
minima). 


Example 2.7.1: Consider the linear differential equation ay + ty = b with some nonzero constants a and b. Using 
the integrating factor method, we find its general solution (see Eq. (2.5.9) on page 88) to be 


t 
y = CeB/2a) 4 © 6 /(20) y eI 020) as, 
a 0 


where C is an arbitrary constant. The first term, C’ ent / 28) decays exponentially for any value of C 4 0 ifa> 0. 
The second term can be viewed as the ratio of two functions: 


t 

t 

def ae FHag | o8?/(28) dg = f(t) 
0 


oes g(t)’ 
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where f(t) = 2 fe e* /(2) ds and g(t) =e’ /@%), To find the limit of yp(t) when t + +00, we apply l’Hépital’s rule: 


lim yp(t) = 


se tay ee ee Ee he 2 P1020) [Eo /20 mee 
t—>-+00 t—-Foo g(t) t—>-+00 g'(t) t—>+00 2a t 


—e 
a 
Therefore, every solution of the given linear differential equation approaches the nullcline y = b/t as t > +00. 


Example 2.7.2: Let us consider two Riccati equations that, as we know from §2.6.2, have no singular solutions. 
The equation y’ = y? — x? has two nullclines y = + that are not solutions of the given equation. When |y| < |z|, the 
slope is negative and the solutions decline in this region. In contrast, the solutions increase in the region |y| > |2]. 
As can be seen from Figure 2.32(a), both nullclines y = +a are semistable because half of them attract solutions 
whereas the other half repel solutions. 

Consider another differential equation y’ = y? — x, which also cannot be integrated using elementary functions 
(neither explicitly nor implicitly). It has two nullclines y = ,/z and y = —,/Z that are not solutions of the given 
differential equation. Slope fields and some plotted streamlines presented in Figure 2.32(b) strongly indicate that 
there are two types of solutions: those that approach oo for large x, and those that follow the nullcline y = —./z 
and ultimately go to —oo. Moreover, these two types of solutions are separated by an exceptional solution, which 


can be estimated from the graphs. The same observation is valid for the equation y’ = y? — 2”. 


Example 2.7.3: Consider a Bernoulli equation t?4 = y(y? — 2t). Its general solution is y = ¢(t) = Vt/V/24+ Ct. 
The equation has the equilibrium solution y = 0 and two nullclines y = +V2t. If C 4 0, solutions approach the 
critical point y = 0 as t+ +oo. If C = 0, then the solution ¢(t) = \/t/2 follows the nullcline y = V2t as t > +00. 
The following MATLAB codes allow us to plot the solutions. 

s=dsolve(’t*2*Dy = y~3- 2*t*y’) | /* invoke Maple’s solver */ 


for cval=0:5 hold on /* assign 6 integer values to cval */ 
ezplot(subs(s(2), ’C11’, cval)); /* C11 is a constant from dsolve */ 
hold off end 


axis([0 5,0 2 ]); 
title(’solution to t*2*Dy = y~3- 2*t*y’); 
xlabel(’t’); ylabel(’y’); 

While Examples 2.7.1 and 2.7.2 may give an impression that solutions either approach or repel the nullclines, 
generally speaking, they do not. There could be many differential equations with distinct solutions that share the 
same nullclines. As an example, we recommend plotting direction fields with some solutions for the differential 
equations y! = (y? — 27)/(a4 +1) or y’ = (y? — x)/(x? + 1), which have the same nullclines as the differential 
equations in Example 2.7.2. Nevertheless, nullclines provide useful information about the behavior of solutions: 
they identify domains where solutions go down or up, or where solutions attain their local maximum or minimum. 
For instance, solutions for the differential equation y’ = y? — x decrease inside the parabola y? — x = 0 because their 
slopes are negative. Outside this parabola all solutions increase. Similar observations can be made for the equation 
y’ = y’? — 27: solutions decrease inside the region |y| < |z}. 

The first partial derivative f, = Of /Oy of the rate function f(x,y) gives more information about disintegration 
of solutions. Solutions to y’ = f(,y) pull apart when f, is large and positive; on the other hand, solutions tend to 
stay together when f, is close to zero. As an example, consider the equation y’ = f(x,y) with the slope function 
f =y* —2”. Its partial derivative f, = 2y is positive when y > 0, so solutions fly apart in the upper half plane, and 
the nullclines y = +a cannot be stable in this region. 


Example 2.7.4: Let us consider the differential equation y’ = f(x,y), where f(x,y) = (y—1)(y— sina). The 
horizontal line y = 1 in the xy-plane is its equilibrium solution, but the nullcline y = sinz is not a solution. The 
former is unstable for y > 1; however, when y < 1, it is neither attractive nor repulsive. Since f(x,y) contains the 
periodic function sin x, the slope field is also periodic in x. 

None of the solutions can cross the equilibrium point y = 1 because the given equation is a Riccati equation that 
has no singular solution. The solutions do not approach the nullcline y = sinz, but “follow” it in the sense that 
they have the same period. 


Drawing a slope field for a particular differential equation may provide some additional information about its 
solutions. Sometimes certain types of symmetry are evident in the direction field. If the function f(z, y) is odd with 
respect to the dependent variable y, f(«,—y) = —f(z,y), then solutions of the differential equation y’ = f(z, y) 
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1/((25 19 + 2)(5 t))/2 


Figure 2.34: Example 2.7.4. Black lines are so- 


Figure 2.33: Example 2.7.3: solutions to the equa- lutions, and blue ones are nullclines (plotted with 
tion t?y’ = y? — 2ty, plotted in MATLAB. Maple). 

are symmetric with respect to abscissa y = 0. So if y(x) is a solution, then z(a) = —y(z) is also a solution to the 
equation y’ = f(x,y). Similarly, if f(z,y) = —f(—2,y), then solutions are symmetric with respect to the vertical 


y-axis (see Fig. 2.3, page 43). 

Also, we have already discovered that solutions of autonomous differential equations are symmetric under trans- 
lations in the x direction. In general, a differential equation may possess other symmetries. Unfortunately, we cannot 
afford this topic for presentation, but we refer the reader to [3]. 

Many solutions to differential equations have inflection points, where the second derivative is zero: y” = 0. To 
find y’’, you need to differentiate f(x,y) with respect to « and substitute f(x,y) for y’. This expression for y’’ gives 
information on the concavity of solutions. For example, differentiating both sides of the equation y’ = y? — 1 with 
respect to x, we obtain y” = 2yy’ = 2y(y? — 1). Therefore, solutions to the given differential equation have three 
inflection points (which are actually horizontal lines): y = 0 and y = +1. 

As we will see in Chapter 3, numerical methods usually provide useful information about solutions to the differ- 
ential equations in the “short” range, not too far from the starting point. This leads us to the next question: “how 
do solutions behave in the long range (as x — +00)?” More precisely, “what are the asymptotes (if any) of the 
solutions?” Sometimes it is not always the right question because a solution may fly up in finite time, that is, has a 
vertical asymptote. We can answer these questions in a positive way at least for solvable differential equations (see 
§2.1 — 2.6). For instance, the following statement assures the existence of vertical asymptotes for a certain class of 
autonomous equations. 


Corollary 2.1: Nonzero solutions to the autonomous differential equation y’ = Ay® (A is a constant) have 
vertical asymptotes for a > 1. 


However, what are we supposed to do when our differential equation is not autonomous, or if it is autonomous, 
but is not of the form y’ = A y*? The natural answer is to use estimates, which are based on the following Chaplygin 


33 inequality. 


Theorem 2.5: [Chaplygin] Suppose the functions u(a) and u(x) satisfy the differential inequalities 

u(x) — f(x,u) > 0, u (x) — f(z,v) <0 
on some closed interval [a,b]. If u(ao) = v(%o) = yo, where x € [a,b], then the solution y(x) of the differential 
equation y’ = f(z, y) that passes through the point (xo, yo) is bounded by u and v; that is, v(x) < y(x) < u(a) 


on the interval [a, 6]. The functions u and v are called the upper fence and the lower fence, respectively, for 
the solution y(x). 


33The Russian mathematician Sergey Alexeyevich Chaplygin (1869-1942) proved this inequality in 1919. 
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Let y = ¢(a) be a solution for the differential equation y’ = f(x,y) and v(x) be its lower fence on the interval 
|a, b|; then the lower fence pushes solutions up. Similarly, the upper fence pushes solutions down. 


Example 2.7.5: Consider the Riccati equation y’ = y? + x7. On the semi-open interval [0,1), the slope function 
f(x,y) =2? + y? can be estimated as 


Pc ty<y+1 (0<a<1) 


because y = 0 cannot be a solution of the given equation (check it). If a(x) is a solution to the equation a’ (x) = 2”, 


so a(x) = x3/34+ C, then a(z) is a lower fence for the given differential equation on the interval (0,1). Hence, every 
solution of y! = x? + y? grows faster than «3/3 + C on (0,1). To be more specific, consider a particular solution of 
the above equation that goes through the origin, y(0) = 0. To find its formula, we use Maple: 


deqi :=dsolve ({diff (y(x) ,x)=x*x+ty (x) *y (x) ,y(0)=0},,y(x)); 


which gives the solution 


Y_3/4 (=) — J_3y4 =) 
d(x) = & —— nn 
Sijs () — Viva (F) 


where J,(z) and Y,(z) are Bessel functions (see §4.9). Its lower fence is, for instance, the function a(x) = 23/3—0.1, 
which is the solution to the initial value problem a’ = x”, a(0) = —0.1. To find an upper fence on the interval 
(0,1), we solve the initial value problem 8’ = 6? 41, 6(0) = 0.1, which gives B(x) = tan(x + c), where c = 
tan(0.1) * 0.1003346721. Since at the origin we have estimates a(0) = —0.1 < 0 = ¢(0) < 0.1 = §(0), we expect 
the inequalities a(x) < ¢(a) < B(x) to hold for all x € [0,1]. Therefore, the solution y = ¢(a) to the initial value 
problem (y’ = y? + x”, y(0) = 0) is a continuous function on the interval [0, 1]. 

To find a lower estimate for z > 1, we observe that y? + 1 < y? +2? (1 <x < oo). Since we need the value of 
the solution ¢(x) at z = 1, we use Maple again: 
phi:=dsolve ({diff (y (x) ,x)=x*xty (x) *y(x),y(0)=0},y(x) ,numeric); phi(1) 
This provides the required information: (1) ~ 0.3502319798054. Actually, this value was obtained by solving 
the IVP numerically—we do not need to know the exact formula (2.7.1). To find the lower fence in the interval 
x > 1, we solve the initial value problem: a/ = a? + 1, a(1) = 0.35. This gives us a(x) = tan(x + C), where 
C = arctan(0.35) — 1 & —0.6633. At the point « = 1, we have the inequality 0.35 = a(1) < (1) = 0.35023. Since 
the tangent function has vertical asymptotes, the lower fence a(x) = tan(x + C) blows up at + C = 3, when 
x = 5—C #& 2.2341508. Therefore, we expect that the solution ¢(x) has a vertical asymptote somewhere before 


2 
x < 2.234. Indeed, the solution y = ¢(x) has a vertical asymptote around x ~ 2.003147359—the first positive root 


(2.7.1) 


of the transcendent equation J;/4 (=) = Visa (+). To plot fences for the given problem (see Fig. 2.35), we type 

in Maple 

cm:=arctan(0.35)-1.0; 

ym:=plot(tan(x+cm) ,x=1..2.1,color=blue) ; 

dd:=DEplot (deq1 ,y(x) ,x=0..2,y=0..2, [y(0)=0] ,dirgrid=[16,16], 
color=blue, linecolor=black) ; 

display (ym,dd) 


Let us consider the differential equation y’ = f(x,y) under two different initial conditions y(a%p) = yoi and 
y(%o) = Yoo. If yor is close to yo2, do the corresponding solutions stay close to each other? This question is of 
practical importance because, in mathematical modeling, the initial condition is usually determined experimentally 
and is subject to experimental error. In other words, we are interested in how close the solution to the initial 
value problem with incorrect initial data will be to the exact solution. A similar problem is important in numerical 
calculations and is discussed in Example 3.4.2, page 171. Actually, the fundamental inequality (1.6.12) provides us 
with useful information. 


Corollary 2.2: Suppose the function f(a, y) satisfies the conditions of Picard Theorem 1.3 and that y;(a) and 
y2(x) are solutions to the equation y’ = f(x,y) subject to the initial conditions y(xo) = yo: and y(xo) = Yoo, 
respectively. Then 

lyr (x) — yo(a)| < |yor — Yoo] e*!*- 71, (2.7.2) 


where L is the Lipschitz constant for the slope function. <J 
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Figure 2.35: Example 2.7.5: direction field along with (a) lower and upper fences for the solution on the interval 
(0,1) and (b) lower fence on the interval (1,2), plotted with Maple. 


A similar result is valid when the initial values vp and yo in the initial condition y(ao) = yo are continuous 
functions of a parameter. 

As follows from the inequality (2.7.2), the Lipschitz constant or f, = Of /Oy evaluated at a particular point (if 
it exists) measures the stability of solutions. If f, is small, then error in initial conditions is less critical; however, 
if |f,| is large, then solutions fly apart exponentially. However, inequality (2.7.2) only provides an upper bound to 
the difference between solutions. This worst case scenario may not always occur. 

As we learned from the numerous examples, many solutions to the given differential equations belong to different 
classes that behave in similar ways. Some of them may tend to infinity, some asymptotically approach a certain 
curve, and some may approach a particular value. Classes of solutions which behave similarly are often separated 
by solutions with exceptional behavior. For instance, the differential equation y’ = y(y? — 9) considered in Exam- 
ple 2.7.15, page 120, has two unstable critical points: y = +3. Such equilibrium solutions are exceptional solutions 
from which all solutions tend to go away. If the initial conditions are close to these two unstable critical points, the 
perturbed solutions may fly away in different directions exponentially, as inequality (2.7.2) predicts. 


Example 2.7.6: Consider the initial value problem for the following linear differential equation 
y =yt3-t, y(0) =a. 
The general solution to the above equation is known to be (see §2.5) 
y(t) =Ce'+t—2, 


where C is an arbitrary constant. This is an exponentially growing function approaching +oo if C > 0 and —oo if 
C <0. However, when C = 0, the solution grows only linearly as t + oo. Therefore, ¢(t) = t — 2 is an exceptional 
solution that separates solutions that satisfy the initial condition y(0) = a > —2 from solutions that satisfy the 
initial condition y(0) = a < —2. Therefore, this exceptional solution ¢(t) is called a separatrix. 


Example 2.7.7: Consider the differential equation 
y =sin(rzy). 


Since the slope function is an odd function, it is sufficient to consider solutions in the positive quadrant (x,y > 0). 
The nullclines zy = k (k = 1,2,...), shown in Fig. 2.36, divide the first quadrant into regions where solutions to the 
equation y’ = sin(7ay) are eventually either increasing or decreasing once they reach the domain y < x/7 (marked 
with a dashed line). Moreover, a simple calculus argument shows that, once a solution enters a decreasing region, it 
cannot get out provided the solution enters to the right of the line y = 2/7. As seen from Fig. 2.37, solutions can 
be very sensitive to the initial conditions. The given differential equation has a stable equilibrium solution y = 0. 
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Figure 2.37: Two solutions to the equation y’ = 
Figure 2.36: Example 2.7.7. Nullclines (in black) sin(7xy) subject to the initial conditions y(0) = 
and solutions, plotted with Mathematica. 1.652 and y(0) = 1.653. 


2.7.1 Bifurcation Points 


A differential equation used to model real-world problems usually contains a parameter, a numerical value that 
affects the behavior of the solution. As a rule, the corresponding solutions change smoothly with small changes of 
the parameter. However, there may exist some exceptional values of the parameter when the solution completely 
changes its behavior. 


Example 2.7.8: Consider a linear differential equation 


ax 


y =a-—2ayt+e™, 


where a is a number (parameter). The equation has one nullcline: y = x (a+e~%). All solutions approach this 


nullcline when the parameter a is positive, and repel when it is negative. From Fig. 2.38, you may get the impression 
that solutions merge to y = 0.5 = lim ¢ (a + e~®”) when x + +00 if a > 0 and when x + —oo if a < 0; however, 


we know from §2.5 that a linear differential equation has no singular solutions. When a = 0, the slope becomes 1, 
and the equation y’ = 1 has a totally different general solution y = 2+C. 
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Figure 2.38: Example 2.7.8: direction fields, nullclines, and streamlines for (a) a > 0 and (b) a < 0, plotted with 
Maple. 
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Definition 2.11: An autonomous differential equation y’ = f(y,a@), depending on a parameter a, would have 
a solution that also depends on the parameter. In particular, if such an equation has a critical point y*, then 


it also depends on the parameter y* = y*(a). The value of the parameter a = a* is called the bifurcation 
point of the differential equation if in its neighborhood the solutions exhibit a sudden qualitative change in 
their behavior. 


Bifurcation points are of great interest in many applications because the nature of solutions is drastically altered 
around them. After a bifurcation occurs, critical points may either come together or separate, causing equilibrium 
solutions to be lost or gained. For example, when an aircraft exceeds the speed of sound, it undergoes a different 
behavior: supersonic motion is modeled by a hyperbolic equation, while subsonic motion requires an elliptic equation. 
More examples can be found in [28]. 

For visualization, it is common to draw a bifurcation diagram that shows the possible equilibria as a function of 
a bifurcation parameter. For an autonomous differential equation y’ = f(y,a) with one parameter a, there are four 
common ways in which bifurcation occurs. 


e A saddle-node bifurcation is a collision and disappearance of two equilibria. This happens if the following 


conditions hold: - 
O° f(y, @) 40 Of (y, a) 
Oy? , da 
2 


where y* is the equilibrium solution and a* is the bifurcation value. A typical equation is y’ = a — y*. 


#0, (2.7.3) 


y=y* ,a=a* y=y* ,a=a* 


e A transcritical bifurcation occurs when two equilibria change stability. Analytically, this means that 


O f(y, a) 
a #0. (2.7.4) 


A typical equation is y’ = ay — y?. 


e Pitchfork bifurcation: supercritical. A pitchfork bifurcation is observed when critical points tend to 
appear and disappear in symmetrical pairs. In the ay-plane, it looks like a pitchfork, with three tines and a 
single handle. A supercritical bifurcation occurs when two additional equilibria are stable. A typical equation 
is y’ =ay—y?. 

e Pitchfork bifurcation: subcritical occurs when two additional equilibria are unstable. A typical equation 
is y’ = ay +y°. In the case of pitchfork bifurcation (either supercritical or subcritical), the slope function 
f(y, @) satisfies the following conditions at the critical point y = y*, a = a*: 


0° f(y, a) a f(y, a) 


Oy? ro OyOa 


y=y" ,a=a* 


#0. (2.7.5) 


y=y",a=a* 


Example 2.7.9: (Saddle-node bifurcation) Consider the differential equation 


y=a-y’, 


containing a parameter a. Equating a — y” to zero, we find the equilibrium solutions to be y = +,/a. When a < 0, 
there is no equilibrium solution. When a = 0, the only equilibrium solution is y = 0. If a > 0, we have two 
equilibrium solutions: y = +./a. One of them, y = ,/a, is stable, while another, y = —,/a, is unstable. Hence, 
a = 0 is a bifurcation point, classified as a saddle-node because of a sudden appearance of stable and unstable 
equilibrium points. 

The function f(y,a@) = a — y? satisfies the conditions (2.7.3) at the point y = 0, a = 0: fyy(0,0) = —2, 
fa(0,0) = 1. 


Example 2.7.10: (Transcritical bifurcation) The logistic equation 

y=ry-y =y(r—y), 
has two equilibrium solutions: y = 0 and y =r. When r = 0, these two critical points merge; hence, we claim that 
r = 0 is a bifurcation point. The function f(y,r) = ry — y” satisfies the conditions (2.7.4) at the point y = 0, r = 0: 


fyy(0,0) = —2, fyr(0,0) = 1. The equilibrium solution y = 0 is stable when r < 0, but becomes unstable when the 
parameter r > 0. 
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Figure 2.39: Supercritical pitchfork bifurcation dia- Figure 2.40: Subcritical pitchfork bifurcation dia- 
gram, plotted with MATLAB. gram, plotted with MATLAB. 


Example 2.7.11: (Pitchfork bifurcation, supercritical) Consider the one-parameter first order differential 
equation 

yi =ay—y’. 
Obviously, a = 0 is a bifurcation point because the function f(y) = ay — y® can be factored: f(y) = y(a— y?). 
When a < 0, there is only one critical point y = 0. However, when a > 0, there are two additional equilibrium 
solutions y = +,/a. The slope function f(y) = ay — y? satisfies the conditions (2.7.5) at the point y = 0, a = 0: 


8 f(y.a) a. & f(y.a) = 
dy" y=0,a=0 : OyOex y=0,a=0 
The equilibrium y = 0 is stable for every value of the parameter a < 0. However, this critical point becomes unstable 
for a > 0. In addition, two branches of unstable equilibria (y = +,/a) emerge. 


Example 2.7.12: (Neuron activity) Let y(t) denote the level of activity of a single neuron (nerve cell) at time 
t, normalized to be between 0 (lowest activity) and 1 (highest activity). Since a neuron receives external input from 
surrounding cells in the brain and feedback from its own output, we model its activity with a differential equation 
of the form 


y=Syt+E—-O@)—-y  (y=dy/de), 
where S is a response function, F is the level of input activity from surrounding cells, and 6 is the cell’s threshold. 
The response function S(z) is a monotonically increasing function from 0 to 1 as z + +00. By choosing it as 
(1+ e742)? with an appropriate positive constant a, we obtain a simple model for neutron activity to be 


dy dl 


dt 1+ e~uytE£—-86) _ y(t). (2.7.6) 


The equilibrium solutions of the differential equation are determined by solving the transcendent equation 


1 


with respect to y; however, its solution cannot be expressed through elementary functions. Because the response 
function is always between 0 and 1, the critical points must be within the interval (0,1) as well. 

Assuming that the level of activity is a positive constant, we draw graphs y = x and y = (1 + e~a(e+E-6)) to 
determine their points of intersection. As seen from Fig. 2.41, for small threshold values there will be one equilibrium 
solution near « = 1, and for large @ there will be one critical point near x = 0. However, in a middle range there 
could be three equilibrium solutions. 

Using the numerical values F = 0.1, a = 8, and 6 = 0.55, we find three critical points: yj ~ 0.03484, y5 ~ 0.4, 
and y3 ~% 0.9865. (Note that these equilibrium solutions will be different for other numerical values of parameters E, 


= 
a, and 0.) The slope function f(y) = (1 + eae t) — y is positive if y < yj, negative if yf < y < y3, positive 


when y is between y5 and y3, and negative if y > y;. Therefore, two critical points yj and y3 are stable, while y3 is 
the unstable equilibrium solution. 
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Figure 2.41: Example 2.7.12: (a) one bifurcation point when 6 = 0.8 and (b) three bifurcation points when 6 = 0.55, 
plotted with Maple. 


To find the bifurcation points, we must determine the values of 6 for which the graph of y = (1 + e~@@+#~9)) = 
is tangent to the line y = x. This point of tangency is a solution of the following system of equations: 


- E-0) _1 
l+e a(2+ Jat, 


d 1 _ preeree 4 (2.7.7) 
dz \ 1+e-e@F8-8% } — a (1te-e(@+2-9))7 te 
From the first equation, we express the exponential function as e~“(*+#—®) = 1-1 = £*. Hence, the second 


equation is reduced to the quadratic equation with respect to x: 


1-2 1 1 4 

2 

————— = ] =-+- —_— 4). 
ax 7 = => 2= 5 es (a > 4) 


a(x1,2+E-6) — 1 — £1,2 — 


Now the bifurcation points are found from the transcendental equation e~ = —,—> which 
V1,2 a 
leads to 7 
1. 1- 0.523 for 2; = 44 anda = 48, 
=e P+ in _ "w+ : i - 
a %1,2 0.477 for x2 = a and a= =: 


2.7.2 Validity Intervals of Autonomous Equations 


The existence and uniqueness theorems discussed in §1.6 do not determine the largest interval, called the validity 
interval, on which a solution to the given initial value problem can be defined. They also do not describe the 
behavior of solutions at the endpoints of the maximum interval of existence. The largest intervals of existence for 
the linear equation y’ + a(x)y = f(x) are usually bounded by points (if any) where the functions a(x) and f(z) are 
undefined or unbounded. Such points, called singular points or singularities, do not depend on the initial conditions. 
In contrast, solutions to nonlinear differential equations, in addition to having fixed singular points, may also develop 
movable singularities whose locations depend on the initial conditions. 

In this subsection, we address these issues to autonomous (for simplicity) equations. Actually, Theorems 2.1 and 
2.2 on page 48 describe the behavior of solutions at critical points of the differential equation. Before extending 
these theorems for the general case, we present some examples. 


Example 2.7.13: Using the well-known trigonometric identity (sin x)’ = cosx = \/1— sin x, we see that the sine 
function should be a solution of the differential equation 


y =VJ1-y?. 
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After separation of variables and integration, we get its general solution to be arcsin(y) = 2 + C, where C is a 
constant of integration (see Fig. 2.42). To have an explicit form, we apply the sine function to both sides: 


y(x) =sin(e+C), if --C<2<e5-6 


Every member of the general solution is defined only on the finite interval |-5 -—C, ies cl, depending on C’. Hence, 


2 


these re do not intersect. The envelope of singular solutions consists of two equilibrium solutions y = +1. 
Since for f(y) = \/1— y?, the integral (2.1.11) converges, so we expect the uniqueness of the initial value problem 
to fail. a every member of the one-parameter family of the general solution touches the equilibrium singular 
solutions y = +1. Moreover, we can construct infinitely many singular solutions by extending the general solution: 
1, iz? > —, 
yp(z)=4sin(e+C), if -$-C<a<§$-C, 
—1, Los — ec. 


Example 2.7.14: Consider the initial value problem 


y=y'-9, y(O)=1. 


The differential equation has two equilibrium solutions: y = +3. One of them, y = 3, is unstable, while another one, 
y = —3, is stable. Using Mathematica 


DSolve[{y’ [x] == yl[x]*y[x] - 9, y[0] == 1}, y[x], x] 


we find its solution to be 
3(e®* — 2) 
eS +2 © 


y(z) = 


The maximum interval of existence is (—00, co); y(x) approaches 3 as x + +00, and y(a) approaches —3 as x goes 
to —oo. If we change the initial condition to be y(0) = 5, the solution becomes 


3(4 + e®”) 
4 — 6 


y(z) = 


The maximum interval is (—oo, In 2); y(x) blows up at 41n2, and y(x) approaches —3 as x goes to —co. We ask 
Maple to find the solution subject to another initial condition, y(0) = —5: 


dsolve ({diff (y(x) ,x)= y(x)*y(x) - 9, y(0)=-5}, y(x)); 


3(e~ © + 4) 
w(t) = —Se 
The maximum interval of existence becomes (—} In 2,00); y(x) approaches —oo as « > —3In2 © —0.2310490602 


(from the left), and y+ —3 as x > +00. 
The initial value problem for the given equation with the initial condition y(0) = 3 has the unique constant 
solution y = 3 in accordance with Theorem 2.2, page 48. 


Example 2.7.15: Consider the initial value problem 


y=yyr—-9), yO) =1. 


This equation has three equilibrium solutions, one of which, y = 0, is stable, while the two others, y = +38, are 
3 
unstable. The solution y(a) = ——————— . exists for all x € R; yWH 0 as x > +c and ys 3 as & — —oo. To plot 
y(a) = ra PLS y y P 


streamlines (see Fig. 2.43), we use Mathematica: 
StreamPlot[{1, y (y°2 - 9)}, {x, -2, 2}, fy, -5, 5}, StreamStyle -> Blue] 
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Figure 2.42: Example 2.7.13: direction field, plotted Figure 2.43: Streamlines of the differential equation 
with Maple. y’ = y(y? — 9), plotted with Mathematica. 
cy it : 15 : . 
For another initial condition, y(0) = 5, we get the solution, y(z) = —————, which has the maximum 


interval of existence (—oo, #1In }); the function y(«x) approaches +00 as + In 2 = $n? & 0.02479372792, 


while y(x) H+ 3 as @ > —o0. 
15 


V25 — 16 el8* 
of existence (—oo, 4 In $); the function y(x) approaches —oo as a + 7 In 3, while y(x) + —3 as x + —o0. 

The slope function f(y) = y(y? — 9) has three zeroes, y = 0, y = +8, that are critical points of the given 
differential equation. Since 1/f(y) is not integrable in the neighborhood of any of these points, the equilibrium 
solutions y* = 0, 3, or —3 are unique solutions of the given equation subject to the corresponding initial conditions. 
Therefore, these critical points are not singular solutions. 


Again, changing the initial condition to be y(0) = —5, we get y(a) = — , with the maximum interval 


Example 2.7.16: Solve the initial value problem 


dy _ 62? + 4¢41 
de QWw—-1 °’ 


and determine the longest interval (the validity interval) in which the solution exists. 
Solution. Excluding the horizontal line 2y = 1, where the slope is unbounded, we rewrite the given differential 
equation in the differential form as 
(6x? + 42 + 1) dx = (2y — 1) dy. 


The line 2y = 1 separates the range of solutions into two parts. Since the initial condition y(0) = 2 belongs to the 

half plane y > 0.5, we integrate the right-hand side with respect to y in this range. Next, integrating the left-hand 

side with respect to x, we get y?—y = 2x? +2x?+a+C, where C is an arbitrary constant. To determine the required 

solution that satisfies the prescribed initial condition, we substitute « = 0 and y = 2 into the above equation to 

obtain C = 2. Hence the solution of the given initial value problem is given implicitly by y? — y = 2x”? + 227+ 242. 
Since this equation is quadratic in y, we obtain the explicit solution: 


‘lity. 
y= sta vot 4a + 8a? + 82°. 


This formula contains two solutions of the given differential equation, only one of which, however, belongs to the 
range y > 0.5 and satisfies the initial condition: 


1 arab 
y= $2) = 5 +5 V9t 4a + 8a? + 82°. 
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Figure 2.44: Example 2.7.16. (a) The streamlines for the equation aa a Grant and the solution y = $(2), plotted 
pe Mathematica; (b) the direction field and solution for the reciprocal equation ce = a. plotted with 
aple. 


The form of solution indicates that it consists of two branches. Therefore, it is more natural to consider the reciprocal 
equation 

da 2y—1 

dy 622+ 4r+1’ 
considering y as the independent variable. As can be seen from Fig. 2.44, the solution y = ¢(a) is only part of 
the solution z = x(y). Finally, to determine the interval in which the solution y = 4 + 4 V9 + 4x + 82? + 823 is 
valid, we must find the interval in which the quantity under the radical is positive. Solving numerically the equation 
9+ 4a + 8x7 4+ 82° = 0, we see that it has only one real root, zg & —1.28911, where the slope is infinite. The desired 
interval is x > ap. On the other hand, the reciprocal equation has the solution 2(y), which exists for any y. 


Is it possible to determine the maximum interval of existence of the solution to y’ = f(y), and the behavior of the 
solution near the ends of that interval without actually solving the problem? To answer this question, we observe 
that the zeroes of the slope function f, known as the critical points of the differential equation, divide the real line 
into subintervals so that f has a constant sign on each interval. This allows us to formulate the following results. 


Theorem 2.6: Let f(y) in the autonomous equation y’ = f(y) be a continuous function on some interval (a, b), 
a <b. Suppose that f(y) has a finite number of simple zeroes y1, y2, .-., Yn on this interval, so that f(y,) = 0 
and f’(y,) #0, k =1,2,...,n. These points, called equilibrium solutions, divide the interval into subintervals 
where f has a constant sign. 

If on subinterval (yz, yx+1) the function f(y) is positive, then any solution of the equation y’ = f(y) subject 
to the initial condition y(%o) = yo © (Ye, yx+1) must remain in (yz, yx41) and be an increasing function. This 
makes the equilibrium point y = yxz+41 stable, and the critical point y = yx, unstable (if the function f(y) is 
negative outside this subinterval). 

If on subinterval (yx, yx+1) the function f(y) is negative, then any solution of the equation y’ = f(y) with 
an initial value y(xo) = yo within this subinterval is also decreasing. The point y = yx,41 is an unstable critical 
point, while y = yz is stable (if the function f(y) is positive outside this subinterval). 


Theorem 2.6 can be extended for multiple nulls of odd degrees; it allows us to consider an autonomous equation 
y’ = f(y) on the interval bounded by critical points. 


Theorem 2.7: Suppose that a continuously differentiable function f(y) has no zeroes on an open interval (a, b). 
Then the maximum interval on which the solution to the initial value problem 


y =fly), ylto) = yo € (a,8), 
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can be defined is either 


“du > du 
x +f —, £ +/ —_— if f(y) is positive, (2.7.8) 
( : Y’O (u) 4 V’O ) 
or 
b a 
du du 
x +f —, 2 +f — if f(y) is negative. (2.7.9) 
( : Yo (u) : Yo “) 
It is similar to the proof of Theorem 2.1, page 48. See details in [10]. | 


At the end of this section, we reconsider previous examples along with two new ones. 


Example 2.7.17: (Previous examples revisited) In Example 2.7.13, the domain of the function f(y) = /1-— y? 
is the closed interval [—1, 1]. The corresponding initial value problem has two equilibrium singular solutions y = +1. 
To find the largest interval on which a solution to the initial value problem 


y= Valen y(0) = 0, 


can be defined, we apply Theorem 2.7. Since 


“1 dy T d | t dy T 
SS = -F5 an ———. = -, 
0 VWl-y? 2 o fl1-y 2 
we conclude that the given initial value problem has the unique solution, y = sin x, on the interval (-4, z). 


In Example 2.7.14, the function f(y) = y? — 9 = (y — 3)(y +3) has two zeroes: y = +3. These equilibrium 
solutions are not singular because solutions approach/leave them, but do not touch them. In the interval (—3, 3), 
the function f(y) is positive. According to Theorem 2.1, page 48, any initial value problem with the initial condition 
within this interval has the validity interval (—o0, 00). The initial condition y(0) = 5 € (3, 00) is outside the interval 
(—3,3) where it is negative. Since 


3 oe) 
: Wand i Wy _ 1 in2 ~ 0.2310490602, 
5 ye 9 5 Y= 9 3 
the maximum interval of existence for such a problem is (—00, +n 2). Similarly, we can determine the validity 
interval for the initial value problem with the initial condition y(0) = —5 to be (-3 In 2,00). In general, the validity 
interval for the initial value problem 
y=y'—9, — y(0) = yo, 


depends on the value of yo: 


1. Jyo- 
—=| f 3 
( 6 Io) 1 Yo < 9; 
(—o0, 00) , if —3< yo <3; 
1 yot+3 . 
—oo, = 1 f 3. 
co, gin |) i Yyo> 


In Example 2.7.15, the function f(y) = y(y? — 9) = y(y —3)(y +3) has three equilibrium (nonsingular) solutions 
y = +3 and y = 0. The initial condition y(0) = 1 belongs to the interval (0,3) where f(y) is negative. Using 
Theorem 2.7, we conclude that the maximum interval of existence is (—0o, oo) because 


3 dy 0 dy 
Te mee y| = —c and i = CO 
1 y(y? —9) LP = 2) 


If the initial condition y(0) = yo € (3,00), f(yo) > 0, the maximum interval of existence is determined by Eq. (2.7.8): 


Catt att beh BIT) (oul -§) 
—_—, ——~ = —ln = | —oo,-— In : 
0 ONS) leg Ge =O) 18 eis 18 


9 
ye 


9 
V5 


1— 1— 
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Example 2.7.18: Find the maximum interval of existence of the solution to the initial value problem 
y =y+3y?t4y+2, — y(0) =2. 


Since y® + 3y? + 4y +2 = (y+ 1)(y? + 2y 4+ 2), the equation has only the one equilibrium solution y = —1. The 
function f(y) = y? + 3y? + 4y 4 2 is positive for y > —1; therefore, the maximum interval of existence will be 


se a 
( 7 sa eS = (-n0.5 In >) = (—00, 0.05268). 


For the initial condition y(0) = —2, the validity interval will be 


es =) _ (-20.5 in2) = (—00, 0.34657). 


Example 2.7.19: Consider the initial value problem 
yf =y?— 2y?—1y+12, — y(-1) = wo. 


The slope function f(y) = y? — 2y? — lly +12 = (y+3)(y—1)(y—4) has three zeroes: y= -—3, y=1, andy=4. 
These are equilibrium solutions, two of which (y = —3 and y = 4) are unstable and one (y = 1) is stable. If the 
initial value yo < —3, the largest interval on which the solution exists is 


—3 —00 — 4ji/21 1/28 
(-1+ / a+ | Wy) = (-e0 =m ee). 
vo F(u) vo F(u) lyo — 1| 


If the initial value yo is within the interval (—3,1), the validity interval becomes 


If the initial value yo € (1,4), then the maximum interval of existence is 


Finally, when the initial value exceeds 4, the validity interval becomes 


4 ie _ gyl/21 1/28 
(-1+ 1+ f wh) = (-x n-ne oee). 
yo F(a) vw FC) [yo — 17 


Problems 
1. Consider the equation y’ = 4,/1 — y?. 

(a) Find its general solution. 

(b) Show that the constant functions y(t) = 1 and y(t) = —1 are solutions for all t. 

(c) Show that the following function is a singular solution: 
ils, for t > 4, 

p(t)= 4 sin(4t), for -3<t<, 

—1, fort <—. 

(d) What is the maximum interval of existence for the IVP y’ = 4\/1— y?, y(0) = 0? 


2. For each of the following autonomous differential equations, draw the slope field and label each equilibrium solution as 
a sink, source, or node. 


(a) y’=y-y)(y-3);  (b) y =y?A-y)y-3); 
(ce) y =y-y)?y-3); (dd) xy =y?O-y)?(y— 3); 
(e) y=y-y)?y-3) (fF) y =¥-y)?(y—-3)°. 


2.7. 
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3. Find the general solution y(t) to the autonomous equation y’ = Ay* (a > 1) and determine where it blows up. 


4. For each of the following differential equations, find its nullclines. By plotting their solutions, determine which nullclines 


the solutions follow. 
(a) Py=y?—4ty, (b) gay-t?y (c) y=sin(y)sin(t); 
. 2 a . . 
(4) g="-eay: (Ce) g=siny-?); (fF) g=y?-®. 
Consider the initial value problem y—ty = 1, y(0) = yo. Using any available solver, find approximately (up to 


5 significant places) the value of yo that separates solutions that grow positively as t + +oco from those that grow 
negatively. How does the solution that corresponds to the exceptional value of yo behave as t + +00? 


Consider the differential equations: 
@ y= (bh) g=¥ th (©) yHy?-1 
(a) Using any available solver, draw a direction field and some solutions for each equation. 


(b) Which solutions have vertical/horizontal asymptotes? 


In each of Problems 7 through 12, 


9. 


Draw a direction field for the given linear differential equation. How do solutions appear to behave as x becomes large? 
Does the behavior depend on the choice of the initial value a? Let ao be the value of a for which there exists an 
exceptional solution that separates one type of behavior from another one. Estimate the value of ao. 


Solve the initial value problem and find the critical value ao exactly. 


Describe the behavior of the solution corresponding to the initial value a. 


.y —y=2sing, y(0) =a; 10. y =3y+#, y(0) =a; 
-y =2y+te™*, = y(0) =a; ll. y =a? +2y, y(0) =a; 
y =2+3y, y(0) = a; 12. y =3?y+t?, (0) =a. 


In each of Problems 13 through 18, determine whether solutions to the given differential equation exhibit any kind of 
symmetry. Is it symmetrical with respect to the vertical and/or horizontal axis? 


13. 
14. 


y’ = 2y; 15. y’ =2y+1; 17. y’ =sin(xy); 
y =y tea; 16. y’ +2y=e ty’: 18. y’ = cos(xy). 


In each of Problems 19 through 24, prove that a vertical asymptote exists for the given differential equation. 


19. 
20. 


y =y*— 2; 21. yf =e 4 +4"; 23. y’ = y? + sin(a); 


y =y? — ya"; 22. y! =y? + 2y? —y— 2; 24. y’ =y—4y?. 


In each of Problems 25 through 30, find the maximum interval of existence of the solution to the given initial value 


problem. 
25. y= (y— y+ 1(y? +2 +2) Sy? —y? —4y?—2y—4,  y(0) =0; 
26. y = (y? + 2y + 3)(y? — 1) = y* + 2y? + 2y? — 2y— 3, (0) = 0; 
27. y= -Yy+y¥ +)sytyt+y—y—-y—1, y(0)=0; 
28. y= (y® — 1)(y* + 2y +4) Sy? + 2y* + 4y? —y? — 2y—4, (0) = 2; 
29. y= (y*-Dy+yt+H=syety ty —y—-y-1, y(0) =0; 
30. 9 = (y? +y + 2)(y> + y + 10) = 9? + 2y* + 12y? +y?+12y4+20, (0) =0. 
31. Consider the differential equation y’ = —a/y. 
(a) Use Picard’s theorem to show that there is a unique solution going through the point y(1) = —0.5. 
(b) Show that for |x| <1, y(x) = —V1 — 2? is a solution of y’ = —a/y. 
(c) Using the previous result, find a lower fence of the solution in part (a). 
(d) Solve the initial value problem in part (a) analytically. Find the exact x-interval on which the solution is defined. 
32. Suppose that y = y* is a critical point of an autonomous differential equation y = f(y), where f(y) is a smooth function. 


Show that the constant equilibrium solution y = y* is asymptotically stable if f(y) <0 (> 0) and unstable if f(y) > 0 
(<0) for y>y" (y<y’"). 


In each of Problems 33 through 36, find the solution to the given initial value problem. Then plot the solution defined 
implicitly and determine or estimate the validity interval. 
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33. y! = 3a? /(3y? — 3.4), y(1) =0; 35. y! = (2+ 6x) /(y? —2y), y(0) =1; 
34. y' = (y? — 2y)/(2+627), y(0) = 1; 36. y/ = (y? + 2y)/(@” — 42), y(1)=1 


Summary for Chapter 2 


1. The differential equation dy/dz = f(x,y) is a separable equation if the slope function f(x,y) is a product of two 
functions of dependent and independent variables only, that is, f(x,y) = p(x) q(y). The integral 


c= f payac- f 


2. A separable differential equation y’ = f(y) is called autonomous. 


defines the general solution in implicit form. 


3. A nonlinear differential equation y’ = F(ax + by +c) can be transformed to a separable one by the substitution 
v=ar+by+c,b40. 
4. A nonlinear differential equation xy’ = y F(xy) can be transformed to a separable one by the substitution v = ry. 


5. The differential equation y’ = F (4) , « #0, can be reduced to a separable differential equation for a new dependent 
variable v = v(x) by setting y = vx. This equation may have a singular solution of the form y = kz, where k is (if any) 
a root of the equation k = F(k). 


6. A function g(x, y) is called a quasi-homogeneous function with weights a and @ if 
g(A*w, Px) = AP g(a, y), 


for some real numbers a and £. If a = £, the function is called homogeneous. By setting y = 28/% oy y= ux’/*, the 
differential equation y’ = g(x,y) with a quasi-homogencous (or isobaric) equation g(x, y) is reduced to a separable one. 


dy _p ax + by 
dx Az + By 


is an example of a differential equation with a homogeneous slope function. 


7. The differential equation 


8. The differential equation 
dy _ pl{ 2 + by+c 
dx (ess) 
has a slope function which is not a homogeneous function, if either C' or c is not zero. The solution of this differential 
equation is case dependent. 
(a) Case 1. If aB ¢ bA, then this equation is reduced to a differential equation with a homogeneous slope function 
by shifting variables 
a=X+a and y=Y+6. 


To obtain this, one should determine the constants a and 6 that satisfy the following system of algebraic equations: 
aa+bB+c=0, Aa+BB+C=0. 


With this in hand, we get the differential equation with a homogeneous rate function 


dy _pP ( aX + bY ) 
dx AX + BY 
in terms of the new variables X and Y. 
(b) Case 2. If aB = DA, then we set 
v =k (ax + by) 
for an arbitrary nonzero constant k. Substituting a new variable v into Eq. (2.2.12), we obtain an autonomous 


differential equation 


vf = kat kbp (Re ) 


Av + kaC 


9. A differential equation 
M(a,y) dx + N(x, y) dy =0 (2.3.1) 


= oN. In this case, there exists a potential function ~(x, y) such that 


dy(x,y) =M(z,y)dx+N(a,y)dy, or 2 = M(x.y), SY = ney). 
y 


OM 


is called exact if and only if By 
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10. 


11. 


12. 


13. 


14. 


15. 


An exact differential equation (2.3.1) has the general solution in implicit form w(x,y) = C, where ~ is a potential 
function and C is an arbitrary constant. 


A nonhomogeneous equation 
M(a,y) da + N(a,y) dy = f(x) da (2.3.12) 


is also called exact if there exists a function #(z,y) such that dw(#,y) = M(a,y)dx + N(az,y)dy. This equation 
(2.3.12) has the general solution in implicit form 


vey) = f fle)ar +e. 


A differential equation of the first order M(x, y) da + N(«,y) dy = 0 can sometimes be reduced to an exact equation. 
If after multiplication by a function pu(z, y) the resulting equation 


(a, y)M (a, y) da + u(x, y)N (x,y) dy = 0 
is an exact one, we call (xz, y) the integrating factor. 


In general it is almost impossible to find such a function p(x, y). Therefore, we consider two of the easiest cases when 
an integrating factor is a function of only one of the variables, either x or y. Thus, 


wa) = ef [| M2 ach. uly) =exp{ f Me ah. 


The integrating factors in these forms exist if the fractions (M, — Nz)/N and (M, — Nz)/M are functions of x and y 
alone, respectively. 


If the coefficients M and N of Eq. (2.3.1) satisfy the relation My(x,y) — Nz(x,y) = p(x) N(a,y) — q(y)M(a,y), an 
integrating factor has the form p(x, y) = exp { [oa dx + faw ay}. 


There are two known methods to solve linear differential equations (2.5.3) of the first order in standard form, 
y'(x) + a(x) y(x) = f(x), (2.5.3) 
namely, the Bernoulli method and the integrating factor method. 
(a) Bernoulli’s method. First, find a function u(«) which is a solution of the homogeneous equation 
u (x) + a(x) u(x) = 0. 


Since it is a separable one, its solution is 
_ o- f a(2) da 
u(x) =e F 


Then, we seek the solution of the given nonhomogeneous equation in the form y(x) = u(x)v(a), where v(x) is a 
solution of another separable equation u(x)v’(x) = f(x), which has the general solution 


v(z) = I) ay 
(x) [ FBarsre. 


UL 


Multiplying u(x) and v(«), we obtain the general solution of Eq. (2.5.3): 
y(x) = gis le +f f(x) ef (7) de aa] 


(b) Integrating factor method. Let p(x) be a solution of the homogeneous differential equation 


p(x) — a(x) w(x) = 0 = p(x) = ef oe) de 


Upon multiplication of both sides of Eq. (2.5.3) by yw, the given differential equation is reduced to the exact 
equation 

<[ul7) y(@)] = u(2) f(@), 
and, after integrating, we obtain 


1 
w(o)u(e)= f ula) fle)de oye) = > f mle) f(a). 
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Review Problems 


Review Questions for Chapter 2 


Section 2.1 of Chapter 2 (Review) 


1. 


10. 


11. 


12. 


13. 


14. 


15. 


Which of the following equations are separable? 


(a) yo =ye™t¥ (a? +1); (b) ay’ =14+y7; (ce) y’ =sin(zy); (d)  x(e¥ +4) da =e *" dy; 
(e) y! =cos(x + y); (f) wy +y=ay?; (eg) y’=tn(y*)4+e; (bh) y/ =2e%"—*; 
(i) y' =In|axy); (Gj) y' =tany. 


Solve the following separable equations. 
(a) x(y+1)? dx = (27 +1)ye%dy; (b) (y—1)de—(a@+1)dy=0;) (c) zyy’=(y¥+)D2-2); 


(d) tanzda+cot ydy =0; (e) (@?—1)dy+(y+1)dx=0; (f) adyt+V/1+y? dx =0; 
(g) (x4+1)sinyy’ +2xrcosy =0; (h) V1l+y?dxr+arydy =0; (i) «ylnyda = seca dy; 
G) (42? + 1)y’ =’; (k) (1-2) y' = 3y? — 2y; () yy! =c. 

Find the general solution of the following differential equations. Check your answer by substitution. 
(a) (y+lde—(e+2)dy=0;  (b) yde—2(y— 1) dy =0; 
(c) cosydx —-2z sinydy = 0; (d) wda+e” %dy=0; 


(e) (y—1)\d—2)de+aydy=0; (f) xyda— (x? +1)dy =0; 
(g) (y+1)de +4 (2? — 4) dy =0; (h) sin 2x dx + cos 3y dy = 0. 


. Obtain the particular solution satisfying the initial condition. 


(a) 2eyy’=1+y, y(1)=1; (b) tanyy’+2cos*y=0, y(0) =0; 
(c) (1—y’)y’=In|z|, y(1)=0; (d) cosydr+asinydy=0, y(1) =0; 
(e) ay’=VJ1—y?, y(1/2)=0; (f) symydr=secady, y(0) =e; 

(g) y =xy, y(0) = 2; (h) Vil+y?dr+aydy=0, y(1) =0; 


(i) y'=-y' sing, y(0) =]; (j) wde+ye*dy=0, y(0)=1. 
At time t = 0, a new version of WebCT software is introduced to n faculty members of a college. Determine a differential 
equation governing the number of people P(t) who have adapted the software for education purposes at time t if it is 
assumed that the rate at which the invention spreads through the faculty members is jointly proportional to the number 
of instructors who have adopted it and the number of faculty members who have not adopted it. 


Separate the equation dy/da = (4x — 3)(y — 2)?/? to derive the general solution. Show that y = 2 is also a solution of 


the given equation, but it cannot be found from the general solution. Thus, the singular solution y = 2 is lost upon 
division by (y — 2)?/3. 


Solve the equation a a +1, 
dx 


Replace y = sin u in the equation wy = V/1-—y? arcsin y and solve it. 
A gram of radium takes 10 years to diminish to 0.997 gm. What is its half-life? 


For a positive number a > 0, find the solution to the initial value problem y’ = x?y", y(0) = a, and then determine the 
domain of its solution. Show that as a approaches zero the domain approaches the entire real line (—oo, 00), and as a 
approaches +oo the domain contains all points except x = 0. 


Bacteria in a colony are born and die at rates proportional to the number present, so that its growth obeys Eq. (1.1.1) 
with \ = ki — kz, where ki corresponds to the birth rate and kz to the death rate. Find ki and kg if it is known that 
the colony doubles in size every 48 hours and that its size would be halved in 12 hours if there were no births. 


Are the following equations separable? If so, solve them. 
r cos e” cos x23 —y? 2_ 28 

(a) - = Fee aE (b) Ag? ee Y dr= 3y° e¥ dy. 
For each of the following autonomous differential equations, find all equilibrium solutions and determine their stability 
or instability. 

(a) ¢=327?+8r-—3; (b) @=2?-—8r+7; (c) #=22?-9r+4; (d) #=32?—-5r42. 
Sketch a direction field of the differential equation y = y(1— 2t) and determine the maximum value of a particular 
solution subject to the initial condition y(0) = 1. 


Suppose that a hole is available through the earth (it has an equatorial diameter of 12,756 km, and a polar diameter of 

12713.6 km). If an object were dropped down a hole through the center of the earth, it would be attracted toward the 

center with the force directly proportional to the distance from the center. Newton’s universal law of attraction gives 
d 

the differential equation for the velocity v of the ball as > = a where R & 6,371 km is the radius of the earth, 

g © 9.8 m/sec is acceleration due to gravity, and r is the distance of the ball from the center of the earth. Find the 

time to reach the other end of the hole. 
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16. Consider a conical reservoir 20 meters deep with an open top that has radius 60 meters. Initially, the reservoir is empty, 
but water is added at a rate of rm? /hr. Water evaporates from the tank at a rate proportional to the surface area, 
the constant of proportionality being 1/(97). Convert the differential equation that describes the volume V of water 
having height h 


into the differential equation, containing h and solve it. 


Section 2.2 of Chapter 2 (Review) 


1. Use appropriate substitution to solve the given equations. 


(a) y= VieF2y—-T, (b) y' =—*EBR, (c)  y! = (22+ 3y +1)? — 2; 
(d) y= See"; (e) yo! = V2 + 3y; (f) yo =(@ty+2)?-@ty—2)?; 
(ge) y =Qrt+3y)"; (hb) yx =V2eFy-2 (@) yx =y—e+14+(e-yt2)". 


2. Determine whether or not the function is homogeneous. If it is homogeneous, state the degree of the function. 


(a) 2? +4ey+2y%; (b) Inlyl—mnlel; () —; (d) YEH®, 
(ec) 2454, (fF) sin(y+a); — (g) sin(y/x); (a) ED. 


3. Solve the given differential equations with homogeneous right-hand side functions. 


(a) yy =ar+y; (b)  (a?y + ay? — y®) da + (xy? — 2°) dy = 0; 
(c) (e+y)y! =y; (4) ya? + yPde = (e/a? + y? +9") dy; 

(e) yi =e¥/™ + y/x; (f)  y! = (4ay? — 2x°)/(4y? — xy); 

(g) y=(y?+°)/ey’; (hb) 2?y! = 40? + Try + 2y?; 

(Gi) oy =22/(r+y); (j) (6a? — 5ay — 2y?) dx = (8xy — y? — 6x”) dy; 
(k) a®y’=y?+207y; (1) (+ = ae) a + 23 =0. 


4. Solve the given differential equation with a homogeneous right-hand side function. Then determine an arbitrary constant 
that satisfies the auxiliary condition. 


m2aj—Ayj? 

(a) yf = FESS WI) = (b)oa?y! = y? $ 2ary, y(1) = 1; 
3) a.0 

(c) yf = 2S, y1)=m2; (d) x =# (n#+1), y() =e. 


5. Solve the following equations with linear coefficients. 


(a) (c@-y)y=atyt2; (b) (4x + 5y—5)y' =4— 5x — 4y; 
(c) yl =a2-yt3; (d) (x +2y) da + (2% + 3y + 1) dy = 0; 


(e) (x+2y)y’ =1; (f) (a+ 4y — 3) da — (2x + by — 2) dy = 0; 
(g) (Qa+y)y' =6x + 3y—-1; (h) (3e+y-—1)dz — (6x + 2y — 3) dy =0; 
(i) (w@+3)y =a+y4+4 (j) («+y4+1)dz+ (8y — 3x +9) dy = 0; 
(k) (e-y)y’=ax+y-2; (1) (Qe+y—4)dx+(c—y+1)dy=0; 
(m) (x—y)dr4+(3%+y)dy=0; (mn) (a-3y4+8)dxr— (x+y) dy =0; 

(0) (e+y)y'=2-2-y; (p) (Qe+yt+1)y’ =4r + 2y +6. 
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11. 


Review Problems 


unknown curve 


sun rays 


O 

Solve the following equations subject to the given initial conditions. 

j x—2y+5 7 2— 2x — 3y 

= Hae b = —____ 1) = —2/73; 
(a) y jae y(Q1) = 1; (b) y =a y(1) /3; 

,  £+3y—5 , 4y—3x¢4+1 

= ——— 2) 2: d = ———_— 0) =1 
(c) y aa y(2) = 2; (d) y eaves y(0) = 1; 

, y—4a +6 ,_ 4a + 1ly— 15 

= 2) = -3; f pee es 2)=1 
@) ¥=E w@=-3 v= FEE v= 

, Ax+3y+2 ; 3y — 5a —-1 

= RENE SS i} ee h ap i ‘Ay, Bate 
(g) y Beg y(1) ; (hb) y see y(1) = 1; 

, Ta+3y4+1 , 7 2Qrz+y—8 

= ———_ 1) 1; = 3) = 7/3; 
(i) oy ray y(1) = 1; G) oy S499 212’ y(3) = 7/3; 

3a — 2y—5 , de ty +7 

k — 1)=1, 1 = —- —1)=6. 
(k) y dnt Ty 48" yl) = 1; Q) oy 3nty ts’ y(-1) 


. Consider a solar collector that redirects the sun rays into a fixed point. Its shape is a surface of revolution obtained 


by revolving a curve y = ¢(x) about the z—axis. Without loss of generality, you can assume that the rays are parallel 
to the x—axis and that they are focused at the origin (see above picture). Show that the curve y = ¢(«) satisfies the 


differential equation 
dy Vxty?—-«x 


daz y 


Then find its solution. Hint: The law of reflection says: Za = Zy = Z6 and 28 = 2Za. 


. Show that the nonseparable differential equation xy’ = —y+ F(x)/G(ay) becomes separable on changing the dependent 


variable v = ay. Using this result, solve the equation (4x3 + y cos(xy)) dx + a cos(ay) dy = 0. 


. Solve the differential equation «(y? + y) dx = (2? — y*) dy by changing to polar coordinates x = r cos 0, y = r sin 8. 
10. 


An airplane flies horizontally with constant airspeed v km per hour, starting at distance a east of a fixed beacon and 
always pointing directly at the beacon. There is a crosswind from the south with constant speed w < v. 


Locate the positive x axis along the east-west line through the beacon and with the origin at the beacon. Show that 
the graph y = y() of the airplane’s path satisfies 


d 2 
eee eee 
dx x v x 


Using substitution y = x u, solve the airplane equation. 


By making the substitution y = vx” and choosing a convenient value of n, show that the following differential equations 
can be transformed into equations with separable variables, and then solve them. 


’ y— xy? 


1l-ay? 
ia ; (c) y= x + 302y" 


7 l—ay? , 
(b) y tye 


(a) y ~ TO a2y ' 
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12. Find a particular solution for the initial value problem y’ = y?/x, y(1) = 1 with the isobaric slope function (see 
definition on page 67). 


13. In each of the following initial value problems, find the specific solution and show that the given function ys(x) is a 
singular solution. 


(a) y =(22 +30—y)?2 42243, yl)=4  ys(z) = 2? +32. 
(b) y= _E= y(1)=1; = ys(z) = 1. 
(c) 9y'=4 lz 7 Va oy), y(3)=1; —-ys(a) = &. 
(d) y= V2+y/e-2, y)=-2 — yo(w) = —22. 
Section 2.3 of Chapter 2 (Review) 


1. Write the given differential equation in the form M(x, y) dx + N(x, y) dy = 0 and test for exactness. Solve the equation 
if it is exact. 


dy y dy 38y + 327y? 1» 308? + 14y8 
bac: Aen 4 b at ‘ —0: 
(a) dx 4y3— a’ i) da 3a + Qxsy ’ i) as TUE +4 422° : 
6ry — 2/x?y ,  Cosx cosy + 2x ! “An —e” siny 
d eae eae Ee 2 a eee f = oe Se 
(qd) “y —3a? + 2/axy?’ ie) sin x sin y — 2y (f) oy e® cos yt yo 1/3? 
, £-3 ; 3a" + Qey+y - ,  e +42" + cosy 
= : h ee ee ee oe a ee 
(g) y po (h) y ay Dey tay (i) oy Asay 
2. Solve the exact differential equations by finding a potential function. 
(a) [ysinh(ay) + 1] dx + [2+ xsinh(«y)] dy = 0; (b)  2r6? dr + [2r?6 + cos 6] dé = 0; 
(c) [2r ee —r6sin(r0)] dr —r? sin(r6)d@=0; (d) (y+e?")dx 4+ (#+y?) dy = 0; 
(e) (4ay a) da + (2x? + y) dy = 0; (f) Qeye”™ dx +e” dy = 0; 
g nao++ tr oe y) ay =0; y sinxdax = (2ycosxz + Y; 
l d 2+ 2y) dy = 0; h) y’sinrdr = (2 1)d 
(i) oe + 3y) dx + (4y? + 3a) dy = 0; (ij) (#?/2+Iny)y’ +2y+Inz =0; 
(k) oe ") dy+(4+Iny) dz =0; (1) (a? + y) da + (y? + x) dy = 0; 
m yo dx = (+, — 3xry? —1) dy; n Qry +1) dx +x? dy = 0. 
xy 
3. Solve the exact differential equation and then find a constant to satisfy the initial condition. 
2 —1 Pe a ye ey? 
(a) ; dx+ = dt=0, «x(1) =2; (b) (= a +) dz = (= = r) dy, y(1) =1; 
(c) ye" da 4 (oe + 4y) dy =0, y(0) = 2; (d)  2eydx + (x —y’)dy = 0, y(0) = 3; 
(e) (v+=) dx = (4-1-2) dy, y(0)=1; (f) (a@? +y)dr+(a+e") dy =0, y(0) =0; 
(g) da + 2e” dy = =0, y(0)=1; (h) (2ry + 6x) dx + (x? + 2) dy =0, y(0) = 1; 
(i) 3de—-Sdy=0, y(1)=1; (j) (4427) dx + (In |x| — 2y)dy = 0, y(1) =1; 
(k) 2x? dz — “ay dy=0, y(1) =0; (1) eYda—(2a+y)dy=0, y(1) =2. 


4. Using the line integral method, solve the initial value problems. 


(a) (sey + =) dx = (see = 22”) dy, y(1)=1. 

(b) [e® sin(y?) + xe” sin(y?)] dx + [2aye” cos(y?) + yJdy=0, y(0) = 1. 
(c) (2+In(ay)) da + (1 - =) dy=0, y(1)=1. 

(a) (4~ay’) de = Cs +Sty)dy, y(l)=1 

(e) y(y® — 4x) da + x(4y* — 2*)dy=0, y(1)=1. 

(f) (2asin y + 3x7y) da + (a? cosy+a2?)dy=0, y(1)=7/6. 

(g) e(2+a)y/+24+2)y=14+327, y(1)=1. 

(h) aS (= — + 6y *) ay =o. y(0) = 1. 

(i) (w+ +) dw =(4- a y(0) = 1. 


(j) (e"siny +e -¥)dx = (xe~-¥ =e” cosy) dy, y(0) = 7/2. 
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11. 


12. 


13. 


14. 


15. 


16. 


Review Problems 


For each of the following equations, find the most general function N(x, y) so that the equation is exact. 


(a) (cos(x+y)+y*)dx+N(a,y)dy=0; (b) (ysin(zy) +e”) da + N(a,y) dy; 

(c) (e?siny +a?) dx + N(a, y) dy = 0; (d) (20 + peor) da + N(w,y) dy = 0; 
(e) (2ry+1)dx+ N(a, y) dy = 0; (f) (38a + 2y) dx + N(a,y) dy = 0; 

(g) (c’ +a) da + N(2,y) dy = 0; (h)  (2y°/? +1) a7? de + N(z,y) dy = 0; 
(i) (2+ y? cos 2x) dx + N(x, y) dy = 0; (j) (3a? + 2y) dx + N(a, y) dy = 0; 

(k) we %dx+N(a,y) dy =0; (1) 2ry~* da + N(a,y) dy = 0. 


Solve the equation xf (a? + y*) da + yf(x? + y”) dy = 0, where f is a differentiable function in some domain. 


Prove that if the differential equation M dx + N dy = 0 is exact, and if N (or, equivalently, 7) is expressible as a sum 

yor, ai(x)bi(y) of separated arbitrary (differentiable and integrable) functions a;(x) and 6;(y), then this equation may 

be integrated by parts. 

x? + Qay + 3y? 
(x? + y?)P 

For an exact equation M dz + N dy = 0, show that the potential function can be obtained by integration 


Determine p so that the equation (yda — x dy) is exact. 


v= [(x — 2o)M(E,n) + (y— yo) N(E,n)] dt, €=a20+t(a—20), n=yot+t(y—yo), 


provided that the functions M(ax,y) and N(x, y) are integrable along the straight line connecting (xo, yo) and (a, y). 


Solve the nonhomogeneous exact equations. 
(a) [2ay? + ye¥] da + [x(1+y)e% + 2yax?]dy=2rda; (b) y?dx+2xydy = cosada; 


(c) cos(ay) [yda + x dy] = 2a e* dz; (d) ade aa roe 


ry JI + xy? 
Consider the differential equation (y + e”) dy = (a — e~*) da. 
(a) Solve it using a computer algebra system. Observe that the solution is given implicitly in the form w(x, y) = C. 


(b) Use the contourplot command from Maple or the ContourPlot command from Mathematica to see what the 
solution curves look like. For your x and y ranges you might use -l << # <3 and -—2<y< 2. 


(c) Use implicitplot command (consult Example 1.3.2, page 6) to plot the solution satisfying the initial condition 
y(1.5) = 0.5. Your plot should show two curves. Indicate which one corresponds to the solution. 


Find the potential function (a, y) to the exact equation «dx + y~* (ydx — xdy) = 0, and then plot some streamlines 
(x,y) = C for positive and negative values of the parameter C. 


Given a differential equation y’ = p(x, y)/q(x,y) in the rectangle R = {(z,y) : a<a<b, c< y < d}, suppose that 
w(x, y) is its potential function. Show that each statement is true. 


(a) For any positive integer n, w" (x,y) is also a potential function. 
(b) If d(xz,y) #0 in R and if n is any positive integer, then w~"(x, y) is a potential function. 
( 


c) If F(t) and F’(t) are continuous functions on the entire line —oo < t < co and do not vanish for every t except 
t= 0, then F(a, y)) is a potential function whenever w(x, y) is a potential function for a differential equation. 


If the differential equation M(x, y) dx + N(x, y) dy = 0 is exact, prove that 
[M(x,y) + f(x)] da + [N(2,y) + g(y)] dy = 0 


is also exact for any differentiable functions f(x) and g(y). 


Show that the linear fractional equation 
dy axt+by+e 
dx Axr+By+C 
is exact if and only if A+b=0. 
In each of the following equations, determine the constant A such that the equation is exact, and solve the resulting 
exact equation: 


(a) (a? + 3xy) dx + (Ax? + 4y — 1) dy = 0; (b) (24 + Ay?) da — 2ay dy = 0; 

1. Az +1 i 
(c) (a+ P <) dx + B dy = 0; (d) (3 ) dz = ic - ES dy; 
(e) (1 —Ax?y — 2y?) da = (a? + dary + 4) dy; (f) pana) aay x? + 2xy) dy = 0; 


(g) (e*siny + 3y) dx + (Ae* cosy +3x)dy=0; (h) (£+6z2) da On lel 4 A) dy = 0. 


x 


Review Questions for Chapter 2 133 


17. In each of the following equations, determine the most general function M(x, y) such that the equation is exact: 


(a) M(x,y) dx + (Qry — 4) dy = 0; (b) M(a,y) dx + (2x7 — y*) dy = 0; 

(c) M(x,y) dx + (Qa + ») dy = 0; (d) M(a,y) da + (4y? — 20%y) dy = 0; 

(e) M(a,y) da + (307 +4ry—6)dy=0; (f) M(x,y)da +(y* +In|z|) dy = 0; 

(g) M(x,y) dx ii We y dy = 0; (h) M(a,y) dx + 2y sinx dy = 0; 

(i) M(a,y)dx+ (sec? y+ax/y) dy = 0; (j) M(a,y) dx = (e-¥ — sinx cosy — 2xy) dy; 
(k) M(x,y) dx + (xe +2/y2)dy=0; (1) M(x,y)dxe+ (2 = =) dy =0. 


Section 2.4 of Chapter 2 (Review) 


1. Find an integrating factor as a function of x only and determine the general solution for the given differential equations. 


(a) y(1+a)da +27 dy =0; (b) xy" dy — (x? + y*) da = 0; 

(c) (a? — 3y)da+ady =0; (d) (y—2ax)dx —ady = 0; 

(e) (8a? +y+4+3a°y)de+ady=0; (f) (2x? + 2ary? +1)ydx + (3y? + x) dx = 0; 
(g) sinydx + cosy dy = 0; (h) (3y? cota + + sin 2x) da = 2y dy. 


2. Find an integrating factor as a function of y only and determine the general solution for the given differential equations. 
(a) y(y?+1)dx+a(y2—-1)dy=0; (b) (y?—32? + ay + 8y + 2x — 5) dy + (y — 6x + 1) dx = 0; 
(ec) y(y2+2)dre+a(y?—4)dy=0; (d) (2x + 2ay) da + (x?y + 1+ 2y?) dy = 0; 


(e) © Yde +(x/y)e~¥ dy = 0; (f)  (80y — 2-2y) de = (0* — 30° *y4) dy; 

(g) ada = (xy + y°) dy; (h) (4 secy —tany) dx = (a — sec y Ina) dy. 
3. Use an integrating factor to find the general solution of the given differential equations. 

(a) (y? +y)da + (wy — x) dy = 0; (b) ycosada + y? dy = 0; 

(c) (e*+y—1) dxr—dy=0; (d) da+(ax/y+ cosy) dy = 0; 

(e) (A4a°y~* + 8y~") da + (Bay? + 4y)dy=0;  (f)  yda + (2ay —e~¥) dy = 0; 

(g) (l+y?)det y(a+y? +1) dy =0; (h) (1+ ay) dx + a? dy =0; 

(i) (1+ 2a? + 4ay) da + 2dy = 0; (j) yda+a?dy =0; 

(k) y(1+y*)da + a(y? — 2) dy = 0; (1) (a* + y*) da + 2ay? dy = 0. 
4. Using an integrating factor method, solve the given initial value problems. 

(a) 4ydr+5ady=0, y(1) =1. (b) (2xe” — y”) da + 2ydy =0, y(0) = V2. 


(c) 3a! y’ de — 2y dy = 0, y(4)=8. (d) da+(r2+2y+1)dy=0, y(0) =1. 

(e) (y+1)da+a?dy=0, y(1) =2. (f) awy?da+ (x?y? —2)dy=0, y(1)=1. 

(g) yi =e*+2y, y(0)=-1. (h) 2cosydx = tan2zx sinydy, y(7/4) = 0. 

(i) (a* — y) dz=ay’dy, y(1)=1.  (j) (2y — 9x) dx + 6x (1 — ru) dy = 0 y(1) = 1. 


5. Show that if the ratio (M, — Nz) /(yN —«M) isa function g(z) of the product z = ry, then pu(xy) = exp \/ g(z) az} 
is an integrating factor for the differential equation (2.4.1). 
Section 2.5 of Chapter 2 (Review) 


1. Solve the given linear differential equation with variable coefficients. 
(a) ty +@+)y=t (by) Atay! + ay = wl +27); 
(c) ay'=y+4e*Ine; (d) y!+ 2%) =2r ((z| <1); 
(e) 2y’+ ay = 22; (f) «(1 ay! =yta3 — 2Qyx?. 
2. Solve the given initial value problems. 
(a) (a? +1)y'+2ey=2, y(2)=1 (b) y'+2y=2", y(0)=1; 
(c) y’t+y=axcosz, y(0)=1; (d) y'-y=csinz, y(0)=1,; 
(e) wy! +4e7y+1=0, y(l)=1; (ff) ay’ +ay=2, y(1)=0. 
3. Solve the initial value problems and estimate the initial value a for which the solution transits from one type of behavior 
to another. 
(a) y'+5y=3e-7, y(0)=a; (b) yy’ +5y=29cos2a, y(0) =a. 


x 


4. Consider the differential equation y’ = ay + be”, subject to the initial condition y(0) = c, where a, b, and c are some 
constants. Describe the asymptotic behavior as x — oo for different values of these constants. 


5. Suppose that a radioactive substance decomposes into another substance which then decomposes into a third substance. 
If the instantaneous amounts of the substances at time ¢ are given by x(t), y(t), and z(t), respectively, find y(t) as 
follows. 

The rate of change of the first substance is proportional to the amount present so that a = ¢ = —az. The rate of 
increase of the second substance is equal to the rate at which it is formed less the rate at which it decomposes. Thus, 
y = ax(t) — By(t). Eliminate x(t) from this equation and solve the resulting equation for y(t). 


134 


10. 


Review Problems 


Let a(x), fi(x), and f2(a) be continuous for all « > 0, and let y: and y2 be solutions of the following IVPs. 


yta(zjy=filz), yO)=a andy’ +a(z)y=fa(z), y(0) =e, 
respectively. 
(a) Show that if fi(v) = fo(a) and ci > ce, then yi(x) > ye(x) for all x > 0. 
(b) Show that if fi(a@) > fo(x) for all x > 0, fi(0) = fe(0), and ci = ce, then y1(x) > ya(z). 


For what values of constants a, b # 0, and c, a solution to the initial value problem y’ = ax + by +c, y(0) = 1 is 
bounded. 


Formula (2.5.6) shows that the general solution of a first order linear differential equation (2.5.3) is a family of curves 
of the form 

y(x) = C- k(x) + A(x). 
Show, conversely, that the differential equation of any such family is a first order linear differential equation. 


Show that the differential equation y’ + p(x) y = q(x) y Iny can be reduced to a linear equation by substitution v = In y. 
Apply this method to solve the equation 
ry = xy + ay Iny. 


Show that Newton’s law of cooling, T= k(M — T), can be regarded as the linearization of Stefan’s law, T = 
—o (r* —M = \s near the equilibrium solution T(t) = M. Here the constant of proportionality k is positive if T > M. 


Section 2.6 of Chapter 2 (Review) 


1. 


Solve the following Bernoulli equations. 
(a) 627 y=y(2t+4y);  (b) oy tyvE= 3,/2: 


(c) y'+2y=e-*y’; (d) (1+ 47)y’ — 2ay = 4,/y(1 + 2?) arctan 2; 
(ce) y'+2y=e7y"; (f) y= (acost+b)y—y; 

(g) (P+)y=ty+y’; (hy) yp =y/(et 1+ @tyy™, 

(Gi) y+ 2ty +ty* =0; G) ty’ ty = ay"; 

(k)  t?y = 2ty— 4"; (1) 2y—y/t+y° cost = 0; 


3sing=0; (n) y-—y' cosr=y’cosa(1—sinz). 


(m) yf t+¥ty 
Find the particular solution required. 

(a) y'ty+zy?=0, y9(0)=1; (b) y+ 4ry =ay", y(0) =1; 

(c) ay! +t=2ay', y(1)=1; (a) ay’ —2y= doy”, y( 

(e) y-y=3e™, y(0)=1/2;  (f) ry’ = 2y — Sa*y", y(1 


Solve the equations with the independent variable missing. 


(a) yy" =4(y')? -y'; (b) yy" +3’)? =0; (ce) yy” —2(y')? =0; 
(4) (l+y*)yy” = 3y?-1)(y')?s (e) oy” + y')? = 267; (f) oy" — 2(y’)? = 0; 
(g) yo" +y'— 2y° =0; (h) y= y- (y's G)y’y” + u(y)? = 1/2. 


Find validity interval for the solution of the initial value problem: 
yVl+x22=ay, yO)=1. 


Solve the equations with the dependent variable missing. 
(a) y"=14(y')?s (b) wy" = 2ey’— (y's (ce) yy” -@=2y'5 (d) oy" ty =e™, 
Find the general solution of the given equations by solving for one of the variables x or y. 
(a) (y')?+2y’/-e=0; (b) aly’)? + 2y'=0; (ec) 2(y')? +(y')?-y=0; (a) (@? -4)y? =. 
Consider a population P(t) of a species whose dynamics are described by the logistic equation with a constant harvesting 


rate: ; 
P=r(1—P)P-k, 


where r and k are positive constants. By solving a quadratic equation, determine the equilibrium population(s) of this 
differential equation. Then find the bifurcation points that relay the values of parameters r and k. 


Assume that y = « + V9 — 2? is an explicit solution of the following initial value problem 


(y+axr)y' +ay+ br =0, y(0) = yo. 


Determine values for the constants a, b, and yo. 
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9. A block of mass m is pulled over a frictionless smooth surface by a string having a constant tension F' (it has units of 
force, e.g. newtons). The block starts from rest at a horizontal distance d from the base of the pulley. Using Newton’s 
law of motion, derive the (horizontal) velocity u(x) of the block as a function of position 2. 


10. Reduce the order of the differential equations with homogeneous functions (some of them are homogeneous in the 
extended sense) and solve them. 


(a) 207y"” =a? + 8y; (b) 345 +2(y')? = yy" +348; 
(ce) (y’)? — 2yy” = 2”, (a) yy +(y') = 2uy's 
(e) yy" =(y')?tyve/2; (f) yo" +34 =542(y'); 
(g) yy” =(y')? +2; (hy) ayy" + y?/x = 2yy' + 2/2; 
11. Solve the equation (x? siny — x)y’ + 2y = 0 by reducing it to the Bernoulli equation. Hint: Consider x as a dependent 


variable. 


12. A model for the variation of a finite amount of stock y(t) in a warehouse as a function of the time t caused by the 
supply of fresh stock and its renewal by demand is 


y 


y’ =ay(t) coswt — by?(t), 
with the positive constants a, b, and w, where y(0) = yo. Solve for y(t) and plot its solution using one of the available 
software packages. 


13. Stefan—Boltzmann law for the Kelvin-scale temperature T(t) of a body in an environment with ambient temperature 
M states that T = 0 (M* - a) for some positive constant o (with units K~3/sec). Using second order Taylor’s 
series approximation for the slope function f(T) =o (M — i hal around the equilibrium temperature T = M, derive a 
Bernoulli equation for T(t) and then solve it. 


14. In Example 2.6.14, show that when second order effects are taken into consideration the intensity of the reflected beam 
must satisfy the differential equation hy” /2 — (1+ 2phy)y' + p+ py? + ph(y ni y’) —0. 


Section 2.7 of Chapter 2 (Review) 


1. Show that the solution to the initial value problem y’ = a? + y?, y(0) = 1 blows up at some point within the interval 
(0, 1]. 


In each of Problems 2 through 7, find the maximum interval of existence of the solution to the given initial value problem. 


2 9=(Y+Dy ty? 4+2ey ty ty ty’? +2? +2, y(0) =0; 

3. g=(y* +9? + D(y— 1)? = y® — dy? + 2y* — 2y? + 2y? -—2y4+1, (0) =0; 

4. y=((y ty? +1)(y— 1° Sy" — 8y° + 4y? — 4y* + 4y? — 4y? + 3y—1, (0) = 0; 
5. = (y? + I)(y? + 2y +38) = y? + 2y* + By? +y?+2y¥4+3, y(0) = 2; 

6. 9 = (y* — I) (y? + 2y + 2) = 9? + Dy? + 2y*—y? —2y—2, (0) = 0; 

7. y= (y? + 2y +12)(y? + 2y +3) = y® + 4y* + 15y? + 4y? + 30y +36, y(0) = 0. 


In each of Problems 8 through 19, find the bifurcation value of the parameter a and determine its type (saddle node, 
transcritical, or pitchfork). 
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8. y=ltayty’; 12. y = ay — sin(y); 16. ¢=y(l —y") —a(l—e-*); 
9. y=y—ay/(1+y"); 13. y=a tanhy— y. 17. y=aln(l+y)+y; 
.ea=y =e; 14. y=ay+y/4—y/(1+y); 18. y=ayty?—y?; 
ll. y=ay-—y(1— y); 15. y=a-y-e’; 19. y =sin y(cosy — a). 
20. Consider the differential equation 

ae ad (,y) # (0,0), 

dix 0, (x,y) = (0,0), 


subject to the initial condition y(0) = 0. Verify that the slope function f(z, y) is continuous at the origin but does not 
satisfy the Lipschitz condition. Show that the initial value problem has infinitely many solutions 


y(z) = —- Vat+c4 for a real constant c. 


Chapter 3 


The graph on the right-hand side is obtained by plot- 
ting the solution to the initial value problem 


dy  2+2y—1 


dx 2r+4y+1’ 


using the standard Runge-Kutta algorithm imple- 
mented in MATLAB®. As seen from the graph, the 
numerical procedure became unstable where the slope 
is close to the vertical direction. 


ode dydx = (x+2*y-1)/(2*x+4*y+1) initial condition y(0) = 0 
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Numerical Methods 


As shown in the previous chapters, there are some classes of ordinary differential equations that can be solved 
implicitly or, in exceptional cases, explicitly. Even when a formula for the solution is obtained, a numerical solution 
may be preferable, especially if the formula is very complicated. In practical work, it is an enormous and surprising 
bonus if an equation yields a solution by the methods of calculus. Generally speaking, the differential equation 
itself can be considered as an expression describing its solutions, and there may be no other way of defining them. 
Therefore, we need computational tools to obtain quantitative information about solutions that otherwise defy 
analytical treatment. 

A complete analysis of a differential equation is almost impossible without exploiting the capabilities of computers 
that involve numerical methods. Because of the features of numerical algorithms, their implementations approximate 
(with some accuracy) only a single solution of the differential equation and usually in a “short” run from the starting 
point. In this chapter, we concentrate our attention on presenting some simple discrete numerical algorithms for 
solutions of the first order differential equation in the normal form subject to the initial condition 

y =f(z,y), — y(®0) = yo, 
assuming that the given initial value problem has a unique solution in the interval of interest. Discrete numerical 
methods are the procedures that can be used to calculate a table of approximate values of y(x) at certain discrete 
points called grid, nodal, net, or mesh points. In this case, a numerical solution is not a continuous function at 
all, but rather an array of discrete pairs, i.e., points. When these points are connected, we get a polygonal curve 
consisting of line segments that approximates the actual solution. Moreover, a numerical solution is a discrete 
approximation to the actual solution, and therefore is not 100% accurate. 

In two previous chapters, we demonstrated how available computer packages can be successfully used to solve 
and plot solutions to various differential equations. They may give an impression that finding or plotting these 
solutions is no more challenging than finding a root or logarithm. Actually, any numerical solver can fail with an 
appropriate initial value problem. The objective of this chapter is three-fold: 

e to present the main ideas used in numerical approximations to solutions of first order differential equations; 


e to advise the reader on programming an algorithm; 


e to demonstrate the most powerful technique in applied mathematics—iteration. 


137 


138 Chapter 3. Numerical Methods 


An in-depth treatment of numerical analysis requires careful attention to error bounds and estimates, the stability 
and convergence of methods, and machine error introduced during computation. We shall not attempt to treat all 
these topics in one short chapter. Instead, we have selected a few numerical methods that are robust and for which 
algorithms for numerical approximations can be presented without a great deal of background. Furthermore, these 
techniques serve as general approaches that form a foundation for further theoretical understanding. To clarify 
the methods, numerical algorithms are accompanied with scripts written in popular computer algebra systems and 
MATLAB. 

Since both differentiation and integration are infinite processes involving limits that cannot be carried out on 
computers, they must be discretized instead. This means that the original problem is replaced by some finite system 
of equations that can be solved by algebraic methods. Most of them include a sequence of relatively simple steps 
related to each other—called recurrence. Numerical algorithms define a sequence of discrete approximate values 
to the actual solution recursively or iteratively. Therefore, the opening section is devoted to the introduction of 
recurrences related to numerical solutions of the initial value problems. 


3.1 Difference Equations 


Many mathematical models that attempt to interpret physical phenomena often can be formulated in terms of 
the rate of change of one or more variables and as such naturally lead to differential equations. Such models are 
usually referred to as continuous models since they involve continuous functions. Numerical treatment of problems 
for differential equations requires discretization of continuous functions. So a numerical algorithm is dealing with 
their values at a discrete number of grid points, although there are some cases where a discrete model may be more 
natural. 

Equations for sequences of values arise frequently in many applications. For example, let us consider the amorti- 
zation loan problem for which the loan amount plus interest must be paid in a number of equal monthly installments. 
Suppose each installment is applied to the accrued interest on the debt and partly retires the principal. We introduce 
the following notations. Let 


A be the amount of the principal borrowed, 
m be the amount of each monthly payment, 
r be the monthly interest rate, and 

Pn be the outstanding balance at the end of month n. 


Since interest in the amount rp, is due at the end of the n-th month, the difference m— rp, is applied to reduce 
the principal, and we have 


Pntl =Pn—(M—Tpn) or Pasi -—(ltr)pn=—m (n=1,2,...). 


This equation and the initial condition p, = A constitute the initial value problem for the so-called first order 
constant coefficient linear difference equation or recurrence. 

So our object of interest is a sequence (either finite or infinite) of numbers rather than a function of a continuous 
variable. Traditionally, a sequence is denoted by a = {an}n>0 = {@0, a1, @2,...} or simply {a,}. An element a, of 
a sequence can be defined by giving an explicit formula, such as a, = n!, n = 0,1,2,.... More often than not, the 
elements of a sequence are defined implicitly via some equation. 


Definition 3.1: A recurrence is an equation that relates different members of a sequence of numbers y = 


{yo, Yi, Y2,---}, where y, (n = 0,1,2,...) are the values to be determined. 


A solution of a recurrence is a sequence that satisfies the recurrence throughout its range. 


Definition 3.2: The order of a recurrence relation is the difference between the largest and smallest subscripts 
of the members of the sequence that appear in the equation. The general form of a recurrence relation (in 


normal form) of order p is yy, = f(N,Yn—1; Yn—2s+++sYn—p) for some function f. A recurrence of a finite order is 
usually called a difference equation. 
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For example, the recurrence pn41 — (1+1r)pn = —m from the amortization loan problem is a difference equation 
of the first order. In general, a first order difference equation is of the form ®(y,,yn,_1) = 0, n = 1,2,.... In what 
follows, we consider only first order difference equations in normal form, namely, ynz+1 = f(n, Yn). When the function 
f does not depend on n, such difference equations are referred to as autonomous: y, = f(Yn—1)- 


Definition 3.3: If in the difference equation yn = f(Yn—1,Yn—2,---,Yn—p) of order p, the function f is linear 
in all its arguments, then the equation is called linear. The first and second order linear difference equations 
have the following form: 


AnYn+1 + bnyn = fn first order linear equation, 
AnYn+1 + bnYn + CnYn—-1 = fn second order linear equation, 


where {fn}, {an}, {bn}, and {c,} are given (known) sequences of coefficients. When all members of these 
sequences do not depend on n, the equation is said to have constant coefficients; otherwise, these difference 
equations (recurrences) are said to have variable coefficients. 

The sequence { f,,} is called a nonhomogeneous sequence, or forcing sequence of the difference equation. 
If all members of {f,} are zero, then the linear difference equation is called a homogeneous equation; 
otherwise, we call it nonhomogeneous (or inhomogeneous). 


A difference equation usually has infinitely many solutions. In order to pin down a solution (which is a sequence 
of numbers), we have to know one or more of its elements. If we have a difference equation of order p, we need to 
specify p sequential values of the sequence, called the initial conditions. So, for first order recurrences we have to 
specify only one element of the sequence, say the first one yo = a; for the second order difference equations we have 
to know two elements: yo = a and y; = 0; and so forth. It should be noted that the unique solution of a recurrence 
may be specified by imposing restrictions other than initial conditions. 

There are some examples of difference equations: 


Yn4i — y2 =0 (first order, nonlinear, homogeneous, autonomous); 

Ynt1 + Yn —NYn—-1 =O (linear second order, variable coefficients, homogeneous); 
( 
( 


Yn41 —Yn-1 =n? linear second order, constant coefficients, nonhomogeneous); 
Frat = Fat Fa-1 Fibonacci recurrence is a linear second order, constant coefficients, 
homogeneous, autonomous difference equation). 


Much literature has been produced over the last two centuries on the properties of difference equations due to their 
prominence in many areas of applied mathematics, numerical analysis, and computer science. 

Let us consider a couple of examples of difference equations. For instance, the members of the sequence of 
factorials, {O!, 1!,2!,...} can be related via either a simple first order variable coefficients homogeneous equation 
Ynt1 = (N+ 1)yn (n = 0,1,2...), yo = O! = 1, or a second order variable coefficients homogeneous equation 
Yn+1 = N[Yn + Yn—1], subject to the initial conditions yo = 1, yi = 1. 


Example 3.1.1: Consider now a problem of evaluating the integrals 
1 wT 
Tn =f x” e* dx and Ss | x” sina dz, n> 0. 
0 0 


Assuming that a computer algebra system is not available, solving such a problem becomes very tedious. Integrating 
Ini, and S42 by parts, we get 


1 

lag = aor e*| Po - | (n + 1)” e” dz, 

0 
LT I 
Snya = [-a"'* cose + (n+ 2)a"** sing]|—, — (n+ 2)(n+ yf x” sinx dz, 
0 
or 
1 
Ingd = e€-—(n4 Ih, Io =| e’ dx =e-—1; (3.1.1) 
0 


Sanya = wt? —(n+2)(n+1)Sn, SoS 2. Si=a: (3.1.2) 


140 Chapter 3. Numerical Methods 


Equation (3.1.1) is a first order variable coefficient inhomogeneous difference equation. It can be used to generate 
the sequence of integrals [,,, n = 1,2,.... For instance, 


I=e-h=1, h=e-2h=e-2, h=e—-3h=6—-2e, In =9e—24,.... 


Equation (3.1.2) is a second order variable coefficient inhomogeneous difference equation. However, it is actually a 
first order difference equation for even and odd indices because it relays entries with indices n and n+ 2. Similarly, 
we can generate its elements using the given recurrence: 


So=m?-4, S3=n°-6r, Sg=n*—12n7448,.... 


The problems studied in the theory of recurrences not only concern their existence and analytical representation of 
some or all solutions, but also the behavior of these solutions, especially when n tends to infinity, and computational 
stability. 

In this section, we consider only first order recurrences. We start with a constant coefficient case: 


Yn+1 = f (Yn), n=0,1,2,..., (3.1.3) 


where the given function f does not depend on n. If the initial value yo is given, then successive terms in the 
sequence {yo, ¥1, y2,---} can be found from the recurrence (3.1.3): 


yi=f(yo), y=fly)=Ff(flyo)), y3 = flye) =ffF(Yo),---- 


The quantity f(f(yo)) or simply ff(yo) is called the second iterate of the difference equation and is denoted by 
f?(yo). Similarly, the n-th iterate yp, is yn = f"(yo). So such an iterative procedure allows us to determine all values 
of the sequence {Yn}. 

It may happen that the recurrence (3.1.3) has a constant solution (which does not depend on n). In this 
case this constant solution satisfies the equation y, = f(yn) and we call it the equilibrium solution because 
Yn+1 = f (Yn) = Yn- 

The general linear recurrence relation of the first order can be written in the form 


Ynt+1 = PnYn + dn; n=0,1,2,..., (3.1.4) 


where p, and gy are given sequences of numbers. If g, = 0 for all n, the recurrence is said to be homogeneous and 
its solution can easily be found by iteration: 


Y1 = PoYo, Y2=P1Y¥1 =P1iP2Y0, Y3 = P2P1P2Yo, +--+, Yn=Pn-1---P1P2Yyo- 


As no initial condition is specified, we may select yo as we wish, so yo can be considered as an arbitrary constant. 
If the coefficient p, in Eq. (3.1.4) does not depend on n, the constant coefficient homogeneous difference equation 


Ynt1 = PYn, n=0,1,2,..., 


has the general solution y, = p” yo. Then the limiting behavior of y, is easy to determine 


0, if |p| < 1; 
lim Yn = lim p" yo = 4 Yo, if p = 1; 
oe — does not exist, otherwise. 


Now we consider a constant coefficient first order linear nonhomogeneous difference equation 
Ynt1 = PYn + Ans n=0,1,2,.... (3.1.5) 
Iterating in the same manner as before, we get 


Y1 = PYo + Qo, 
y2 = pyr +H = p(pyo +90) +H =P" Yo +P +H, 
y3 = py2 +42 =p yo + p'go + PU + |, 
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and so on. In general, we have 


n-1 


Yn =P" yo t+ > pay. (3.1.6) 
k=0 
In the special case where g, = q # 0 for all n, the difference equation (3.1.5) becomes 
Yn+1 = Pin +4, n=0,1,2,..., 
and from Eq. (3.1.6), we find its solution to be 
Yr =P" yo t+ (Lt ptp? +---+p™")a. (3.1.7) 


The geometric polynomial 1 + p+ p?+---+p"~! can be written in the more compact form: 
g poly 


1=p” : 1: 
l+ptpt+e-tpmtad Te Heels (3.1.8) 
n, ifp=1. 
The limiting behavior of y,, follows from Eq. (3.1.8). If |p| < 1 then y, + q/(1 — p) since p” > 0 as n > oo. If 
p=1 then y,, has no limit as n > oo since yn, = yo + ng > oo. If |p| > 1 or p= —1 then y, has no limit unless the 
right-hand side in Eq. (3.1.7) approaches a constant, that is, 


p'yotdtpt+p2+:--+p™')q constant (p41) asn—>oo. 


Since _ 
= —P q q 
P’yot(Lt+pt+p? +--+ +p" )q =p" yo + q==— +p" (yw -——}, 
l-p l-p l-p 
we conclude that y, = q/(1 — p) is the equilibrium solution when yo = q/(1— p). | 


Solution (3.1.6) can be simplified in some cases. For example, let g, = aq” for a constant a and q 4 p. We seek 
a particular solution of Eq. (3.1.5) in the form y, = Aq”, where A is some constant to be determined. Substitution 
into the equation yn+1 = Pp Yn + aq” yields 


Aq"*1 = pAq” +aq” or Aq=pA+a, 


because Yn41 = Ag"*+. Solving for A, we get 


and the general solution becomes 
nm a nm 


for some constant C. If yo is given, then 


nr 


a nm 
Yn =p” yo + ——(q"—p") (Gp), ae Op meee (3.1.9) 


q—Pp 
Now we consider the difference equation 
Yn4+1 = Pyn + ap”, n=0,1,2,.... 


We are looking for its particular solution in the form y, = Anp” with unknown constant A. Substitution into the 
given recurrence yields 
A(n +1)p"*1 = pnAp” + ap” or Ap=a. 


So the general solution becomes 
Yn =P” yo tanp™, m= 0,1,2, 055. (3.1.10) 
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If gn is a polynomial in n, that is, gn = @mn™ + Qm_—in™ + +--++ ao, then the solution of Eq. (3.1.6) has the form 


Yn =P" Yo + Bmn™ + Bm—-in™ "+--+ fo  (p#1), (3.1.11) 


where coefficients 3; are determined by substituting (3.1.11) into the given difference equation. 
The general linear difference equation of the first order (3.1.4) can be solved explicitly by an iterative procedure: 


mam (wt S424), n=0,1,2,..., (3.1.12) 
M1 72 Tn 

where 7 = 1, ™ = po, 72 = PoPi, «++, Tr = POP1-++Pn—-1- 

Example 3.1.2: Suppose we are given a sequence of numbers {po, p1, .--, Pn,---}. For any first n elements of the 


sequence, we define the polynomial 
P,,(x) = pox” + pie + +... + Dp: 
Now suppose we are given a task to evaluate the polynomial P,,(x) and some of its derivatives at a given point x = t. 


def 


Let yn = yo = P,,(t). We calculate the numbers y,, recursively by the relations 
Yn = tyn-1 + Dn, n= AD sna ey Yo = Po: (3.1.13) 
The given recurrence has the solution (3.1.12), where 7, = t”; so 


Un = yO) = pot” = pit? tere + Pn. 
Let y*) = P(t) be the value of k-th derivative of the polynomial P,,(x) at a given point x = t. For k = 1,2,..., 
we generate the sequence {ys*)} recursively using the difference equation 


gay), yO = ty +y"), 2 =0,1,2,.... (3.1.14) 


Note that the sequence ys) terminates when k exceeds n. 
As an example, we consider the polynomial P(x) = 8x° — 6x4 + 7x? + 32 — 5, and suppose that we need to 
evaluate P5(x) and its derivative P/(x) at « = 0.5. Using algorithm (3.1.13) with t = 0.5, we obtain 


yo = 8; 

yi = tyot+pi = 0.54 —6=4-6=—-2; 

yo = ty +p2=0.5y,+0=-1; 

yg = tygt+p3 =0.5y2+ 7 = 6.5; 

ys = tyg tp. =0.5y3+3 =3.254+3 =6.25; 
Y5 = tya+ps = 0.5y4 —5 = —1.875. 


So P;(0.5) = —1.875. To find P(x), we use the algorithm (3.1.14): 


io = wee 

yy? = ty + =8t+y1 =4-2=2; 
= +m =1-1=0, 

ys? = ty) +.y3 = ys = 6.5; 

yf? = tyf) + ys, = 3.254 6.25 = 9.5. 


Therefore, P3(0.5) = 9.5. 


Example 3.1.3: Compute values for [yg = fo x? e? dx using recurrence (3.1.1) with different “approximations” 
for the initial value Ip = e — 1. Using the Maple” command 
rsolve( f{a(n+1) = e-(nt+1) * a(m), a(0) =e-1 }, fal); 
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we fill out Table 143, where 
. x a 
€n(2) wy + x2 or t 3r t aaa (3.1.15) 


is the incomplete exponential function. This example demonstrates the danger in applying difference equations 
without sufficient prior analysis. 


As we can see in Example 3.1.3, a small change in the initial value for Jp produces a large change in the solution. 
Such problems are said to be ill-conditioned. This ill-conditioning can be inherent in the problem itself or induced 
by the numerical method of solution. 

Consider the general first order difference equation (3.1.4) subject to the initial condition yo = a. Let z, bea 
solution of the same recurrence relation (3.1.4) with the given initial value z9 = yo + ¢. The iterative procedure 
yields 


21 = pozo + Go = po(yo + €) + G0 = Y1 + Poe, 
2= maton =pilyr t+ poe) +H = piyi + 1 + pipoe = yo + P1P0E, 


and in general 


zn = Yn + Pn—1Pn—2 +++ Poe. 


Clearly, after n applications of the recurrence, the original error ¢ will be amplified by a factor pp—ipn—2-.-DPo- 
Hence, if |pz| < 1 for all k, the difference |y, — z,| remains small when ¢ is small and the difference equation is said 
to be absolutely stable; otherwise we call it unstable. If the relative error |y, — Zn|/|yn| remains bounded, then 
the difference equation is said to be relatively stable. 


Table 143: Solution of Eq. (3.1.1) for initial conditions rounded to various decimal places. (a) Exact value expressed 
with 7 decimal places; (b) 6 places; (c) 5 places, and (d) 3 places. 


1.7182818 1.718282 1.7183 1.718 
1.0 1.000000828  1.000081828 = 1.008281828 

.718281828 — .718280172 .718118172 .701718172 

.963436344 563441312 .963927312 .613127312 


265 e — 720 344684420 — .344088260 .285768260 = -5.618231740 


12! [Ip —e te e12(—1)] .114209348 -396.4991155 -39195.62872 —.3967 x 10” 


Example 3.1.4: We reconsider the recurrence (3.1.1), page 139, and let {z,,} be its solution subject to the initial 
condition zo = Ip + ¢, where ¢ is a small number (perturbation). Then z, = I, + (—1)"n!e and the difference 
|Z, — I,| = n!e is unbounded. Therefore, the difference equation (3.1.1) is unstable. 

The general solution of the recurrence relation (3.1.1) is 


In = (-1)" n! [Ip + e(€n(—1) — 1] = (-)"n! b = (1 7 x af x eee =)| ; 


which can be verified by substitution. Here e,, (x) is the incomplete exponential function (3.1.15). Hence, the relative 


error 
ee ee, 
In I, Io + e(€n(—1) — 1) Ion +1-—e 
because e€,(x%) + e” as n — oo. Since Ig = e — 1, we conclude that the recurrence relation (3.1.1) is absolutely 
unstable. 
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Example 3.1.5: Consider the following first order nonlinear difference equation 
Uk+1 — Uk = (b— aug)uR, k=0,1,2,..., uo is given. (3.1.16) 


This is a discrete logistic equation that describes the population of species uz at the k-th year when a, b are positive 
numbers. For instance, the logistic model that fits the population growth in the U.S. for about hundred years until 
1900 is as follows: 

Unga = 1.351 uz, — 1.232 x 10-Puz. 


However, this model cannot be used after 1930, which indicates that human population dynamics is more complicated. 
Actually Eq. (3.1.16) is the discrete Euler approximation of the logistic differential equation (see §2.6, page 96) when 


U ey u(th41) = u(tp) 


d 
the derivative with respect to time is replaced by the finite difference — at point t = ty. It is 


thoi — tk 
more convenient to rewrite the logistic equation (3.1.16) in the following form: 


Yk+1 = ryk(1— ye), R= 0,1 2,444, (3.1.17) 


where r = b+1 and au, = (b+1)yx. Equation (3.1.17) turns out to be a mathematical equation with extraordinary 
complex and interesting properties. Its equilibrium solutions can be found from the equation r(1—y)y = y, so there 
are two such solutions: y = 0 and y = 1-—1/r. 

Because of its nonlinearity, the discrete logistic equation is impossible to solve explicitly in general. Two particular 
cases are known when its solution can be obtained: for r = 2 and r = 4. The behavior of the solution to Eq. (3.1.17) 
depends on the initial condition, yo, that together with the value of the parameter r eventually determines the trend 
of the population. Recurrence (3.1.16) manifests many trademark features in nonlinear dynamics. There are known 
three different types of possible behavior of solutions to the discrete logistic equation: 

Fixed: The population approaches a stable value either from one side or 
asymptotically from both sides. 
Periodic: The population alternates between two or more fixed values. 
Chaotic: The population will eventually visit every neighborhood in a subinterval 
of (0,1). 
The equilibrium solution y, = 0 is stable for |r| < 1, and another constant solution y, = 1—1/r is stable for 
€ (1,3). A stable 2-cycle begins at r = 3 followed by a stable 4-cycle at r = 1+ V6 ~% 3.449489743. The period 
continues doubling over even shorter intervals until around r = 3.5699457 ..., where the chaotic behavior takes over. 
Within the chaotic regime, there are interspersed various windows with periods other than powers of 2, most notably 
a large 3-cycle window beginning at r = 1+ V8 © 3.828427125. When the growth rate exceeds 4, all solutions zoom 
to infinity, which makes this model unrealistic. 


1. For the recurrence relation (3.1.2), 


(a) find exact expressions for Ss, S6,..., S12; 
(b) express the sequence {z,} via {S;,} where z, satisfies (3.1.2) subject to the initial conditions zo = 2+€0, 21 = ™+€1; 
(c) based on part (b), determine the stability of the recurrence (3.1.2). Is it ill-conditioned or not? 
2. Calculate P(x), P’(x), and P’(x) for x = 1.5, where 
(a) P(x) = 42° + 5a — 622 —72?+82+9; (b) P(x) =a° — 2244 3x3 — 4x? + 5a — 6; 
(c) P(x) = 6a° —4a*4 82° — 32? +6r+3; (d) P(x) =2°4 62* — 323 — 827 4 32-1. 
3. Solve the given first order difference equations in terms of the initial value ag. Describe the behavior of the solution as 
n— co. 
(a) Ynt1 = 0.5 yn; (b)  ynt1 = 5 yn; (C) Yat = 4/ SS yn: 
(d)  ynti =(-2)"47? yn; (€)  ynt = 0.89n +20; (fF) yng =—1-5 yn — 1. 
4. Using a computer solver, show that the first order recurrence #n41 = 1+ an — 22 /4 subject to the initial condition 


xo = 7/4 converges to 2. Then show that a similar recurrence rn41 =1+2n x2 /4 - ern —2 /2 under the same initial 
condition diverges. 


5. Using a computer solver, show that the first order recurrence %n41 = %n/2+ 2/an subject to the initial condition 
xo = 7/4 converges to 2. Then show that a similar recurrence &n41 = Un/2+2/an — 4 (1 | 10'? (an - >A) under 
the same initial condition converges to —2. 
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. An investor deposits $250,000 in an account paying interest at a rate of 6% compounded quarterly. She also makes 


additional deposits of $4,000 every quarter. Find the account balance in 5 years. 


. A man takes a second mortgage of $200,000 for 30-years period. What monthly payment is required if the interest rate 


is 4%? 


. Find the effective annual percentage yield of a bank account that pays an interest rate of 2% compounded weekly. 


9. A college student borrows $40,000 for a flashy car. The lender charges an annual interest rate of 5.5%. What monthly 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


payment is required to payoff the loan in 10 years? What is the total amount paid during the term of loan? 


If the interest rate on 15-years mortgage is fixed at 3.375% and $500 is the maximum monthly payment the borrower 
can afford, what is the maximum mortgage loan possible? 


A man wants to purchase a yacht for $200,000, so he wishes to borrow that amount at the interest rate 6.275% for 10 
years. What would be his monthly payment? 


A home-buyer wishes to finance a mortgage of $250,000 with a 15-year term. What is the maximum interest rate the 
buyer can afford if the monthly payment is not to exceed $2,000? 


Due to natural evaporation, the amount of water in a bowl or aquarium decreases with time. This leads to an increase 
of sodium concentration (as ordinary table salt, NaCl) that is present in almost every water supply. The amount of 
salt is unchanged, and eventually its concentration will exceed the survivable level for the fish living in the fishbowl. 
To avoid an increase of sodium concentration, a certain amount of water is periodically removed and replaced with 
more fresh water to compensate for its evaporation and deliberate removal. Let x, be the amount of salt present in the 
aquarium at the moment of the n-th removal of water. Then z,, satisfies the recurrence %n41 = Zn +7 — 6%n. Solve 
this first order difference equation assuming that parameters r and 6 are constants, and the initial amount of salt is 
known. 


Let the elements of a sequence {wn }n>o satisfy the inequalities 
Wn41 < (1+ a)un+ B, n=0,1,2,..., 
where a and B are certain positive constants, and let wo = 0. Prove (by induction) that 


22 b=), n=0,1,2,.... 
a 


Consider the discrete logistic equation (3.1.16). For what positive value of ux, is ux+1 = 0? What positive value of uz, 
provides the maximum value of ux41? What is this maximum value? 
Let g(y) = ry(1 — y) be the right-hand side of the logistic equation (3.1.17). 

(a) Show that g(g(y)) = r?y(1—y)(1—ry+y"). 

(b) By solving the equation y = g(g(y)), show that for any r > 3, there are exactly two initial conditions yo = 


(l+r+ Vr? — 2r —3)/(2r) within the unit interval (0,1) for which the discrete logistic recurrence (3.1.17) has 
nontrivial 2-cycles. 


Determine the first order recurrence and the initial condition for which the given sequence of numbers is its solution. 


1 1 1 il 
(b) an=1454-04=5; (0) ie La 


~ Any)? n n 


(a) an 


Starting at xo = 0.5, how many iterations are needed to find the root of the quadratic equation x? + « — 2 =0 with 7 
exact decimal places using each of the following recurrences. 


2 
(a) tny1 = 2-23; (b) en = 2-1; 
2+ a2 i a +an—2 
n4+1 = > d n = In — 3 
SS To (4) Snt1 = On — Fe tek + Bak + Oe, +2 
2 ts ve +403 —6an+1 
at1 = £2 — > 


(2% +1)? 


Consider Tower of Hanoi recurrence 


hn =2hn-1t1 (n=1,2,...), ho =0. 


For this sequence of (integer) numbers {hn }n>0 we can assign an infinite series H(z) = )7,,) hnz”, called the generating 
function. Find the generating function for the sequence of the Tower of Hanoi. Note: the Tower of Hanoi puzzle was 
introduced by the French mathematician Edouard Lucas (1842-1891) in 1889. 
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3.2. Euler’s Methods 


In this section, we will discuss the numerical algorithms to approximate the solution of the initial value problem 


ae = f(a, y); y(xo) = Yo (3.2.1) 


assuming that the problem has a unique solution, y = ¢(x) (see §1.6 for details), on some interval |a, b| including 
Xo, where usually the left endpoint a coincides with zo. 

A numerical method frequently begins by imposing a partition of the form a = 1% < 41 < @ << ++: < @y_1 < 
xn = 6 of the z-interval [a,b]. For simplicity, these points, called the mesh points, are assumed to be uniformly 
distributed: 

ty =Xto th, r2 =X + 2h,...,%, =XotNh=Ay,i+h, n=0,1,...,N, (3.2.2) 


where h = — . is the step size. Note that in practice, the uniform grid is used very rerely. The number N of 


subintervals is related to the step size by the identity b-— a = Nh. Therefore, the uniform partition of the interval 
[a, 6] is uniquely identified by specifying either the step size h or the number of mesh points N. The value h is called 
the discretization parameter. At each mesh point x,, the numerical algorithm generates an approximation y,, to the 
actual solution y = ¢() of the initial value problem (3.2.1) at that point, so we expect yp, © 6(@n,) (n = 1,2,...,N). 
Note that the initial condition provides us an exact starting point (20, yo). 

A preeminent mathematician named Leonhard Euler was the first to numerically solve an initial value problem. 
Leonhard Euler was born on April 15, 1707 in Basel, Switzerland and died September 18, 1783 in St. Petersburg, 
Russia. He left Basel when he was 20 years old and never returned. Leonhard Euler was one of the greatest 
mathematicians of all time. After his death, the St. Petersburg Academy of Science (Russia) continued to publish 
Euler’s unpublished work for nearly 50 more years and has yet to publish all his works. His name is pronounced 
“oiler,” not “youler.” 

In 1768, he published (St. Petersburg) what is now called the tangent line method, or more often, the Euler 
method. This is a variant of a one-step method (or single-step method) that computes the solution on a step- 
by-step basis iteratively. That is why a one-step method is usually called a memory-free method: it performs 
computation of the solution’s next value based only on the previous step and does not retain the information in 
future approximations. In the one-step method, we start from the given yo = y(xo) and advance the solution from 
Zo to x; using yo as the initial value. Since the true value of the solution at the point = x, is unknown, we 
approximate it by y, according to a special rule. Next, to advance from x; to x2, we discard yo and employ y; as 
the new initial value. This allows us to find yz, approximate value of the solution at x = x2, using only information 
at the previous point « = x;. And so on, we proceed stepwise, computing approximate values {y,,} of the solution 
y = $(x) at the mesh points {rn }n>o.- 

In the initial value problem (3.2.1), the slope of the solution is known at every point, but the values of the 
solution are not. Any one-step method is based on a specific rule or algorithm that approximates the solution at the 
right end of a mesh interval using slope values from the interval. From the geometric point of view, it defines the 
slope of advance from (a, Yn) to (@r41, Yn+1) Over the mesh interval [2,,%n4+41]. Its derivation becomes more clear 
when we replace the given initial value problem by its integral counterpart. If we integrate both sides of Eq. (3.2.1) 
with respect to x, we reduce the initial value problem to the integral equation 


ya) =o + | F(s,u(s))as. (3.2.3) 
xo 
After splitting the integral by the mesh points z,, n= 1,2,..., we obtain 
Ly 
y(#1) = yor f(s, y(s)) ds, 
ra 
y(x2) = yot | f(s,y(s))ds 


xo 


Syed Seyayaes / eco / ” Sealy 
joe ae Hu ayas 
= wtf sive)as+ [ fisuls)as=m+ [ fo.u(s))as, 


xo 
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Figure 3.1: Two Euler semi-linear approximations (in black) calculated for two step sizes h = 1 and h = 1/2 along 
with the exact solution (in blue) to the linear differential equation y = 2 cos(t) + y(t) — 1 subject to y(0) = 0, plotted 
with Mazima. 


and so on. In general, we have 
Tn+1 


lors) =ylan) +f Fls,uls))as. (3.2.4) 
Ln 

This equation cannot be used computationally because the actual function y(s) is unknown on the partition interval 

[2n,2n+41] and therefore the slope function f(s, y(s)) is also unknown. Suppose, however, that the step length h is 

small enough so that the slope function is nearly constant over the interval x, < s < %p+ 1. In this case, the crude 

approximation (rectangular rule) of the integral in the right-hand side of Eq. (3.2.4) gives us gee f(s, y(s)) ds & 

h f (an, y(@n)). Using this approximation, we obtain the Euler rule: 


Yn+1 = Yn + (fn41 — En) f (Ln, Yn) or Yn41 = Yn +hfn; (3.2.5) 


where the following notations are used: h = &n41 —2n, fn = f(@n, Yn), and yn denotes the approximate value of 
the actual solution y = ¢(«) at the point x, (n = 1,2,...). A new value of y at = 2,41 is predicted using the 
slope at the left end x,, to extrapolate linearly over a mesh interval of size h. This means that the Euler formula is 
asymmetrical because it uses derivative information, y’ = f(@n, Yn), only at the beginning of the interval [#p, %n+1]. 
Note that the step size h can be either positive or negative, giving approximations to the right of the initial value 
or to the left, respectively. 

Equation (3.2.5) is the difference equation or recurrence of the first order associated with the Euler method. 
Solving recurrences numerically may lead to instability and further serious analysis is usually required [13]. For 
some simple slope functions, the recurrence (3.2.5) can be solved explicitly. Sometimes it is beneficial to transfer 
the difference equation to another form that is more computationally friendly. To compute numerically the integral 
on the right-hand side of Eq. (3.2.3), we apply the left-point Riemann sum approximation to obtain the so-called 
quadrature form of the Euler algorithm: 


trei=yoth> fleiy),  n=0,1,.... (3.2.6) 
j=0 


Let us estimate the computational cost of Euler’s algorithms (3.2.5) and (3.2.6). The dominant contribution of 
these techniques is nf, the number of arithmetic operations required for evaluating f(x,y), which depends on the 
complexity of the slope function. At each step, the difference equation (3.2.5) uses 1 addition (A), 1 multiplication 
(M) by h, and ny operations to evaluate the slope. Therefore, to calculate y,, we need n(A + M+ ny) arithmetic 
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operations, where A stands for addition and M for multiplication. On the other hand, the full-history recurrence 
(3.2.6) requires much more effort: M + $n(n — 1)A + 4n(n—1)ny arithmetic operations. However, with clever 
coding the quadratic term n(n — 1)/2 can be reduced to the linear one: 


Skt41 = 8k + f (£41, Yo + Asx), so = f (Xo, Yo) (k =0,1,...), 


where s, = ar f (ai, ys) is the partial sum in the quadrature form (3.2.6). Evaluation of s,41 requires 2 additions, 
1 multiplication by h, and 1 value of the slope function, so the total cost to evaluate the partial sum for y, is 
(n—1)(2QA+1M+ny)+ny. Using this approach, the full-history recurrence (3.2.6) can be solved using (2n— 1)A+ 
n(M + ny) arithmetic operations, which requires only (n — 1) more additions compared to the Euler rule (3.2.5). 

The Euler algorithm, either (3.2.5) or (3.2.6), generates a sequence of points (xo, yo), (41, yi), -.- on the plane 
that, when connected, produces a polygonal curve consisting of line segments. When the step size is small, the naked 
eye cannot distinguish the individual line segments constituting the polygonal curve. Then the resulting polygonal 
curve looks like a smooth graph representing the solution of the differential equation. Indeed, this is how solution 
curves are plotted by computers. 


Example 3.2.1: Let us start with the linear differential equation 
y =2 cos(t) + y(t) — 1, 


having the oscillating slope function. First, we find its general solution by typing in Maple 

dsolve(D(y) (t) = 2*cos(t)+y(t)-1,y(t)) 

This yields y = ¢(t) = 1 +sint — cost + Ce‘, where C = y(0) is an arbitrary constant. We consider three initial 
conditions: y(0) = 0.01, y(0) = 0, and y(0) = —0.01, and let y = di (t), y = do(t), and y = ¢_1(t) be their 
actual solutions, respectively. The function ¢9(t) = 1+ sint — cost separates solutions (with y(0) > 0) that grow 
unboundedly from those (with y(0) < 0) that decrease unboundedly. Maple can help to find and plot the solutions: 


dsolve({D(y) (t) = 2*cos(t)-y(t)+1,y(0)=0.01},y(t)) 
u := unapply(rhs(%), t); plot(u(t), t =0.. 1) 
Now we can compare true values with approximate values obtained with the full-history recurrence (3.2.6): 


Yn41 =0.01— h(n +1) + 2h cos(jh)+hSoy;, yo = 0.01, 
j=0 j=0 


where y,, is an approximation of the actual solution y = ¢(t) at the mesh point t = t, = nh with fixed step size h. 
For instance, the exact solution has the value ¢(1) = 1.328351497 at the point t = 1, whereas the quadrature formula 
approximates this value as yi9  1.2868652685 when h = 0.1 and yio9 & 1.3240289197 when h = 0.01. The Euler 
algorithm (3.2.5) provides almost the same answer, which differs from the quadrature output in the ninth decimal 
place (due to round-off error): — Yn41 = Yn + h(2costy + Yn — 1). 

The following MATLAB code helps to find the quadrature approximations: 


yO=0.01; tN=2*pi; h=.01; % ‘final point is tN 
N=round(tN/h) ; % rounded number of steps 
y (N)=0; % memory allocation 
n=0; si=cos(n); % sum cos(jh) for n=0 
s2=y0; % sum of y(j) for j=0 
y (1) =yO-h* (n+1)+2*h*sith*s2; 
for n=1:N-1 
si=sit+cos(n*h); % sum of cos(jh), j=0,1,...,n 
s2=s2ty(n); % sum of y(j), j=0,1,...,n 
y (nt+1)=y0O-h* (n+1)+2*h*s1th*s2; 
end 


plot((0:1:N)*h, [yO y]); 


Example 3.2.2: Let us apply the Euler method to solve the initial value problem 


2y’'+y=32, (0) =0. 
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The given differential equation is linear, so it has the unique solution y = ¢(x) = 32 — 6 + 6e~*/?. 


equation (3.2.4) for f(a, y) = (3a — y)/2 becomes 


yon) = yen) +5 f Bs—yls))As, 


The integral 


2 


n 


and the Euler algorithm can be written as 
3 h 
Ynt1 = g itn + 1-5 Urs yo=0, %n=nh, n=0,1,2,.... (3.2.7) 
According to Eq. (3.1.6), this linear first order constant coefficient difference equation has the unique solution 
h n 
m=6 (1-5) — 64+ 3hn, n=0,1,2,.... (3.2.8) 


The natural question to address is how good is its approximation, namely, how close is it to the exact solution 
(a)? A couple of the first Euler approximations are not hard to obtain: 


3 9 3 3 
= — ip? — i p7p?—=p73 = 9h? — h - hi. 
w=0, wash, =a qh y= 9 Bhi + oh 
In general, 
1 
Yn = + n(n 1)h? 3 n(n —1)(n— 2) hP + 
Using the Maclaurin series for the exponential function, e~’ = 1 —t+ f - f +:++, we get 
1 tye 3 1 1 
an —a/2 _ — (-5) Car eee ae ee 2 
g(x) = 3x — 6 + Ge Dar 5 a2 — Rt + ee ; (3.2.9) 
So, for « = 2, = nh (n = 2,3,4,...), we have 
3 n3 nA 
h)=— 2p? ete h? je h* 
es aris 8 64 


Therefore, we see that the error of such an approximation, $(2,) — yn, is an alternating series in h starting with h?. 
So we expect this error to be small when h is small enough. 

Our next question is: can we choose fh arbitrarily? Unfortunately, the answer is no. Let us look at the general 
solution (3.2.8) of the Euler algorithm that contains the power (1—h/2)”. If 1—h/2 < —1, that is 4 < h, the power 
function x” for x = 1 — h/2 < —1 oscillates and the approximate solution diverges. 

Because Euler’s method is so simple, it is possible to apply it “by hand.” However, it is better to use a software 
package and transfer this job to a computer. Comparisons of the exact solution with its approximations obtained 
by the Euler method are given in Figure 3.2, plotted using the following Mathematica® script: 


flx_, y_] = (3*x - y)/2 

philx_] := 3*x - 6 + 6*Exp[-x/2]; 

x[0] = 0; y[0] = 0; h = 0.1; n = 10; 

Do[x[k + 1] = x[k] + h; y[k + 1] = ylk] + f[x[k], y[k]]*h; , fk, 0, n}] 
data = Table[{x[k], y[k]}, {k, 0, n}] 

Show[Plot[philx], {x, 0, 1}], ListPlot[data, PlotMarkers -> Automatic]] 


Similar output can be obtained with the following Maple commands: 


h:=0.1: n:=10: xx:=0.0: yy:=0.0: # initiation 
points:=array(0..n) points[0] :=[xx,yy] 
for i from 1 to n do 

yy :=evalf (xxth*f(xx,yy)) xx:=xxth 

points [i] :=[xx,yy] 
od 
plotpoints := [seq(points[i], i=0.. n)] 
ploti := plot(plotpoints, style = point, symbol = circle) 
plot2 := plot (3*x-6+6*exp(-.5*x), x = 0 .. 1) 
plots[display] (ploti, plot2, title = ‘Your Name‘) 
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—a/2 


Figure 3.2: Example 3.2.2, a comparison of the exact solution y = 3x — 6 + 6e with the results of numerical 


approximations using the Euler method with step size h = 0.1. 


Sometimes the Euler formulas can be improved by partial integration over those terms that do not contain 
an unknown solution on the right-hand side of Eq. (3.2.3). For instance, in our example, the term 3s/2 can be 
integrated, leading to the integral equation 


3 1 feet 
yleenas) = yl) +5 (aR, 02) —5 ful) as 


n 


Application of the right rectangular rule to the integral yields 


3 h 
Unt = 3 h? (2n+1)+ (1 - 5) Uns (3.2.10) 
which has the unique solution 
h h\” h 
m=3 (2-3) (1-5) +3hn-3 (2-3), TO) V2) ees (3.2.11) 


Comparing the value yi9 = 0.59242 obtained by the algorithm (3.2.7) and the value yi9 = 0.652611 obtained by the 
algorithm (3.2.10) for h = 0.1 and n = 10, we see that the latter is closer to the true value (1.0) = 0.63918. 


The previous example shows that there exists a critical step size beyond which numerical instabilities become 
manifest. Such partial stability is typical for forward Euler technique (3.2.5). To overcome stringent conditions on 
the step size, we present a simple implicit backward Euler formula: 


Ynt1 = Yn + (@r4 _ ta) i Gass te) = Yn +hfn4i- (3.2.12) 


Since the quantity fr41 = f(@n+41, Yn+1) contains the unknown value of y,+1, one should solve the (usually nonlinear 
with respect to Yn+1) algebraic equation (3.2.12). If f(a#,y) is a linear or quadratic function with respect to y, 
Eq. (3.2.12) can be easily solved explicitly. (Cubic and fourth order algebraic equations also can be solved explicitly 
but at the expense of complicated formulas.) Otherwise, one must apply a numerical method to find its solution, 
making this method more computationally demanding. Another drawback of the backward Euler algorithm consists 
in the nonuniqueness of y,+1 in the equation Yn41i = Yn +h f(a@n41, Yn41): it may have multiple solutions and the 
numerical solver should be advised which root it should choose. 


Example 3.2.3: Consider the initial value problem for the nonlinear equation 


1 
y= = (e —y+2)71, y(0) = 1. (3.2.13) 
Z-yt2 
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Application of algorithm (3.2.12) yields 


h 
Yn+1 = Yn + =. oo a —=> Uni — Yn+1 (2+ an41 + Yn) + Yn(2+ an) +h =0. 
Tn+1— Yn+1 + 2 


This is a quadratic equation in yn+1, and we instruct Maple to choose one root: 


f:=(x,y)-> (x-y+2)7(-1); xf0]:= 0: yl0]:= 1: h:= 0.1: 
for k from 0 to 9 do 
x[k+1]:= x[k]+h: bb := solve(b=y[k]+h*f(x[k+1],b),b); 
y(k+1]:= evalf(min(bb)) ; 
end do; 
a := [seq(xfi], i=0 .. 10)]; b := [seq(yli], i=0 .. 10)]; 
pair:= (x, y) -> [x, y]; dplot := zip(pair, a, b); 
plot(dplot, color = blue) 


In the code above, the command solve is applied because the quadratic equation in y [k+1] has an explicit solution. 
In general, the fsolve command should be invoked instead to find the root numerically. Similarly, Maxima can be 
used for calculations: 


load(newton1)$ 
f(x,y) := 1/(x-y+2); xf[0]:0$ y[0]:1$ h:0.1$ 
for k:0 thru 9 do ( 
x[k+1]: x[k] +h, 
y(k+1]: newton(-bty [k] th*f (x[k+1],b), b, y[k], 1e-6) ); 


Fortunately, the given IVP has the unique solution (consult §1.6) y = «+1. However, when the initial condition 
is different from 1, say y(0) = 0, the corresponding actual solution can be expressed only in implicit form: y = 
Inja—y+]|. 


Table 151: A comparison of the results for the numerical solution of y/ = (a — y+ 2)~+, y(0) = 0, using the 
backward Euler rule (3.2.12) for different step sizes h. 


x Exact h=0.1 h = 0.05 h = 0.025 h=0.01 

1.0 0.442854401 0.436832756 0.439848517 0.441352740 0.442254051 
2.0 0.792059969 0.781058081 0.786559812 0.789310164 0.790960127 
3.0  1.073728938  1.058874274  1.066296069 1.070011211  1.072241553 


There is too much inertia in Euler’s method: it keeps the same slope over the whole interval of length h. A better 
formula can be obtained if we use a more accurate approximation of the integral in the right-hand side of Eq. (3.2.4) 
than the rectangular rule as in Eq. (3.2.5). If we replace the integrand by the average of its values at the two end 
points, we come to the trapezoid rule: 


h 
Yntl = Yn + 5 lf len, Yn) oe f(tn41, Yn+1)]- (3.2.14) 


This equation defines implicitly y,41 and it must be solved to determine the value of y,+1. The difficulty of this 
task depends entirely on the nature of the slope function f(z, y). 


Example 3.2.4: (Example 3.2.2 revisited) | When the differential equation is linear, the backward Euler 
algorithm and the trapezoid rule can be successfully implemented. For instance, application of the backward Euler 
method to the initial value problem from Example 3.2.2 yields 


Yn Shanyi/2 Yn 3h?2(n + 1) 
eS TO. Lae A Ce 


h 
Yn+1 = Yn + 3 (3tn41 —Yn41) => 


since 2, = nh (n = 0,1,2,...). At each step, the backward algorithm (3.2.12) requires 10 arithmetic operations, 
whereas the forward formula (3.2.7) needs only 6 arithmetic operations. Solving the recurrence, we obtain 
nh? 


2 n 
Yn =6 (=) + 3hn — 6 = n(n —1)k? — 2 (n? + 3n +2) +0 
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Figure 3.3: The backward Euler approximations (in black) along with the exact solution (in blue) to the linear 
differential equation y = 2 cos(t) + y(t) — 1, plotted with Maxima. 


Comparison with the exact solution (3.2.9) shows that the error using backward Euler formula of this approximation, 
¢(2n) — Yn; is again proportional to h?. 
Now we consider the trapezoid rule, and implement the corresponding Mathematica script: 


x= 0; y=0; X=1; h=0.1; n = (X - x)/h; 

Forfi=1, i<n+i41, it+, x=x+t+h; 

y = Y /. FindRoot [Y == y + (h/4)*(3*(2*x - h) + y + Y), {Y, yh] 
Printly] (* to see outcome of computations *) 


Table 152: A comparison of the results for the numerical solution of 2y’ + y = 3a, y(0) = 0 using the implicit 
trapezoid procedure (3.2.14) for different step sizes h. 


x Exact h=0.1 h = 0.05 h=0.025 h=0.01 

0.1 0.00737654 0.0073170 0.07361682 0.00737283 0.00737595 
0.5 0.17280469 0.1725612 0.17274384 0.17278948 0.17280226 
1.0 0.63918395 0.6388047 0.63908918 0.63916026 0.63918016 


The algorithm (3.2.14) leads to the following (implicit) recurrence: 


h h 
Yn+1 = Yn + 4 (32 = Yn) = a (32n41 = Yn+1); n=0,1,2,.... 


Solving for yn+i1, we get 


— (L=h/4)yn + 8h(an +2n41)/4  4—h fs 3h?(2n + 1) 
Hie 1+h/4 ee 4+h ” 


which requires 12 arithmetic operations. If we calculate y,, for the first few values of n, 


ahr (4—h)3h2 Oh? (4—h)?3h? (4—h)9h?_—— 15h? 
y=, +, op = oe t+ t+ 
(4+h)2 4+h’ (4+ h)3 (4+ h)? A+h’ 
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then an unmistakable pattern emerges: 


A—h\" 3 nh3 
n= —— hn —6 = — (nh)? — — (2n? = 1) bese. 
y 6 (=) + 3hn —6 q (nh) ig 2" +8 )+ 
Hence, the difference ¢(nh) — yn between the true value and its approximation is proportional to h? because ¢(nh) = 


3 (nh)? — MAS 4... 


There are several reasons that Euler’s approximation is not recommended for practical applications, among them, 
(i) the method is not very accurate when compared to others, and (ii) it may be highly erratic (see Example 3.2.2 
on page 148) when a step size is not small enough. 

Since, in practice, the Euler rule requires a very small step to achieve sufficiently accurate results, we present 
another explicit method that allows us to find an approximate solution with better accuracy. It is one of a class 
of numerical techniques known as predictor-corrector methods. First, we approximate y,+1 using Euler’s rule 
(3.2.5): yhiy = Yn +h f (tn, yn). It is our intermediate prediction, which we distinguish with a superscript *. Then 
we correct it by substituting y7,, instead of the unknown value of y,,+41 into the trapezoid formula (3.2.14) to obtain 


Ynt1 = Yn + “(Fen Yn) + f(@n41; Yn +hF (Tn, Yn))], m2 =0,1,2,.... (3.2.15) 
This formula is known as the improved Euler formula or the average slope method. Equation (3.2.15), which is 
commonly referred to as the Heun** formula, gives an explicit formula for computing y,+1 in terms of the data at 
Lp and &,41. Figure 3.4 gives a geometrical explanation of how the algorithm (3.2.15) works on each mesh interval 
[%,%n+41]. First, it finds the Euler point (#41, y%41) as the intersection of the vertical line x = #41 and the 
tangent line with slope ky = f(@n, Yn) to the solution at the point « = x,. Next, the algorithm evaluates the slope 
ko = f (Pyicus 4a) at the Euler point. At the end, the improved Euler formula finds the average of two slopes 
k= 4 (ky + kg) and draws the straight line with the slope k to the intersection with the vertical line « = 2,41. This 
is the Heun point that the algorithm chooses as y,+1, the approximation of the solution at x = %pn41 (see Fig. 3.4). 


Example 3.2.5: Let us consider the initial value problem (3.2.1) with the linear slope function f(x,y) = 3 (3a +y) 
and the initial condition y(0) = 0. Its analytic solution is 


1 1 
=r +—az+--- 


t — 1 3 
y = oe) = 6c"? 32-6 = 6) = () =a? + = a 


k=2 
In our case, the improved Euler formula (3.2.15) becomes 


he he 3h ah? hh? Bh? ahf 
Yn+1 = Yn (14545) + Fem +n ann (+545) 4 Satan eS 


because &p, = nh, In41 = (N+ Lh, Yn + hfn = Yn + A(3 an + 5 Yn) = Yn(1 + B) + oe f(@n415Yn + hfn) = 
2 tnd + 5 Yn(1 + 4) + 3h Zn. To advance on each mesh interval, the average slope method requires 11 arithmetic 
operations while the Euler rule (3.2.7) uses about half as many arithmetic operations—just 6. 


To facilitate understanding, let us write the two first steps of the improved Euler algorithm explicitly. Using a 
uniform partition with a step size h, we start with zo = 0, yo = 0. Then 


a=h, yw =yot* f(to, yo) +4 f(xi, yo + hf (xo, yo) = 2 ?; 


2 
t2=2h yo =yit A(ay +22) + 4y4+ 4 Ba1+y1) 


100 


2 2 
=y + 3h(3hr)+4y4+53ht+ hy =3n? +3n8 4+ 3 a4. 
Since the slope function is linear, the corresponding Heun recurrence can be solved explicitly: 


D e\ 


34Karl Heun (1859-1929) was a German mathematician best known for the Heun differential equation that generalizes the hypergeo- 
metric differential equation. Approximation (3.2.15) was derived by C. Runge. 
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Heun point (an+41, yn41), correction 


— Slope 4 (k + ke) 


<—— Slope ko = f(@n41, 9741) 


Euler point (%n41,y7,41), prediction 


Slope ki = f (an, Yn) 


En+1 


Figure 3.4: The improved Euler approximation for the one step to the solution of the linear differential equation 
y = 2 cos(t) + y(t) — 1 subject to the initial condition y(0) = yo > 0. 


Using the series command from Maple, we get y, as a function of h: 


3 
Yn =F nh? =F 


Therefore, the difference (nh) — yn is the series in h starting with h°. 


2 = 2 = 
n(n* — 1) 1. n(n — 1)(n* +n — 3) 
4 8 64 


he pee, n=0,1,2,.... 


When a slope function is nonlinear in y, it is generally impossible to find an exact formula for y,. In this case 


one can use a software solver. With Mathematica, we have 
For{i=1, i<n+1, i=i+ 1; k1 = Fit,y]; 

k2 = F[tth,y+h*k1]; y = yth(k1+k2)/2; t = tth]; 
Print Ly] 


MATLAB does a similar job: 


function y=heun(h,t0,y0,T) ,t=t0; y=y0; n=round(T-t0)/h; 


for j=i:n 

ki = f(t,y); k2 = f£(tth,y+th*k1); 

y = yth*(ki+k2)/2; t =t0+j+*h; disp(I[t,y]) 
end 

Similarly with Maple: 

impeu:=proc(f,a,b,A,N) 

local n, xn, yn, h, ki, k2, w; 


h:=evalf((b-a)/N); xn:=evalf(a); yn:=evalf(A); w[0]:=[xn,yn]; 


for n from 1 to N do 
ki:=evalf(f(xn,yn)); k2:=evalf(f(xnth, ynth*k1)) ; 


yn:=evalf (ynth*(k1+k2)/2); xn:=evalf(xnth); w[n]:=[xn,yn]; end do; 


vv:=impeu(f,0,1,0,N); 
for n from 0 to N do 
print (x| |n=vv[n] [1] ,yel In=vv[n] [2]); 0d; 
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Table 154: A comparison of the results for the numerical solution of the initial value problem y = 27 y cos(2at), 
y(0) = 1, using the improved Euler method (Heun formula) and the forward Euler rule for different step sizes h. 


The actual solution is y = $(t) = e™C™), 
Heun: 
t Exact h=0.1 h = 0.05 h = 0.025 h=0.01 
0.5 1.0 0.9803425392  0.9982313528 0.9998019589 0.9999877448 
1.0 1.0 0.9610714964  0.9964658340 0.9996039591 0.9999754899 
1.5 1.0 0.9421792703  0.9947034358 0.9994059972 0.9999632284 
3.0 1.0 0.8877017706 0.9894349280 0.9988123511 0.9999264662 
Euler: 
0.5 1.0 1.162054041 1.071163692 1.034443454 1.013584941 
1.0 1.0 0.3082365926 0.5988071290 0.7795371389  0.9058853939 
1.5 1.0 0.3581875773  0.6414204548 0.8063870900 0.9181917922 
3.0 1.0 0.02928549567 0.2147142589 0.4737076882 0.7433952325 


Example 3.2.6: The actual solution of the separable (linear) differential equation y = 27 y cos(27t) subject to the 
initial condition y(0) = 1 is the periodic function y = ¢(t) = e#"@7), Table 154 shows that the improved Euler 
method (3.2.15) gives good approximations even for relatively large step sizes. On the other hand, the forward Euler 
method (3.2.5) has considerable difficulty keeping up with the actual solution, and requires a very small step size to 
give a reasonable approximation. 


Another modification of Euler’s method is known. Using the midpoint rule for the evaluation of the integral on 
the right-hand side of Eq. (3.2.4), we obtain the so-called modified Euler formula, or explicit midpoint rule, or 
midpoint Euler algorithm: 


h h 
Ynt1 = Un thf (m+5 ; m+ 5 Semin) n=0,1,2,.... (3.2.16) 


2 


This formula reevaluates the slope halfway through the line segment by taking two trial slopes, ki(a,y) = f (2, y) 
and ko(a,y;h) = f (a+h/2,y + hk /2), and then using the latter as the final slope. Therefore, the midpoint rule 
is another example of the predictor-corrector method: it extrapolates the value of y at the midpoint: yn 4ij2 = 
Yn thf (an, Yn)/2. Then this predicted value is used to calculate a slope at the midpoint. 


Example 3.2.7: (Example 3.2.5 revisited) The modified Euler algorithm for f(x,y) = 5 (3a + y) becomes 
hl 


at, eae 
Xn ae no 
D 5/28" 59 


Unt > (32 ar Yn) 
1h ky 3h 3h? | 3h? 

n a re Ta win As oa) n re 
¥ 2' 8 2 x 


Yn+1 = 


Table 155: A comparison of the results for the numerical solution of the IVP y’ = 2 cos(t)+y(t)—1, y(0) =0 using 
the Euler method, the trapezoid method, Heun’s formula, and midpoint Euler algorithm for step size h = 0.05. 


t 

1.0 
2.0 
2.5 
3.0 


Exact 

1.301168679 
2.325444264 
2.399615760 
2.131112505 


Euler 

1.280605284 
2.303749072 
2.386084742 
2.129626833 


Trapezoid 

1.3007147307 
2.3238172807 
2.3969091047 
2.1267185085 


Heun 

1.300202716 
2.323267203 
2.396552146 
2.126646709 


Midpoint 

1.301122644 
2.325921001 
2.400687151 
2.133104265 


Comparison with the previous example reveals a remarkable result: it is the same recurrence as with the improved 
Euler formula. Indeed, these two formulas (3.2.15) and (3.2.16) coincide for a linear slope function f(x, y). However, 
if the function f(z, y) is not linear, then these formulas lead to different results. For example, let f(x,y) = 4x”? + 2y. 
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Figure 3.5: Modified Euler approximations (in black) along with the exact solution (in blue) to the linear differential 
equation y = 2 cos(t) + y(t) — 1, plotted with Maxima. 


Then we have the following approximations (with 11 and 13 arithmetic operations at each step, respectively): 


Yada Yn(1 + 2h + 2h?) + 2h(a2 +272,,)+4h?a22, improved Euler, 


l 


a2 
Ynt1 Yn(l + 2h + 2h?) + 4h (x + 5) + 4h 27, modified Euler. 


Table 155 gives comparisons of numerical calculations based on the methods discussed in this section. 


In all problems, use a uniform grid of points: 7, = 20 + kh, k =0,1,2,...,n, where h = (an — Xo)/n is a fixed step size. 
A numerical method works with a discrete set of mesh points {xx} and the sequence of values yo, y1, ---,; Yn, such that each 
Yk is approximately equal to the true value of the actual solution ¢(2) at that point x,, k =0,1,...,n. Throughout, primes 


denote derivatives with respect to x, and a dot stands for the derivative with respect to t. 


1. A projectile of mass m = 0.2kg shot vertically upward with the initial velocity v(0) = 8m/sec (see [14] for details) 


is slowed due to the gravity force F, = mg (g = 9.81m/sec”) and due to the air resistance force F. = —kv|v|, where 
k = 0.002 kg/m. Use Euler’s methods (3.2.5), (3.2.15), and (3.2.16) with h = 0.1 to approximate the velocity after 2.0 
seconds. 


2. Given y’ = y, y(0) = 1, find y(1) numerically using all five numerical methods presented in the section by taking h = 0.1 
and compare the results with its exact value. How many correct decimal places did you get? How can you use your 
results to compute e? 


3. Solve the initial value problem (1 + a”) y’ = 1, y(0) = 0, using the Euler method (3.2.5) to find y(1) with step sizes 
h =0.1, 0.05, 0.01. How can you use your results to compute 7? 


4. As we have learned from §2.1.1, solutions to autonomous differential equations may blow up at a point. Consider the 
initial value problem y = y?, y(0) = 1, and pretend that you don’t know its solution, y = ¢(t) = (1 —t)~*. Use Euler’s 
numerical method to attempt estimating the value of the solution at t = 1. How are you going to verify that the actual 
solution does not exist on the closed interval {0, 1]? 


5. Consider the initial value problem 3ry’ + y = 0, y(—1) = 1. Show that it has the actual solution y = ¢(x) = —27/3, 
which exists for « < 0, having discontinuity at « = 0. Apply Euler’s algorithm (3.2.5) with step size h = 0.15 to 
approximate this solution on the interval [—1,0.5]. Show that the numerical method “jumps across discontinuity” to 
another solution for r > 0. Repeat calculations for h = 0.03, but keeping results at the previous mesh points. Would 
you now suspect a discontinuity at x = 0? 


6. Solve the following differential equation that describes the amount x(t) of potassium hydroxide (KOH) after time t: 


ees) Geet) (as), 
ad \ 2 9 ame a 


3.2. 


10. 


11. 


12. 


13. 
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where k is the velocity constant of the reaction. If k = 6.22 x 10719, ny = ng = 2 x 10°, and n3 = 3 x 10°, how 
many units of KOH will have been formed after 0.3sec? Use the modified Euler method (3.2.16) with uniform step 
size h = 0.01 to calculate the approximate value, and then compare it with the true value. 


. Consider the following autonomous differential equations subject to the initial condition y(0) = 1 at the specified point 


e— 0. 

(a) y=e™%; (d) y= ¥(1-y/4); (g) y =1/(2y + 3y”); 
(b) y=4(14+y?); (e) y=(y +1)/y; (h) y= (y—5)’; 

(c) y= 7y(1—y/100); (f) y=y? —y/10; (Gi) y=”. 


Use the Euler rule (3.2.5), improved Euler formula (3.2.15), and modified Euler method (3.2.16) to obtain an approxi- 
mation at x = 1. First use h = 0.1 and then use h = 0.05. Then compare numerical values with the true value. 


. Answer the following questions for each linear differential equation (a) — (f) subject to the initial condition y(0) = 1 


at the specified point x = 0. 


(a) yp =y-2; (b) xy =2+2-y; (c) 


y’ =3— 9x + 3y; 
(d) y’+y=2e; (e) (@?+1)y'+2cy=6r; (f) («+1 


y ty=2". 
(i) Find the actual solution y = ¢(2). 
ii) Use Euler’s method yxii = yr +hf,x to obtain the approximation y, at the mesh point xx. 
Yr+ ¥y y 
iii) Use the backward Euler method (3.2.12) to obtain the approximation y, at the mesh point zz. 
pp ¥ Pp 
iv) Use the trapezoid rule (3.2.14) to obtain the approximation yz, at the mesh point zz. 
Pp pp ¥ Pp 
(v) Use the improved (or modified) Euler formula (3.2.15) to obtain the approximation y; at the mesh point xp. 


(vi) For h = 0.1, compare numerical values ys found in the previous parts with the true value ¢(0.4). 


. Consider the following separable differential equations subject to the initial condition y(0) = 1 at the specified point 


ro = 0. 
(a) yf = wy"; (b) y =(2e+1)(y? +1); (c) yy! = cosa; 
(d) y =2eyt+a/y; (e) (+2) y! = ui (f) y - ry’ — 4x; 
(g) yi =2xcos’y; (h) y= (22/y)°; (i) yi =3a°t+y. 


Use the Euler method (3.2.5), improved Euler formula (3.2.15), and modified Euler algorithm (3.2.16) on the uniform 
mesh grid to obtain an approximation at the point x = 1. First use h = 0.1, and then use h = 0.05. Compare numerical 
values at x = 1 with the true value. 


Consider the following initial value problems for the Bernoulli equations. 

(a) y’=yQrt+y), y(0)=1/5.  (b) zy’ ty=aey?, y(1)=1. 

(c) y=ay’+y/(2r), yA)=1. (d) gty=yre’, y(l)=1. 

(ec) g=ytty’?, y(0)=1. (f) y= 2ey(1—y), y(0) = 2. 
First find the actual solutions and then calculate approximate solutions at the point x = 1.5 using the Euler method 
(3.2.5), improved Euler formula (3.2.15), and modified Euler method (3.2.16) on the uniform grid. First use h = 0.1 
and then h = 0.05. 


Consider the following initial value problems for which analytic solutions are not available. 


(a) y=rtyy, y0)=1. (b) yi=a2?-y*, yO)=1. 

(c) y=aty, y(0)=1. (4) yi =e*+a/y, y(0)=1. 

(e) y'=sin(z)—y’, yO)=1. (ff) y' =2?—2°* cos(y), y(0) =1. 
Find approximate solutions at the point x = 1.4 for each problem using the Euler method (3.2.5), improved Euler 
formula (3.2.15), and modified Euler method (3.2.16) on the uniform grid. First use h = 0.1 and then h = 0.05. 


Apply the backward Euler algorithm to solve the following differential equations involving cubic and fourth powers. 
In particular, find the approximate value of the solution at the point x = 0.5 with the uniform step size h = 0.1 and 
h = 0.05. 


(a) y=y—5e, y(0)=1; (b) y=y*— 400, yO)=1. 
Make a computational experiment: in the improved Euler algorithm (3.2.15), use the corrector value as the next 
prediction, and only after the second iteration use the trapezoid rule. Derive an appropriate formula and apply it to 


solve the following differential equations subject to the initial condition y(0) = 1. Compare your answers with the true 
values at x = 2 and the approximate values obtained with the average slope algorithm (3.2.15) with h = 0.1. 


(a) Y=(ta)vy (b) yf’ =(Q2e@tyt1"'; — (c) ¥ =(y- 42)’. 
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14. Consider the generalized logistic equation 


Find numerical approximations to the solution in the range 0 
(1.5, 1), (1.5, 2), (2,2) and the values of parameters k = 1, K 


(a) Euler’s rule (3.2.5) with h = 0.1; 

(b) Euler’s rule (3.2.5) with h = 0.01; 

c) improved Euler’s method (3.2.15) with h = 0.1; 
d) improved Euler’s method (3.2.15) with h = 0.01; 
(e) modified Euler’s method (3.2.16) with h = 0.1; 
(f) modified Euler’s method (3.2.16) with h = 0.01; 


t < 10 for the parameter pairs (a, 3) = (0.5, 1), (0.5, 2), 
5, and the initial condition P(0) = 1 using 


15. Repeat the previous exercise with another initial condition P(0) = 6. 
16. Consider the initial value problem for the generalized logistic equation 


dP(t) P(t) 1 
m= pw(a-FO),  Po=$. 


where K(t) depends on time variable t. Find numerical approximations to the solution in the range 0 < t < 10 for the 
following functions 


(A) K(t)=3+sint; (B) K(t)=m(24+4t); (C) K(t)=t+1; 


(a) Euler’s rule (3.2.5) with h = 0.1; 

(b) Euler’s rule (3.2.5) with h = 0.01; 

c) improved Euler’s method (3.2.15) with h = 0.1; 
d) improved Euler’s method (3.2.15) with h = 0.01; 
(e) modified Euler’s method (3.2.16) with h = 0.1; 
(f) modified Euler’s method (3.2.16) with h = 0.01; 


17. Consider the initial value problem y’ = 2(1+ y?), y(0) = 1. Use Euler’s rule (3.2.5) and the Heun method (3.2.15) to 
find the approximate value of the exact solution ¢(a) = tan (S + +) at the point x = 1.5. Use several step decrements, 
starting with h = 0.1, and then h = 0.01, h = 0.001. Do you observe any problem in achieving this task? 

18. Consider the initial value problem y’ = (a + y — 3/2)”, y(0) = 2. Use the improved Euler method (3.2.15) with h = 0.1 


and h = 0.05 to obtain approximate values of the solution at x = 0.5. At each step compare the approximate value 
with the true value of the actual solution. 


19. Which of two numerical methods, average slope algorithm (3.2.15) or midpoint rule (3.2.16), requires fewer arithmetic 
operations? 


20. Let a and b be some real numbers. Apply Heun’s formula and the midpoint Euler method with a fixed positive step 
size h to the initial value problem y’ = az + by, y(xo) = yo. Show that in either case these methods lead to the same 
difference equation, which has the solution 


b2hn?\” axo a anh azo a 


In each problem, the given iteration is the result of applying Euler’s method, the trapezoid rule, improved Euler formula, or 
the midpoint Euler rule to an initial value problem y’ = f(x,y), y(zo) = yo on the interval a = xo < x < b. Identify the 
numerical method and determine a = Zo, b, and f(z, y). 

14. Ynti = Yn + haz + hy2, yo = 1, 2n =1+4+nh, h=0.01, n =0,1,..., 100. 

15. Yn4t1 = Yn + han + hans + hynsi, yo = 1, tn =1+nh, h = 0.02, n =0,1,...,100. 

16. Ynt1 = Yn + 3h + hanyn +h? (nym + 32nYm + 3 Yn) +h? (3anyn + 3 enYn + an + enyn + 3yn) + h* (3enyn + 3), 

Yo = 2, %, =1+nh,h=0.05, n=0,1,...,40. 
17. yada = Yn than + +h (yn +2 (an +2)’, yo =1, tn = nh, h = 0.01, n=0,1,..., 100. 
18. Yn41 = Yn + 4 (yn + yh41), Yo=0, Mn =14+Nh, h = 0.05, n =0,1,..., 100. 
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3.3. The Polynomial Approximation 


One of the alternative approaches to the discrete numerical methods discussed in §3.2 is the Taylor series method. 
If the slope function f(a, y) in the initial value problem 


y' = F(z,y), y(Zo) = Yo, (3.3.1) 


is sufficiently differentiable or is itself given by a power series, then the numerical integration by Taylor’s expansion 
is possible. In what follows, it is assumed that f(x,y) possesses continuous derivatives with respect to both x and 
y of all orders required to justify the analytical operations to be performed. 

There are two ways in which a Taylor series can be used to construct an approximation to the solution of the initial 
value problem. One can either utilize the differential equation to generate Taylor polynomials that approximate the 
solution, or one can use a Taylor polynomial as a part of a numeric integration scheme similar to Euler’s methods. 
We are going to illustrate both techniques. 

It is known from calculus that a smooth function g(x) can be approximated by its Taylor series polynomial of 
the order n: 

g'”) (xo) 


Pn() = g(ao) + 9'(@o)(@ — #0) + +++ + = (e— 20)"; (3.3.2) 


which is valid in some neighborhood of the point x = x9. How good this approximation is depends on the existence 
of the derivatives of the function g(x) and their values at 7 = xo, as the following statement shows (consult an 
advanced calculus course). 


Theorem 3.1: [Lagrange] Let g(x) — pn(#) measure the accuracy of the polynomial approximation (3.3.2) of 
the function g(a) that possesses (n + 1) continuous cernaiives on an interval containing zo and x, then 
ee) 


(n+ 1)! ye 


g(x) — Pn(x) = (x — xo 


? 


where €, although unknown, is guaranteed to be between xg and a. 


Therefore, one might be tempted to find the solution to the initial value problem (3.3.1) as an infinite Taylor 
series: 


y(x2) = 3 Gy, (a — x9)” (3.3.3) 
n>0 
y" (xo) C y'" (xo) & 
2! 3! 

This series provides an explicit formula for the solution, which can be computed to any desired accuracy. If a 
numerical method based on a power series expansion is to yield efficient, accurate approximate solutions to problems 
in ordinary differential equations, it must incorporate accurate determinations of the radii of convergence of the 
series. However, the series representation (3.3.3) may not exist, and when it does exist, the series may not converge 
in the interval of interest. 

Usually solutions of nonlinear differential equations contain many singularities that hinder the computation of 
numerical solutions. The determination of singular points or estimation of the radius of convergence requires quite 
a bit of effort. Without knowing a radius of convergence (which is the distance to the nearest singularity) and 
the rate of decrease of coefficient magnitudes, power series solutions are rather useless because we don’t know how 
many terms in the truncated series must be kept to achieve an accurate approximation. In general, the Taylor series 
approximation of a smooth function is often not so accurate over an interval of interest, but may provide a reliable 
power series solution when some information about its convergence is known, for example, when its coefficients are 
determined, or satisfy a recurrence, or some pattern is evident. 

Since the coefficients a, = y\")(xo)/n! in series (3.3.3) are expressed through derivatives of the (unknown) 
function y(a), they could be determined by differentiation of both sides of the equation y’ = f(x,y) as follows: 


— 29)? + = 19)? + 


= y(xo) + y' (x0) (a — 20) + 


y = f(z,y), 


0 0 
y” f= het til =fetthe=(F4IZ) Mew) 


a a> 
y" = f" = few + Wfeyf + fuyf? + fufot FS = (F415) f(z, y). 
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Continuing in this manner, one can express any derivative of an unknown function y in terms of f(x,y) and its 
partial derivatives. However, we are most likely forced to truncate the infinite series after very few terms because 
the exact expressions for derivatives of f rapidly increase in complexity unless f is a very simple function or some 
of its derivatives become identically zero (for example, if f(x,y) = (2y/x) — 1, then f” = 0). 

Since formulas for successive derivatives can be calculated by repeatedly differentiating the previous derivative 
and then replacing the derivative y’ by f(z, y(a)), we delegate this job to a computer algebra system. For instance, 
the following Maple code allows us to find derivatives of the function y(z): 

derivs:=proc(n) 

option remember ; 

if n=0 then y(x) 

elif n=1 then f(x,y(x)) 

else simplify (subs (diff (y(x) ,x)=f(x,y(x)) ,diff (derivs(n-1) ,x))) 
end if; end; 


Similarly, we can find derivatives using Mathematica: 


f{x_,y_] = x°2 + y°2; (* function f is chosen for illustration *) 
yt[x_,y_] = D[f[x,y[x]],x]/.{y’ [x]->flx,y],y[x]->y}; 

y2[x_,y_] = Dly1[x,y[x]],x]/.fy’ [x] ->flx,y],y[x]->y}; (* or *) 
f2[x_,y_]:= DIf[x, yl], x] + DIflx, yl, yl] flx, yl 


£3[x_,y_]:= D[f2[x, y], x] + D[f2[x, yl, yl] flx, yl] 


Example 3.3.1: Consider the initial value problem 
y=rz-yt2, y(0)=1. 


Since the given differential equation is linear, the problem has a unique solution on the real line (see Theorem 1.1 
on page 22). Therefore, we try to find its solution as the power series 


y(a) =1+ cas 


n>1 


where nic, = y\”(0), n = 1,2,.... This series obviously satisfies the initial condition y(0) = 1 provided that. it 
converges in a neighborhood of the origin. Differentiating both sides of the equation y’ = « — y + 2 and setting 
x = 0, we obtain 
c= y'(0) = (@-y + 2)eno = —y(0) +2 =1, 
2c2 = y"(0) = (@—y + 2)p-0 = (1 - y')e=0 = 0, 
d 
3!c3 = y'"(0) = (w@-— yt 2)%_) = —(1-y’) = —-y"(0) =0. 
dx par 
All coefficients c, in the series representation y(z) = 1+ 2+ 7,59 ¢n2” vanish and we get the solution y = 1+ 2. 
Now we consider the same equation subject to a different initial condition y(1) = 3. We seek its solution in the 


form 
y(x) =3+ S- Cn (2@ — 1)”, nl en = y™(1). 
nol 


Using the same approach, we obtain 
cy =y'(1) = (@ —y + 2)ami = —y(1) +3 = 0, 
2eg = y" (1) = (@-y + 2)p-4 = (1- yeni = 1, 
d 
3ieg3 = (1) = (@-y + 2)ra=s (l-¥)} = —9"Q) = -1, 
x 


x=1 
Ales = y (1) = -y"(1) = 1, 


—1)" 


and so forth. Hence, cy, = ( mo 


n = 2,3,..., and the solution becomes 


yl) =3+ 9 (ea = 34+ OO" eas @-ai tate. 
n>2 n>0 


Note that the series for y(x) converges everywhere. 
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Example 3.3.2: (Example 3.2.3 revisited) | Consider the initial value problem for the following nonlinear 
equation: 


1 
= -———$__==(¢-94+2)"* =, 
aa (c—y+2)~, — y(0) 
Let us try the series method: y(a) = 1+a+ S- dn 2", where n! ay = y(0). Here we have used our knowledge 
n>2 
1 
that y(0) = 1 and y'(0) = ———~ = 1. Now we need to find further derivatives: 
x-—yt2 2=0 
y= (wy +2)% =(-D@-yt2)*0-y/) = y"(0) =0, 
y" = Ae—y+2) 91 —-y')?—(e@—-y+2)*°(-y") => = 9"(0) =0, 


and so on. Hence, all derivatives of the function y greater than 2 vanish and the required solution becomes y = 7+1. 

If we would like to solve the same differential equation under another initial condition y(zo) = yo, we cannot 
choose the initial point (xo, yo) arbitrarily; it should satisfy x) — yo + 2 4 0, otherwise the slope function f(x,y) = 
(x — y +2)? is undefined. For instance, we cannot pick y(1) = 3 simply because this point (1,3) is outside of the 
domain of the slope function. So we consider the same equation subject to the initial condition y(1) = 1 and seek 
its solution in the form: y(x) = 1+ >7,,51 ¢n (w — 1)”. Then the coefficients are evaluated as follows: 


a =y'(1) =(e—yt2)52, =1/2, 


deg =y"(1) = (ev tD| = (ey +270 yaar = 55, 
Bley = y"(l) =— S (@—y +2) 20-¥/) 
da wei 
=2 (a —y+2) yaa + (@—9 +2) 79a = op 
dle, -- Oy | _ 6A -y)y" a aa ee 
(@—y+2) lng @oyt2 lg @oyt2P lee 2" 


To verify our calculations, we ask Maple for help. 
ode:=D(y) (x)=1/ (x-y (x) +2) ; 
dsolve({ode, y(1) = 1}, y(x), ’series’, x = 1) 

The series solution seems to converge for small |x — 1], but we don’t know its radius of convergence because we 
have no clue how the coefficients decrease with n in general. It is our hope that the truncated series formula gives 
a good approximation to the “exact” solution y = In |x — y + 2| +1 —1n2, but hopes cannot be used in calculations. 
We are lucky that the solution can be found in an implicit form or even as an inverse function: x = y— 1+ e471. 
Nevertheless, the truncated series approximation is close to the actual solution in some neighborhood of the point 
x = 1, as seen from Fig. 3.6. Using the following Maple commands, we can evaluate the solution at any point, say 
at x = 2, to get y(2) © 1.442854401. 


soln := dsolve({ode, y(1) = 1}) assign(soln); y(x); 
evalf(subs(x = 2, y(x))) 


Now we plot the actual solution using Mathematica: 

ansi = DSolvel[{y’ [x] == 1/(x - yl[x] + 2), yO] == 1}, yix], x] 
Plot[y[x] /. ansi, {x, -1, 1}, PlotRange -> {0, 2}, AspectRatio ->1] 
ans2 = DSolve[{x’ [y] == xly] - y + 2, x[1] == 0}, xly], y] //Simplify 
ParametricPlot [Evaluate[{xly], y} /. ans2], fy, 0, 2}, 

PlotRange -> {{-1, 1}, {0, 2}}, AspectRatio -> 1] 
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64 


Approximation 
Log plot of |c, |!” -1/2.003 vs. n, for c,, not zero 


Exact solution 


0.005 —- 


L L L L L 
43 3 123 163 203 243 283 323 363 «403.443 


Figure 3.7: The first 100 roots *"*¥/|can+3| of coef- 
Figure 3.6: Example 3.3.2: fourth Taylor approxi- ficients of power series (3.3.4), plotted with Mathe- 
mation to the exact solution, plotted with Maple. matica using a logarithm scale. 


Example 3.3.3: Let f(x,y) = 27 + y? and consider the equation y’ = f(x,y) subject to the initial condition 
y(0) = 0. The amount of work to find sequential derivatives of y‘” accumulates like a rolling snowball with n; 
however, first two derivatives of f are easy to obtain 


y =f = fet fyf =Aet+ay+y’), 
y" _ gq" = (1 +2¢y+24 + 4a7y? 4 3y*), 
which allow us to find the first few coefficients in its power series representation 
ve Qalt 132) 46 x19 


3 
(ee ia 
u(a) = } ena" => + 35 + Soro + ateags + Tomes + 


n>0 


(3.3.4) 


Since no simple pattern seems to be emerging for the coefficients, it is hard to say what the radius of convergence 
is for this series. We know from Example 2.7.5, page 114, that this radius should be around 2 because the solution 
(3.3.4) blows up around a * 2.003147. Actually, computer algebra systems are so powerful that we may try to 
estimate the radius R of convergence using the root test R~' = limsup,,_,,, /|¢n| = limnsoo 4"*8/|can+43], where 
lim sup is the upper limit. Indeed, Fig. 3.7, plotted with Mathematica, unambiguously indicates that the reciprocal 
of the radius of convergence is around 0.5. 
DSolve[{y’ [x] == y[x]“2 + x*2, y[0] == 0}, ylx], x] 
mylist = CoefficientList[y’ [x] - ( y[x]*2 + x2) /. 
{y[x] -> Sum[c[n] x7n, {n, 0, m}], 
y’ [x] -> D[Sum[c[n] x*n, {n, 0, m}], x]}, x]; 
c[0] = 0.; (* m is the number of terms in truncated polynomial *) 
Sum[c[n] xn, {n, 0, m}] /. 
Solve[Take[mylist, {1, m}] == Table[0., {n, 1, m}], 
Table[c[n], {n, 1, m}]][(1]] 
CoefficientList[%, x] 
Table(Z(liJ]*(1/i), fi, 1, Length[%]}] (* find i-th root of c_i *) 
ListPlot[%, Frame -> True, GridLines -> {None, {0.5}}] 

As seen from the previous examples, practical applications of the power series method are quite limited. Never- 
theless, polynomial approximations can be used locally in a step-by-step procedure. For example, if the series (3.3.3) 
is terminated after the first two terms and ¢(x,) and ¢(an+41) are replaced by their approximate values y,, and 
Yn+1, respectively, then we obtain the Euler method. Viewed in this light, we might retain more terms in the Taylor 
series to obtain a better approximation. Considering a uniform grid (3.2.2), page 146, it is common to calculate 
approximation y,+1 of the exact solution at the next step point z+, using either the Taylor Series Method of 
Order 2 


2 
Yn+1 = Yn t hfn + — 
2 y=un 
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where ” 
or the Taylor Series Method of Order 3 (or a three-term polynomial approximation) 


h3 
Yn+1 = Yn + h®o(rn, Yn; h) + 31 Ea + 2foy f + fyyf? + fe fy + fit] . (3.3.6) 


L=IH 
y="n 


In order to formalize the procedure, we first introduce, for any positive integer p, the increment operator acting 
on a function f: 
def hoy bev! (p-1) 
®,f = &,(x,y:h) = f(a,y) + 5 PM (ey) +o + ae (x,y), (3.3.7) 


where the derivatives f(*) can be calculated by means of the recurrence 


k 
fry — (+ + rz) f(w,y(z)), fO=f,  k=0,1,2,.... (3.3.8) 


As always, the approximate value of the actual solution at x = x, is denoted by y,, where x, = %o+nh,n = 0,1,2,.... 
If it is desired to have a more accurate method, then it should be clear that more computational work will be required. 
In general, we have Taylor’s algorithm of order p to find an approximate solution of the differential equation 
y’ = f(x,y) subject y(a) = yo over an interval [a, b]: 


1. For simplicity, choose a uniform partition with N+1 points 7, =a+kh,k=0,1,...,N, and set the step size 
h=(b—a)/N. 


2. Generate approximations y, at mesh points x, to the actual solution of the initial value problem y’ = f(z, y), 
y(Xo) = yo from the recurrence 


Yroi =Yk+h®p(xe,yr;h), k=0,1,2,...,N—1. (3.3.9) 


When x, and y(a,) are known exactly, then Eq. (3.3.9) could be used to compute y,+1 with an error 


perl 
(p+ 1)! y 


ptt 


(p+ 1)! FE (En)), tn <§n < In41; 


it) (E,) = 


where €, is an unknown parameter within the interval (v,,2n+1). If the number of terms to be included in (3.3.7) 
is fixed by the permissible error € and the series is truncated by p terms, then 


pere 


Garay FO emule) <e (3.3.10) 


For given h, the inequality (3.3.10) will determine p; otherwise, if p is specified, it will give an upper bound on h. 
Actually, Maple has a special subroutine based on the Taylor approximation: 

Digits := 10; dsolve(ode, numeric, method = taylorseries, output = 

Array(([0, .2, .4, .6, .8, 1]), abserr = 1.*107(-10)) 

Here the Array option allows us to obtain the output at specified points. There is another alternative approach 
of the polynomial approximation of great practical importance. The Mathematica function NDSolve approximates 
the solution by cubic splines, a sequence of cubic polynomials that are defined over mesh intervals [%p,%p+41] (n = 
0,1,...), and adjacent polynomials agree at the end mesh points. 


Example 3.3.4: (Example 3.3.3 revisited) For the slope function f(x,y) = x? + y?, the Taylor series methods 
of order 2 and 3 lead to the following formulas: 


Ynt1 = Ym + h(as + ye) +h? (en + 259m + Yn), 
h3 
Yn+1 = Un + h(a, “F y>) ace ao tn ye) + 3 (1 + 22 nYn + i, + Aa? ye, ate 3y;) ) 
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respectively. The operators ®2(x, y;h) and ®3(a, y;h) become 


®2(r,y;h) = 2? +y? +h(et+aryty’), 
2 


h 
3(2,y;h) =a? +y? + A(et+a?yty?) + = (14 2ay + a + 4x?y? + 3y4). 


Assuming that x and y have been evaluated previously, the operator ®2(2,y;h) requires 9 arithmetic operations, 
whereas ®3(x, y;) needs 23 arithmetic operations at each step. 


Example 3.3.5: We consider the initial value problem 
y=a?ty, y(0)=1. 


Since the equation is linear, the given problem has the explicit solution y = 3e” — 2 — 22 — x”. The higher order 
derivatives of y(x) can be calculated by successively differentiating the equation y’ = f(x,y), where f(x,y) = 27+ y: 


y = f(z,y) =a? +y, 
yl =f'=frtfyf =2+a*+y, 


and for the derivative of arbitrary order, 


y™ =24+2%er4+an74+y, as ee 


Hence, we obtain the p-th order approximation: 


4 x 3 xe at xP 
Yp(x) = Peay Paar ae : 


To get results accurate up to ¢ = 10~7 on every step of size h, we have from Eq. (3.3.10) 


ptt 


——— |f™ (En, y(En))| = 


(p+ 1)! 2 + 2, + €2 + y(En)| <e- 


peti 
(p+ 1)! 
Since |y(&,,)| < 3e on the interval [0, 1], the error of approximation should not exceed 
ier ppt 
it! (5+ 3e) <e or after rounding ii (5+ 3e) <5 x 107°. 
For h = 0.1 this inequality gives p = 7. So about 7 terms are required to achieve the accuracy ¢ in the range 
0 <a <1. Indeed, |y7(x) — y(x)| = 3 ip s8 z < 0.000083579 forO <a <1. 


Example 3.3.6: (Example 3.3.2 revisited) We consider the initial value problem 


y=(@-y+2)", y= 1. 


The Euler approximation yn41 = Yn + h(an — Yn + 2)~1 requires 5 arithmetic operations at each step. The second 
order Taylor series approximation (3.3.5) gives 


h h? yi —1 
Ynt1 = Yn + ——— te een thy t+ 


AS eee, 
2 (Un ~~ 1); 
Ln — Yn +2 2 (an — Yn + 2) 2 


where y/, = (%n — Yn + 2)~+. Such a second order approximation requires 11 arithmetic operations. The third order 
approximation (3.3.6) yields 


(yp, — 1) + = [wi yn)? + 2(y, - 1)? i)*] . 


6 


h2 1\2 h3 
Yntl1 = Yn thy}, + as 


which requires 22 arithmetic operations. 
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1. 


7. 


Determine the first three nonzero terms in the Taylor polynomial approximations for the given initial value problems. 

(a) y' =2+3y*, y(0)=1. (b) y! = 2x7 +3y?, y(0) = 1. 

(c) y/ =2a74+3y, y(0)=1. (d) y’ =y? +sinz’, y(0) =1. 

(ec) y=1/@—-y), yO) =1 (ff) y'=y—y?/3, (0) =1. 

(g) y =(y—-5),y0)=1 (bh) x =2-y’, y(0) =1. 
Find four-term Taylor approximations for the given differential equations subject to the initial condition y(0) = 1. 

(a) y=1+y’; (b) y' =3e4+2y; (ce) y! = 2a? + 2y, (d) yi =4-y’; 

(e) y=1/(@+y); (f) yi =30?-2y, (g) yi =2+a+y; (h) y! =1+2y; 

(i) y'=Qy-y)/(@-3); Gi) yo =y-8e% (kK) oy =(L-3e ty)"; (I) oy’ = 2ery. 
Suppose that the operator T,(x,y;h) defined by (3.3.7) satisfies the Lipschitz condition: |T,(#,y;h) —Tp(#,z;h)| < 
Ll|y — z| for any y, z and all h and w (x, «+h € [a,6]), and the (p+ 1)-st derivative of the exact solution of the IVP 
y’ = f(x,y), y(a@) = yo is continuous on the closed interval [a,b]. Show that the local truncation error zn = Yn — u(@n) 
between yn, the approximate value by the Taylor algorithm, and u(xn), the reference solution, is as follows: 


M 


/\(<h——— 
|2n] L(p+1)! 


lence a 1] , M= max ju* (z)). 
x€ [a,b] 


Obtain the Taylor series solution of the initial value problem y’ = x + 2xy, y(0) = 0 and determine: 


a) x when the error in y(x) obtained from four terms only is to be less than 107” after rounding. 
y y g 


(b) The smallest number of terms in the series needed to find results correct to 107" for 0 <a <1. 


Obtain the Taylor series solution of the initial value problem 
y +2ey=22, y(0)=0 
and determine: 


(a) « when the error in y(2) obtained from four terms only is to be less than 107" after rounding. 


(b) The number of terms in the series to find results correct to 107” forO <a <1. 


Solve the IVP y’ = «—y?, y(0) = 1/2 by the Taylor algorithm of order p = 3, using the steps h = 0.5 and h = 0.1, and 
compare the values of the numerical solution at x = 1 with the true value of the actual solution. 


Repeat the previous exercise for the IVP: y’ = « — y”, y(0) = 1. 


In Exercises 8 through 14 


(a) 
(b) 
(c) 


14. 


Obtain an actual solution and, from it, an accurate true value at the indicated terminal point. 
At the initial point, expand the actual solution in a Taylor polynomial of degree p = 5. 


Build a Taylor series numerical method of degree p and use it to calculate the value at the terminal point, using a 
stepsize of h = 0.1. Determine the error at the terminal point. 


Using h = 0.05, recalculate the value at the terminal point and again determine the error. 


, _ (36x + 102) y 


ar ee ee ee y(—2) = 1; terminal point at « = —1, and degree p = 3. 
; ss (0) = 1; terminal point at x = 1, and d 1 
= ——.— = 1; terminal point at x = 1, and degree p = 2. 
U-Tpay Y p gree p 
18 x2 
y = “i. y(2) = 2; terminal point at z = 3, and degree p = 2. 
y 
aa (1) = 1; terminal point at x = 2, and degree 2 
— =") = ni = 4. 
y 6y +72’ y ; Pp ’ g Pp 
y= — a y(2) = 3; terminal point at x = 3, and degree p = 3. 
y2— = 
2 
y= ae y(1) = 2; terminal point at x = 2, and degree p = 2. 
— 10x? 
y = et Ml y(1) = 1; terminal point at x = 2, and degree p = 2. 


Ty2 +622’ 
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3.4 Error Estimates 


Mathematical models are essential tools in our understanding of the real world; however, they are usually not 
accurate because it is impossible to take into account everything that affects the phenomenon under consideration. 
Also, almost every measurement is not accurate and is subject to human perception. The errors associated with both 
calculations and measurements can be characterized with regard to their accuracy and precision. Accuracy means 
how closely a computed or measured value agrees with the true value. Precision refers to how closely individual 
computed or measured values agree with each other. 

Once models are established, they are used for quantitative and qualitative analysis. Available computational 
tools allow many people, some of them with rather limited background in numerical analysis, to solve initial value 
problems numerically. Since digital computers cannot represent some quantities exactly, this section addresses 
concerns about the reliability of numbers that computers produce. 


Definition 3.4: The initial value problem 
y= f(xy), (a<a<b) = (3.4.1) 


is said to be a well-posed problem if a unique solution to the given problem exists and for any positive € there 
exists a positive constant & such that whenever |¢o| < € and |d(x)| < ¢, a unique solution, z(), to the problem 


2’ = f(z,z) + 6(2), z(&o) = yo + €0, (3.4.2) 


exists with |z(x) — y(x)| < ke for all x within the interval |a, }]. 


The problem specified by Eq. (3.4.2) is often called a perturbed problem associated with the given problem 
(3.4.1). Usually, we deal with a perturbed problem due to inaccurate input values. Therefore, numerical methods are 
almost always applied to perturbed problems. If the slope function in Eq. (3.4.1) satisfies the conditions of Theorem 
1.3 (page 23), the initial value problem (3.4.1) is well-posed. 

At the very least, we insist on f(x,y) obeying the Lipschitz condition 


|f(@, y1) ~ f(x, y2)| < Lyn — y2| for all Y1; Y2 € [a, 8], a < x < b. 


Here L > 0, called a Lipschitz constant, is independent of the choice of y; and y2. This condition guarantees that 
the initial value problem (3.4.1) possesses a unique solution (Theorem 1.3 on page 23). The Lipschitz condition 
is not an easy one to understand, but there is a relatively simple criterion that is sufficient to ensure a function 
satisfies the Lipschitz condition within a given domain 2 = [a,b] x [a, 6] containing (xo, yo). If the function f(z, y) 
has a continuous partial derivative with respect to y throughout [a, 3], then it obeys the Lipschitz condition within 
Q. If fy = Of /Oy exists and is bounded in the domain 2, then for some 6 (y1 < @ < yo) f(x,y.) — f(t, y2) = 
(1 — y2) fy (a, 4). 

Taking a stronger requirement, we may stipulate that the rate function f in the problem (3.4.1) has as many 
derivatives as needed by Taylor expansions. By choosing a constant step size h for simplicity, we generate a uniform 
set of mesh points: zg, = % + nh, n = 0,1,2,.... The quantity h, called the discretization parameter, measures 
the degree to which the discrete algorithm represents the original problem. This means that as h decreases, the 
approximate solution should come closer and closer to the actual solution. In what follows, we denote the actual 
solution of the initial value problem (3.4.1) by $(x) and its approximation at the mesh point 2 = x, by Yn. 

The proof of the convergence yy +> ¢(a,) as h > 0 is the ultimate goal of the numerical analysis. However, to 
establish such convergence and estimate its speed, we need to know a good deal about the exact solution (including 
its derivatives). In most practical applications, this is nearly impossible to achieve. When we leave the realm of 
textbook problems, the requirements of most convergence analysis are too restrictive to be applicable. In practice, 
numerical experiments are usually conducted for several values of a discretization parameter until the numerical 
solution has “settled down” in the first few decimal places. Unfortunately, it does not always work; for instance, 
this approach cannot detect a multiplication error. 

Recall that a “numerical solution” to the initial value problem (3.4.1) is a finite set of ordered pairs of rational 
numbers, (%n, Yn), 2 = 0,1,2,...,.N, whose first coordinates, x,,, are distinct mesh points and whose second coor- 
dinates, y,, are approximations to the actual solution y = ¢(x) at these grid points. Numerical algorithms that we 
discussed so far generate a sequence of approximations {yn}n>o0 by operating within one grid interval [%,,¢%n+1] of 
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size h = &n41 — Xp and use the values of the slope function on this interval along with only one approximate value 
Yn at the left end x,. They define the approximation ynex~ to the solution at the right end 2,4, by the formula 


Yneat = Y + h&(x, Y; Ynect h) or Ynt1 = Yn + h®(rn, Yn, YUn+13 i) (3.4.3) 


where the function ®(2, y, Yneat; h) may be thought of as the approximate increment per unit step h and it is defined 
by the applied method. For example, ®(x, y;h) = f(,y) in the Euler method, but in the trapezoid method (3.2.14), 
® depends on Ynext, that is, ® = 4 f(a, y) + 4 f(x +h, Ynext): 


Definition 3.5: The big Oh notation, O(h”), also known as the Landau symbol, means a set of functions that 
are bounded when divided by h”, that is, |h~" f(h)| < K for every function f from O(h") and some constant 


K = K(f) independent of h. 


It is customary to denote the big Oh relation as either f = O(h") or f € O(h”) when h—” f(h) is bounded. 
The Landau symbol is used to estimate the truncation errors or discretization errors, which are the errors 
introduced by algorithms in replacing an infinite or infinitesimal quantity by something finite. 


Definition 3.6: The one-step difference method (3.4.3) is said to have the truncation error given by 


u(an +h) — u(an) 


Ta(h) = ? 


— O(2n, Yn; h), n=0,1,2,..., (3.4.4) 
where u(x) is the true solution of the local initial value problem 


This function u(x) on the mesh interval [%, %p + h] is called the reference solution. The difference u(xp +h) — 
Ynt1 = hT,(h) is called the local truncation error. We say that the truncation error is of order p > 0 if 
Tn(h) = O(h?), that is, |T,(h)| < Kh? for some positive constant K (not depending on h but on the slope 
function). 


A local truncation error (from the Latin truncare, meaning to cut off) measures the accuracy of the method at 
a specific step, assuming that the method was exact at the previous step: 


hT,,(h) = (u(2n+41) _ tlt, )) ar (Yn41 = Yn)s 


where u(x) is the reference solution on the interval [%,,2»+41]. A numerical method is of order p if at each mesh 
interval of length h its local truncation error is O(h?). The truncation error can be defined as 


Tn(h) = ; [u(an +h) — Ynext] (3.4.5) 

because of the assumption that u(2,) = yy. This shows that the local truncation error hT,,(h) is the difference 

between the exact and the approximate increment per unit step because the reference solution is an exact solution 

within every step starting at (ap,¥Y,). Truncation errors would vanish if computers were infinitely fast and had 
infinitely large memories such that there was no need to settle for finite approximations. 

As a rule, for a given amount of computational effort, a method with a higher order of convergence tends to 
give better results than one with lower order. However, using a high-order method does not always mean that you 
get greater accuracy. This is balanced by the fact that higher order methods are usually more restrictive in their 
applicability and are more difficult to implement. When a higher order method is carried out on a computer, the 
increased accuracy repays the extra effort for more work to be performed at each step. 

Let (a) be the actual solution to the initial value problem (3.4.1). The difference 


En =$(2n)—Yn, n=0,1,2,..., (3.4.6) 


is known as the global truncation error (also called the accumulative error), which measures how far away the 
approximation y,, is from the true value ¢(z,,). To find this approximation, y,, we need to make n — 1 iterations 
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Figure 3.8: Local and propagated errors in Euler’s approximations along with the exact solution (solid) and reference 
solutions (dashed), plotted with MATLAB. 


according to the one-step algorithm (3.4.3), and every step is based on the solution from the previous step. The input 
data at each step are only approximately correct since we don’t know the true value ¢(2;), but only its approximation 
ye (k = 0,...,n— 1). Therefore, at each step, we numerically solve a perturbed problem by introducing a local 
truncation error that arises from the use of an approximate formula. The essential part of the global error (not 
included in the local truncation error), accumulated with iteration, is called the propagated truncation error 
(see Fig. 3.8). 


Definition 3.7: The one-step method (3.4.3) is considered consistent if its truncation error 
Tr(h) 70 as h->0 


uniformly for (x,y). A method is said to be convergent if 


li | an n)| = 
im max |yn — 6(tn)| = 0 


for the actual solution y = (x) of the initial value problem (3.4.1) with a Lipschitz slope function f(s, y). 


Hence, convergence means that the numerical solution tends to the values of the actual solution at mesh points 
as the grid becomes increasingly fine. From Eq. (3.4.3), we have consistency if and only if ®(#, y;0) = f(a, y). Every 
one-step method of order p > 0 is consistent. 

a 

To derive the local truncation error for the Euler method, we use Taylor’s theorem: 


(fn41 = ig 


En) 


#(2n41) = ?(@n) sr (nad = ae x) + 


for some number &, € (@n,2n41). Since {n41 — In =h, b' (Xn) = f(Ln, O(an)), and ¢(a,) is assumed to be equal to 
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Yn, we obtain the local truncation error hT,,(h) = $(an) — yn to be 


h? h? 
hT,(h) = > o' (&) = > [fa (Ens On) ly fy (Ens on) f (Ens On) = O(h”), 


where ¢, = $(€,) is the value of the actual solution ¢(a) at the point z = €,. So the local truncation error for the 
Euler algorithm (either forward or backward) is of order p = 1, and we would expect the error to be small for values 
of h that are small enough. If the initial value yo = y(a) is given and we seek the solution value at the point b, then 
there are N = (b — a)/h steps involved in covering the entire interval [a,b]. The cumulative error after n steps will 
be proportional to h, as the following calculations show. 


h2 
En41 = O(Xn41) — Yn+1 = (Xn) + h@' (an) ae > Olen) — Un4t1 


2 
= den) +A! en) +9" En) — tn — AF (nsdn) 


I 


O(n) — Yn + RIB" (On) — Fn Yn)] + 8" En) 


h2 
= E,+hlf (en, (tn) — f(tn,¥n)] + > b' (En) 
|En| + hL|On — yn| + Mh? = |Ep| (1+ AL) + MP?, 


JN 


where M = max, |¢”(x)|/2 and L is the Lipschitz (positive) constant for the function f, that is, | f(x, y1)—f (x, y2)| < 
L|y1 —y2|. If the slope function is differentiable in a rectangular domain containing the solution curve, then constants 
L and M can be estimated as L = max(z,y)er|fy(x,y)| and 2M = max(z yer |fo(x,y) + f(x,y) fy(z, y)|. Now we 
claim that from the inequality 

|En4il <|En|(l+hL)+Mnh?, (3.4.7) 


it follows that a 
|En| <= h [A + hL)" — I, = O01, 2y2405 (3.4.8) 
We prove this statement by induction on n. When n = 0, the error is zero since at ro = a the numerical solution 


matches the initial condition. 
For arbitrary n > 0, we assume that inequality (3.4.8) is true up to n; using (3.4.7), we have 


x 


M 
Bua E,|\(1 + hL) + Mh? < =h [(1+ AL)” —1) (14+ AL) + Mh? 
ih L 
M 


f 
= = nil +AL)"* — 7 h(1+AL) + Mh? 
M M M 
= —hA(1+AL)U -—-—h=—h (14+AL)"™™ -1). 
FRA + AL — = h=h [(1 + hb) 


This proves that the inequality (3.4.8) is valid for all n. The constant hL is positive, hence 1+hL < e””, and we 
deduce that (1+hL)" < e"’”. The index n is allowed to range through {0, dene boa \f therefore, (1+hL)” < e%-%), 
Substituting into the inequality (3.4.8), we obtain the estimate 


b-—a 
7 


\b(2n41) — Yn+i| < a h (ae = 1) ,  n=0,1..., (3.4.9) 
Since M (e%°-*) — 1) /L is independent of h, the global truncation error of the Euler rule (3.2.5) is proportional to 
h and tends to zero as h > 0. oO 

By scanning the rows in Table 151, we observe that for each mesh point 2, the accumulative error E, = 6(&n)—Yn 
decreases when the step size is reduced. But by scanning the columns in this table, we see that the error increases as 
tp gets further from the starting point. This reflects the general observation: the smaller the step size, the less the 
truncation error and the approximate solution is closer to the actual solution. On the other hand, the exponential 
term in the inequality (3.4.9) can be quite large for large intervals, which means that we should not trust numerical 
approximations over long intervals. 

The inequality (3.4.9) should not be used to build bridges or airplanes, but our goal is to demonstrate the general 
form of the procedure. The bound (3.4.9) is too crude to be used in practical estimations of numerical errors because 
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it is based only on two constants—the Lipschitz constant L (which is actually an estimation of f,) and the upper 
bound MM of the second derivative of the actual solution (which is usually unknown). In the majority of applications, 
the cumulative error rarely reaches the upper bound (3.4.9), but sometimes it does. 


Example 3.4.1: In the initial value problem 
y’ =—10y, y(0) = 1, 


for the linear equation, we have the slope function f(x,y) = —10y, with the Lipschitz constant L = 10. For the 
second derivative, y’’(2) = 100 e~1°* of the exact solution y(x) = e~'°” on the interval [0, 1], we have the estimate 
M = $ maxze(o,1) |y’()| = 50. Thus, we find the upper bound in the inequality (3.4.9) to be 


|En+1| = |6(¢n41) — Yagil < 5h (e? — 1) = 110127.329 h. 


On the other hand, the Euler algorithm for the given initial value problem produces yn41 = Yn — 10hyn = yo (1 — 
10h)"*", which leads to the estimate at the point a = 1: 


2 2 
Ens, =e7° — (1— 10h)? = (son — AOR 2 4, Os — St) a 


<50e7!° h = 0.00227 h. 


If h were chosen so that 1— 10h < —1 (h > 0.5), the corresponding difference solution y,, would bear no resemblance 
to the solution of the given differential equation. This phenomenon is called partial instability. 


The truncation error for the improved Euler method is 


Un+1 — Un 1 


T(h) = 8 _ =f, — 5 Fantasia + hf) fn = f(2n5 Yn); Un = U(Ln) = Yn- 


The function f(a, +h, yn + hfn) can be estimated based on the Taylor theorem: 


f(an + hy yn + hfn) = fn + hfx(@n, Yn) + hfn Julieta) + En, 


where the error E,, at each step is proportional to h?. It is very difficult to predict whether such an approximation 
is an underestimate or otherwise. Hence, 


Un+1 — Un 


Ty(h) = PH _ f, — [fetus tn) + fr fulnsYn)] + O(R?), 


On the other hand, 5 


/ h A h? Wr 
Untl = Unt+hu (tn) +> u (tn) + oy U (En) 
2 


= Unthf (tn, Un) + . u" (an) + O(h3). 


ches tt Un = f(a, tn) + wn) +O(h?), Un = Yn. 
Substituting this representation into the above expression for T,,(h), we obtain 
h, h 2 2 
Tr(h) = Fu (tn) — 5 [fe(@n, 9m) + fn fy(@n, Yn)] + O(h”) = O(h") 
since 
wa.) = ait) = fir(2n:Yn) + fn fy(@n; Yn) + O(h’). 


Thus, the truncation error for the improved Euler method is of the second order. Similarly, it can be shown (see 
Problems 3 and 4) that both the trapezoid and modified Euler methods are of the second order. a 

One of the popular approaches to maintain the prescribed error tolerance per unit step is based on interval 
halving or doubling. The idea consists of computing the approximate value y,+1 at the grid point 7,41 by using the 
current step h = a41 — Ly and then recomputing its value using two steps of length h/2. One expects to reduce 
the error by approximately (1/2)?, where p is the order of the truncation error. If the results of such calculations 
differ within the tolerance, the step size h is kept unchanged; otherwise, we halve it and repeat the procedure. The 
same approach is used to double the step size. 
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Example 3.4.2: Consider the differential equation y = y? sin(y(t)) +t subject to three initial conditions: y(0) = 
—0.8075, y(0) = —0.8076, and y(0) = —0.8077. By plotting their solutions with Maple, 


y3 := diff(y(t), t) = y(t)*2*sin(y(t))+t 
DEplot(y3, y(t), t = 0 .. 6, [Ly(O) = -.8075], [y(0) = -.8076], 
[y(0) = -.8077]] ,arrows=medium, y= -3 .. 2,color = black,linecolor = blue) 


we see (Fig. 3.9 on page 171) that solutions are very sensitive to the initial conditions. This is usually the case when 
there is an exceptional solution that separates solutions with different qualitative behavior. If the initial condition 
identifies the exceptional solution, any numerical method will eventually flunk (see Exercise 14 on page 184). 
Euler’s accumulative error E,(h) = (an) — yn is O(h), that is, |E,(h)| < Ch, where C is a constant that 
depends on the behavior of the slope function within the mesh interval, but independent of step size h. To perform 
actual calculations, we consider the equation y = y” sin(y(t)) +t subject to the initial condition: y(0) = —0.8075. By 
calculating the ratios E,,/h for different grid points, we see that they remain approximately unchanged for different 
step sizes: 


h Approx. att=1 |E(h)|/h Approx. at t = 2 |E(h)|/h 

2 —1.2078679834365 0.30429340837767 | —1.67796064616645 5.67692722580626 
2 —1.2033363156891 0.31856008092308 | —1.59929791928236 6.31943993103098 
2 —1.2009071155484 0.32618254383425 | —1.55268398278824 6.67229599081492 
2 —1.1996483656937 0.33012512485624 | —1.52733905657636 6.85629087138864 
2 —1.1990075067446 0.33213046777075 | —1.51413104023359 6.95007737527897 


where (1) = —1.19835888124895, ¢(2) = —1.50055667035633 are true values at t = 1 and t = 2, respectively 
(calculated with the tolerance 2~'4). This table tells us that the ratio E(h)/E(h/2) of two errors evaluated for the 
same mesh point will be close to 2. Hence, the error goes down by approximately one-half when h is halved. If we 
repeat similar calculations using the Heun method (of order 2), we will get the table of values: 
h Approx. at t= 1 |E(h)|/h? Approx. at t = 2 |E(h)|/h? 
—1.1987431613123 0.3935712129344 —1.50867192833994 8.31002417145396 
—1.1984558487773 0.3974527082992 —1.50261140901733 8.41620954044174 
—1.1983831848709 0.3992853900927 —1.50107317886035 8.46247526969455 
—1.1983649205765 0.4001727624709 —1.50068612220823 8.48375632541138 
—1.1983603426266 0.4006089575705 —1.50058907206927 8.49391367618227 


In the case of a second order numerical method (Heun or midpoint), the ratio of cumulative errors E(h)/E(h/2) 
will be close to 4 = 27, which is clearly seen from Fig. 3.9(b). 
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Figure 3.9: Example 3.4.2. (a) Solutions are sensitive to the initial conditions, plotted in Maple; and (b) error ratios 
for three different numerical methods, plotted with MATLAB. 


The previous observations show that the smaller the step size, the better the approximation we obtain. But what 
should we do when we cannot use a small step size? For instance, when the slope function is given at particular mesh 
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points, and it is unknown at other points. One of the approaches to improve approximations is to use iteration, as 
the following example shows. 


Example 3.4.3: Consider the following “improvement” of the Heun method. For the initial value problem y’ = 
f(x,y), y(xo) = yo, to find the approximate value y,,+1 of the actual solution at the right endpoint « = 2,41 of the 
mesh interval [2,2n+41] of size h = %p41 — Xn, the iterative algorithm can be expressed concisely as 


predictor: yf, =yr thf (rk, yn), 
j—1 . 
corrector: 44, = Yet f(ee, yn) +3 Sf (teri, Yeg1)) J =1,2,...m. 


The value m defines the number of iterations, which may depend on the mesh interval. Note that m = 1 corre- 
sponds to the improved Euler method. This value can be either fixed through the computations or variable, with a 
termination criterion 


View — View 
Perl _*kt1) x 100% <e, 


Yer. 


where € is a tolerance, and Ves and y) 4, are results from the prior and present iteration. It should be understood that 
the iterative process does not necessarily converge to the true answer. To see that a priori a fixed number of iterations 
is not the best option, we reconsider the initial value problem from Example 3.2.6 on page 155: = y = 27 y cos(2zt), 
y(0) = 1. Numerical experimentation reveals the following results summarized in Table 172. The data show that the 
next iteration may not give a better approximation; therefore, it is preferable to use a variable termination criterion 
rather than a fixed number of iterations. 


Table 172: A comparison of the results for the numerical approximations of the solution to the initial value problem 
y = 2ry cos(2rt), y(0) = 1 at the point t = 2, using the iterative Heun formula and the improved Euler method 
(3.2.15) for different step sizes h. 


Iterative Heun 
Exact m Approximation m Approximation m Approximation 
1.00 1 yoo © 0.92366 2 yao © 0.88179 3 Yoo © 0.99523 
1.00 4 yso & 0.99011 5 yioo © 0.99961 6 yooo © 0.99916 
Heun 
Exact h Approximation h Approximation h Approximation 


1.00. 0.1 yao © 0.92366 0.05 yao © 0.99294 0.025 yao © 0.99921 


So far we have always used a uniform step size in deriving approximate formulas. The main reason for such an 
approach is to simplify the derivations and make them less cumbersome. However, in practical applications, the step 
size is usually adjusted to calculations because there is no reason to keep it fixed. Of course, we cannot choose a step 
size too large because the numerical approximation may be inconsistent with an error tolerance, which is usually 
prescribed in advance. Therefore, the numerical method chosen for computations imposes some restrictions on the 
step size to maintain its stability and accuracy. 

The major disadvantage of halving and doubling intervals consists in the substantial computational effort required. 
Contemporary general-purpose initial value problem solvers provide an automatic step size control to produce the 
computed accuracy within the given tolerance. This approach gives a user confidence that the problem has been 
solved in a meaningful way. On the other hand, the step size should not be taken too small because the finer the 
set of mesh points, the more steps needed to cover a fixed interval. 

When arithmetic operations are performed by computers, every input number is transformed into its floating- 
point form and represented in its memory using the binary system. For instance, the number 7 is represented in 
floating-point form as 0.31415926 x 10!, where its fractional part is called the mantissa, and the exponential part is 
called the characteristic. Due to the important property of computers to carry out only a finite number of digits or 
characters, the difference between a number stored in a computer memory, Y,,, and its exact (correct) value, yp, is 
called the round-off error (also called the rounding error): 


Rn=%Y—-Yn, n=0,1,2,.... 


The rounding errors would disappear if computers had infinite capacity to operate with numbers precisely. Round-off 
errors can affect the final computed result either in accumulating error during a sequence of millions of operations or 
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in catastrophic cancellation. In general, the subject of round-off propagation is poorly understood and its effect on 
computations depends on many factors. If too many steps are required in the calculation, then eventually round-off 
error is likely to accumulate to the point that it deteriorates the accuracy of the numerical procedure. Therefore, 
the round-off error is proportional to the number of computations performed and it is inversely proportional to some 
power of the step size. 

One-step methods do not usually exhibit any numerical instability for sufficiently small h. However, one-step 
methods may suffer from round-off error accumulated during computation because higher order methods involve 
considerably more computations per step. 

Let us go to an example that explains round-off errors. Suppose you make calculations keeping 4 decimal places 
in the process. Say you need to add two fractions: 2 + 2 = 4. First, you compute 2/3 to get 0.6667. Next you 
add these rounded numbers to obtain 0.6667 + 0.6667 = 1.3334, while the answer should be equal to 4 = 1.3333. 
The difference 0.001 = 1.3334 — 1.3333 is an example of round-off error. A similar problem arises when two close 
numbers are subtracted, causing cancellation of decimal digits. Moreover, the floating-point arithmetic operations 
do not commute. 

Of course, a computer uses more decimal places and performs computing with much greater accuracy, but the 
principle is the same. When a lot of computations are being made, the round-off error could accumulate. This is well 
observed when a reduction in step size is made in order to decrease discretization error. This necessarily increases 
the number of steps and so introduces additional rounding error. 

Another problem usually occurs when a numerical method has multiple scales. For instance, the Euler rule 
Ynt1 = Yn thf (an; Yn) operates on two scales: there is y, and another term with a multiple of h. If the product 
hf, is small compared to yn, then the Euler method loses sensitivity, and the quadrature rule (3.2.6), page 147, 
should be used instead. 

For instance, consider an initial value problem 


y’ = cosa, y(0) = 107, 


on the interval [0,1]. Its exact solution is trivial: y= 10’ +sin(x). However, the standard Euler algorithm computes 
yo + h cos(0) + h cos(z1) +--+, whereas the quadrature rule provides yo + h(y(0) + y(a1) + y(a2) +--+), which is 
not computationally equivalent to the former. Indeed, if h is small enough, the product h cos(a) may be within the 
machine epsilon®” when added to yo = 10"; then it will be dropped, causing the numerical solution to be unchanged 
from the initial condition. 


1. Show that the cumulative error E, = ¢(an) — yn, where ¢ is an exact solution and yp is its approximation at x = @n, 
in the trapezoid rule (3.2.14) satisfies the inequality 
1+hL/2 


(Bizal (G5) |En| + Mh3(1 — Lh) 


for some positive constant M, where L is the Lipschitz constant of the slope function f. 
2. Suppose that f(x,y) satisfies the Lipschitz condition | f(x, y1) — f(x, y2)| < Lly1 — y2| and consider the Euler 6-method 
for approximating the solution of the initial value problem (3.2.1): 
Yn+1 = Yn le! 9) fn Ofn+1]; n=0,1,2,..., 


where fn = f(@n, yn) and fn4i = f(@n41, yn+1). For 0 = 0 this is the standard Euler method; for 6 = 1/2 this is the 
trapezoid method, and for 6 = 1 this is the backward Euler’s method. By choosing different values of 6 (0 < 6 < 1), 
numerically verify that the 6-method is of order 2 for f(x,y) = 2 (3a —y+2)7' and xo = 0, yo = 1. 


3. Show that the trapezoid method (3.2.14) is of order 2. 
4. Show that the midpoint Euler method (3.2.16) is of order 2. 
5. Consider the initial value problem y’ = x + y+ 1, y(0) = 1, which has the analytic solution y = ¢(x) = 3e* — x — 2. 


(a) Approximate y(0.1) using Euler’s method with h = 0.1. 
(b) Find a bound for the local truncation error in part (a). 
) Compare the actual error in y1, the first approximation, with your error bound. 
d) Approximate y(0.1) using Euler’s method with h = 0.05. 
) 


(e 


35For the IEEE 64-bit floating-point format, the machine epsilon is about 2.2 x 10~1S. 


Verify that the global truncation error for Euler’s method is O(h) by comparing the errors in parts (a) and (d). 
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3.5 The Runge-Kutta Methods 


In §3.2 we introduced the Euler algorithms for numerical approximations of solutions to the initial value problem 
y’ = f(x,y), y(vo) = yo on a uniform mesh v,41 = 2, +h, n = 0,1,...,N—1. The main advantage of Euler’s 
methods is that they all use the values of the slope function in at most two points at each step. However, the 
simplicity of these methods is downgraded by the relatively high local truncated errors, which leads to a requirement 
of small step size A in actual calculations, and as consequence, to longer computations where round-off errors may 
affect the accuracy. 

In this section, we discuss one of the most popular numerical procedures used in obtaining approximate solutions 
to initial value problems—the Runge-Kutta method (RK for short). It is an example of so-called self-starting (or 
one-step) numerical methods that advance from one point x, (n = 0,1,...,.N) to the next mesh point ¢,41 = @p+h, 
and use only one approximate value y,, of the solution at x, and some values of the slope function from the interval 
[2n,2n41]. Recall that slopes are known at any point of the plane where f(z, y) is defined, but the values of the 
actual solution y = ¢(a) are unknown. The accuracy of a self-starting method depends on how close the next value 
Yn+1 Will be to the true value $(%n41) of the solution y = (x) at the point « = x,41. This can be viewed as 
choosing the slope of the line connecting the starting point S(a,,, yp) and the next point P(an41, Yn+1) as close as 
possible to the slope of the line connecting the starting point S with the true point A (p41, 6(@n41)). 

The main idea of the Runge-Kutta method (which is actually a family of many algorithms including Euler’s 
methods) was proposed in 1895 by Carle Runge®® who intended to extend Simpson’s quadrature formula to ODEs. 
Later Martin Kutta®”, Karl Heun, then John Butcher [6], and others generalized these approximations and gave a 
theoretical background. These methods effectively approximate the true slope of SA on each step interval [%7y,%n+1] 
by the weighted average of slopes evaluated at some sampling points within this interval. Mathematically this means 
that the truncation of the Taylor series expansion 


= / h " hs m 
ti — tn i Pa oo, (3.5.1) 
is in agreement with an approximation of yn+1 calculated from a formula of the type 


Ynt1 = Yn + hlaof(@n, Yn) + Cis Lx + prh, yn + byh) 
+ aa f (tn + poh, yn + beh) + +++ + Opf (Ln + Uph, yn + bph)]. (3.5.2) 


Here the a’s, ju’s, and b’s are so determined that, if the right-hand expression of Eq. (3.5.2) were expanded in powers 
of the spacing h, the coefficients of a certain number of the leading terms would agree with the corresponding 
coefficients in Eq. (3.5.1). The number p is called the order of the method. An explicit Runge-Kutta method (3.5.2) 
can be viewed as a “staged” sampling process. That is, for each i (i = 1,2,...,p), the value yu; is chosen to determine 
the x-coordinate of the i-th sampling point. Then the y-coordinate of the i-th sampling point is determined using 
prior stages. 

The backbone of the Runge-Kutta formulas is based on the Taylor series in two variables: 


k 
fle+nu+s) =e ¢ a, >) f(x,y). (3.5.3) 


Ox 
k>0 


This series is analogous to the Taylor series in one variable. The mysterious-looking terms in Eq. (3.5.3) are under- 
stood as follows: 


fs) a\? 5 O'F ef ef 
tesa ley tt yt td ee Fe aan 
¢ Ox a =) al Ox? al Oxy ile Oy?’ 


36Carle David Tolmé Runge (1856-1927) was a famous applied mathematician from Germany. Runge was always a fit and active man 
and on his 70th birthday he entertained his grandchildren by doing handstands. 

87Martin Wilhelm Kutta (1867-1944) is best known for the Runge-Kutta method (1901) for solving ordinary differential equations 
numerically and for the Zhukovsky—Kutta airfoil. (The letter u in both names, Runge and Kutta, is pronounced as u in the word “rule.” ) 
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and so on. To simplify notations, we use subscripts to denote partial derivatives. For example, fr, = 07 f /Ox?. 
Since the actual derivation of the formulas (3.5.2) involves substantial algebraic manipulations, we consider in detail 
only the very simple case p = 1, which may serve to illustrate the procedure in the more general case. 

Thus, we proceed to determine ag, a1, 4, and b such that 


Yn+1 = Yn + hlaofn a anf (Gn + uh, Un se bh)]. (3.5.4) 


First, we expand f(a, + uh, Yn + bh) using Taylor’s series (3.5.3) for a function of two variables and drop all terms 
in which the exponent of h is greater than two: 


h? 
i (em + uh, Yn + bh) = fn + foph + fybh + > (i tes aI 2ubfry F. Be Fy) + O(h*), 


where fn = f(an,Yn), fe = fe(Xn; Yn), and so forth. The nomenclature O(h®) means an expression that is pro- 
portional to the step size h raised to the third power. Here we used the first few terms of the two-variable Taylor 
series: 
f(c+rjy+s)=f(e,y)+rfe(z,y) + sfy(z,y) +197 feo(a, y)/2 
+7 sfay(a,y) +8’ fyy(@,y)/2+0 [(lr| + |sl)"] - (3.5.5) 


Hence, Eq. (3.5.4) can be reduced to 


h3 
Yn+1 = Yn + A(ao + 01) fn + h?ai[ufe + bfy] + On 0 [L? fea + 2ubfey +b? fyy] + O(h*). (3.5.6) 


By chain rule differentiation, 
df(.y) Of , Af dy ay _ 
dz Ox Oy da’ dx 


with the same abbreviated notation, we obtain 


Tas oY =UTetThy)as yl" = [fox + 2f fay + f?foy + fy (fo + ffy)I,,> 


where all functions are evaluated at « = x,. Therefore, Eq. (3.5.1) becomes 


flz,y); 


2 


Yn+1 = Yn +hfn + . a oi the 


3 
+ - [fox + 2f foy +f? fyy + fylfe + ffy)], + OCP’). (3.5.7) 


Finally, we equate terms in like powers of h and h? in (3.5.6) and (3.5.7) to obtain the three conditions 


1 fn 
ag + Oy » Hea. HSS 
From the preceding equations, we find the parameters 
1 fin 
Qo C” be IC ’ IC ’ 


where C' = q, is an arbitrary nonzero constant, which clearly cannot be determined. Substituting these formulas 
into Eq. (3.5.4), we obtain a one-parameter family of numerical methods (one-parameter Runge-Kutta algorithms 
of the second order): 


h h 
Ynt+1 = Yn + (1 _ C)hf (tn, Yn) +Chf (« + ac" =F so (ent) +hTh, 


where AT}, is the local truncation error that contains the truncation error term 


he 2 a a\? Pe a a i" 
tm =(2-5) (F415) reo (Z+rg) sro i. (3.5.8) 
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or, after some simplification, 


h2 


Th = 94C [( 


AC = 3)yn. + 3fy(En; Yn) Yn] aT O(h®). 


There are two common choices: C = 1/2 and C = 1. For C = 1/2, we have 
h h 
Ynt1 = Yn + gf (en, Yn) + af len +h, yn +hf(fn;Yn)) + hTh, 
Wy 


h2 
where T;, = 7p Bfulen, Yn)y, — yll'| + O(h®). This one-step algorithm is recognized as the improved Euler method 
(3.2.15), page 153. For C = 1, we have 


Yn+1 = Un + hf (an Se h/2,Yn + hf (fn; Yn) /2) ale hTn, 


h2 
where T;, = an + 3fy(an,Yn)y] + O(h®). This one-step algorithm is the modified Euler’s method (3.2.16). Note 


that other choices of C’ are possible, which provide different versions of the second order methods. For instance, when 
C = 3/4, the truncation error term (3.5.8) has the simplest form. However, none of the second order Runge-Kutta 
algorithms is widely used in actual computations because of their local truncation error being of order O(h°). | 


The higher order Runge-Kutta methods are developed in a similar way. For example, the increment function for 
the third order method is 


Yn+1 = Yn + h(aky + bk2 + cks), 


where the coefficients k,, k2, k3 are sample slopes, usually called stages, that are determined sequentially: 


ky = f(@n, Yn); 
i) = f(an + ph, Yn 1 phky), 
kz = f(an + rh, yn + shky + (r — s)hk2). 


To obtain the values of the constants a,b,c,p,r, and s, we first expand kz and k3 about mesh point (%p, yn) ina 
Taylor series (the analog of Eq. (3.5.5)): 


kg 


2 
fn + ph fe + phky fy 7 (ph)? aise + ki fry + S hy| a O(h*), 
1 k2 
ka = fare + ha fy] + PDC — 8)(Fa+ hada Bu + (PH)? | Fon + ha Fay + 


The function y(a +h) is expanded in a Taylor series as before in Eq. (3.5.6). Coefficients of like powers of h through 
h® terms are equated to produce a formula with a local truncation error of order h*. Again, we obtain four equations 
with six unknowns: 


a+b+c=1, bp+er=1/2, bp? +cr? = 1/3, cp(r—s) = 1/6. 


Two of the constants a,b,c,p,r, and s are arbitrary. For one set of constants, selected by Kutta, the third-order 
method (or three-stage algorithm) becomes 


h 
Ynt1 = Yn + & (ky + 4k2 + ks), (3.5.9) 


where 
ky = f(@n, Yn); 
ky = fea air h/2,yn + ky h/2), 
k3 = f (an +h, yn + 2hke — hk). 


Note that if the slope function in Eq. (3.2.1) does not depend of y, then the formula (3.5.9) becomes the Simpson 
approximation of the integral in Eq. (3.2.3), page 146. 
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Table 176: Example 3.5.1: a comparison of the results for the numerical solution of the initial value problem 
y=y—1+cost, y(0) = 0, using the third-order Kutta method (3.5.9) for different step sizes h. The true values at 
the points t = 1 and t = 2 are $(1) = 1.3011686789397567 and (2) = 2.325444263372824, respectively. 

Approx. at t=1 |E|/h3 Approx. at t = 2 |E|/h> 
1.3011675995209735 0.0353704 2.3254337871513067 0.343285 
1.3011685445838836 0.0352206 2.3254430282232250 0.323787 
1.3011686621826817 0.0351421 2.3254441138396460 0.313594 
1.3011686768475141 0.0351020 2.3254442449916843 0.308384 
1.3011686786783776 0.0350817 2.3254442610948014 0.305751 


Coefficient kz = f (tn + 4,4 + 4k) 


Yn + hky 
hke 


Yn + oy 


Ln In+# In th Dn In +4 Inth Ln + 22 
h h 
Coefficient k3 = f (ee + 4 un + s ka) Coefficient ky = f (tn +h, yn + hks) 


Yn thky 


(tn + 4, Um + ke) 
Yn 


h h h ! 
Ln, Intz Inth tp + Ln Inty nth In + 


Figure 3.10: Classical Runge-Kutta algorithm. 


Example 3.5.1: (Example 3.2.1 revisited) Let us start with the initial value problem for a linear differential 
equation where all calculations become transparent. For the slope function f(t, y) = y—1+2 cost from Example 3.2.1, 
page 148, the stages on each mesh interval [t,,, tr41] of length h = t,41 —t, in the Kutta algorithm (3.5.9) have the 
following expressions: 


ky = Yn —1+2cos(tn), 


178 Chapter 3. Numerical Methods 


n=» (143) (1+3) +h eosin +2005 (mn +2) 


h 
kg = (1 +h +h?) (yn — 1) + 2(h? — h) cos(tn) + 4h cos (« + 5) + 2cos(tn4i). 


For this slope function, the Kutta algorithm (3.5.9) requires 24 arithmetic operations and 3 evaluations of the cosine 
function at each step. On the other hand, the improved Euler method (3.2.15), which is of the second order, needs 13 
arithmetic operations and 3 evaluations of the cosine function. The Taylor series approximation of the third order, 


2 3 


h h 
Ynt1 = Yn(1 +h) +h (2cost, — 1) + > (Yn —1+2cost, — 2sint,) + — 


5 (Yn —1—2sint,), 


requires 13 arithmetic operations and 2 evaluations of the trigonometric functions on each step. The following table 
confirms that the Kutta method (3.5.9) is of order 3. 


All the fourth-order algorithms are of the form 
Yn+1 = Yn + h(aky + bke + cks + dkza), 


where k,,ka,k3, and ky are derivative values (slopes) computed at some sample points within the mesh interval 
[2n,2n41], and a, b, c, and d are some coefficients. The classical fourth order Runge-Kutta method (see 
Fig. 3.10) is 


h 
Ynt1 = Yn + & (ky + 2k2 + 2k3 + ka), (3.5.10) 


where 


kz = f (an + $,Yn 2 
ka = f(@n +h, yn + hks). 


Example 3.5.2: Consider the nonlinear differential equation y’ = (2 + y — 4)? subject to y(0) = 4. This initial 
value problem has the exact solution y = 4+ tan(a#) — x. Using the third-order Kutta method (3.5.9), we obtain the 
following values of coefficients: 


h h ; 
ki = (tan +n —4)*, kn = (00+ 5 +m + Blen +o — 4)? 4) 3 
k3 = (an +h+ yn + 2hk2 — hk — 4)*. 
(on Pe BV" | 2 : nd \2 
In particular, for n = 0 (first step), we have yy = 4+ $ A?+ (ht ci since ky = 0, 4kg = h*, kg = (n + s) : 


The algorithm (3.5.9) requires 22 arithmetic operations at each step. The Taylor series approximation of the third 
order, 


requires 11 arithmetic operations. On the other hand, the classical fourth order Runge-Kutta method (3.5.10) asks 
to calculate four slopes: 


Be No 

ky = (tn + Yn — 4)’, n= (mtu 44 545%) 
h h.\? P 

k3 = Zn +¥n—4+5 +5 ka ; kg = (an + Yn —44+h+hkz)’. 


Therefore, the algorithm (3.5.10) requires 25 arithmetic operations at each step. 


Although the Runge-Kutta methods look a little bit cumbersome, they are well suited for programming needs, 
as the following codes show. For example, in Maple the classical fourth order formula (3.5.10) can be written as 
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x[0]:=0.0: y(0]:=0: h:=0.15: # initial condition 
for i from 0 to 9 do 

x[it1] := x[il]+h: 

ki:=f(xfi] , ylil); 

k2:=f(x[i]t+h/2 , y[i]+h*k1/2); 

k3:=f(x[i]+h/2 , y[i]+h*k2/2); 

k4:=f(x[it1] , y[i]+h*k3); 

y [it] :=y [i] +h* (k1+2*k2+2*k3+k4) /6: 

od: 

ptsapp1 := seq([x[i],y[i]],i=1..10): 
ptssoli := seq([x[i] ,fsol(x[i])],k=1..10): 
plapp:=pointplot ({ptsapp1}): 
display (psol,ptapp) ; 
errorplot := pointplot( {seq([x[i] ,fsol(x[i])-yli]],i=1..10)}): 
display (errorplot) ; 


Mazima and MATLAB have a special solver based on scheme (3.5.10), called rk (load dynamics) and ode45 
(actually, it is a more sophisticated routine), respectively. Then the script to solve the differential equation y’ = 
f(a,y) (the slope function f(a,y) is labeled as FunctionName) subject to the initial condition y(x0)=a0 on the 


interval [x0,xf] and to plot the solution is 
[x,y] = ode45(@FunctionName, [x0,xf], a0); plot (x,y) 


For example, if the slope function is piecewise smooth 


_ f ysin(ra/3), if O< a <3, 
F(t,y) = { 0, otherwise, 


function f=FunctionName (x,y) 
f=y*sin(pi*x/3) .*(x<=3) ; 
We present two other popular fourth order Runge-Kutta type algorithms. The first one, ascribed to Kutta, is 


we define it with the following MATLAB script: 


Yn+1 = Un = (ky 3ke t 3k3 + ka), (3.5.11) 
ky = In = P(t n1Yn); " 
ko = f (iy, + 3 Yn + 3 h1) ; 

kg = f (tn + Yn — $k + Ako) , 
ka = f(@n +h, Yn + hky — hke + hks). 
Another widely used fourth-order method is one credited to S. Gill [17], 


where 


h 1 1 
ntl = Yn + — [ki +2(1— — ) ko +2114 —) kg thal, 3.5.12 
Ynt+1 y = | ( =) 2 ( =) 3 | ( ) 
ky = f(@n, Yn); 


ko = f (an +4%,4%m+%h), 
ky = f (tn + 4.9m — (4 — Yq) hh + (1- Jp) hho) , 
ka = f (tn + hy Yn — ghke + (1+ =) hk). 


where 


Example 3.5.3: (Example 3.2.6 revisited) | We compare the results of numerical calculations for different 
Runge-Kutta methods: the third-order (3.5.9), the classical fourth order Runge-Kutta method (3.5.10), the Kutta 
formula (3.5.11), and Gill’s method (3.5.12). For this purpose we consider the linear differential equation with 
oscillating slope function y = 27 y cos(2zt) subject to the initial condition y(0) = 1. The actual solution of this 
initial value problem is y = ¢(t) = e&™@74), 

Gill’s algorithm for this initial value problem requires evaluation of the slope function at four sample points. 


ky = fn = f (fn, Yn) = 27 Yn cos(2rtn), 
ka = 2mypn [1 + thcos(27t,,)| cos(2rt, + Th), 
kg = 2m [yn — By + hho + A (kr — k)] cos(2nt, + Th), 


ka = 2m |ym — 2 ko + (1 + +5) hks| cos(2mtn + 2h). 
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The approximate value y,,41 of the solution at the next step 2,41 = %, +h is obtained according to formula (3.5.12), 
which requires 45 arithmetic operations and 3 evaluations of cosine function. The classical fourth order Runge-Kutta 
method (3.5.10) is less costly—it requires only 30 arithmetic operations and the same number of cosine evaluations. 


At the end, we present results of numerical calculations with constant step size h = 0.1 for the IVP y’ = 
(2/2) (y—Ina)!/2 +1/2, y(1) = 0, where the actual solution y = ¢(x) = (Inz)? +Inz is a monotonically increasing 
function, using the third-order Kutta algorithm (3.5.9), the Kutta formula (3.5.11), the classical fourth order Runge- 


Kutta method (3.5.10), and Gill’s method (3.5.12). 


Table 180: A comparison of the results for the numerical solutions of the initial value problem y = 27 y cos(2zt), 
y(0) = 1 using the third-order and fourth order Kutta algorithms (3.5.9) and (3.5.11), the classical Runge-Kutta 


method (3.5.10), and Gill’s rule (3.5.12) with step size h = 0.1. 
x Exact Kutta-3 Kutta-4 Classical Gill 
1.0 1.000 1.0143670292 0.9930807904 0.9992873552 0.9992873552 
2.0 1.000 1.0289404699 0.9862094562 0.9985752184 0.9985752184 
3.0 1.000 1.0437232876 0.9793856663 0.9978635890 0.9978635890 
4.0 1.000 1.0587184905 0.9726090915 0.9971524667 0.9971524667 


In all problems, use a uniform mesh obtained by a partition with grid points cz, = xo + nh, n = 0,1,.... As usual, denote 
by yn the approximate value of the actual solution at = z,. Throughout, primes denote derivatives with respect to x, while 


dots denote derivatives with respect to t. 


1. For the given initial value problems, use the classical Runge-Kutta method (3.5.10) with h = 0.1 to obtain an approx- 
imation to y(2.5), the value of the unknown function at x = 2.5. Compare your result with the true value obtained 


from the actual solution. 


(a) y’=(y+a—4)*, y(0)=2. (b) yi =y?/(e+1), y(0) =1/2. 
(ce) y'=(+y-3)/@-yt, yQ2)=2. (d) y= 2y+4e?, (0) =0. 
(e) y=ycost+2cos*t, y(0)=1. (f) y=yt+2cost, y(0)=0. 


2. Redo Problem 1 using the optimal second order Runge-Kutta scheme: 


3h 


h 
Ynt+1 = Yn + q f(2n, Yn) + oe 


2h 2h 
f in +=, yn + = flan, yn) ’ n=0,1,2,.... 
3 3 
8 


3. Redo Problem 1 using Ralson’s method (which corresponds C = 2/3 in Eq. (3.5.8)): 


h 2h 3h 3h 
Yn+1 = tnt BHO) +f (ant Bat fentn)) ’ n=0,1,2,.... 


4. Redo Problem 1 using each of the following third order approximations: 


(a) the third order Kutta method (3.5.9); 
(b) the Nystrom method: 


(2k, + 3k2 + 3ks), n=0,1,2,..., 


2h 2h 2h 2h 
ki = f(&n, yn), in =$ (en be Uno ti), k3 aCe t 3B un t at kn) 


where 


3 


(c) the nearly optimal method (which is used in MATLAB function ode23): 


h 
Ynti = Yn + 9 (2k + 3ko + 4ks), m=O 1 2s os; 


in which the stages are computed according to 


h 
ki = f(@n, Yn), in=t (an t gyn t 5h). ks = 5 (a t Tey t ka) 


(d) Heun’s algorithm: 


h 
Ynt1 = Yn + 7 (ki + 3ks), = 0,4, Dace: 


where 


(3.5.13) 


(3.5.14) 


(3.5.15) 


(3.5.16) 


(3.5.17) 
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5. Solve numerically the following initial value problems using the classical Runge-Kutta method with h = 0.1 and 
h = 0.01. Determine the numerical value of the solution 2 units from the initial position at « = 0, and then compare 
with the true value. 

(a) y=y +4, y(0)=0. (b) yf =32%(y—y?), y(0) = 1/2. 

(eh ye ek Oe A ee. 

(fe) geverre-t, vos fy geet re), oy t 
(g) y=3y—2e+5, yO)=1. (h) yo =cos(y—a+7/2), (0) =0. 

6. Consider the following initial value problems for which closed form solution formulas are not available. Apply the 
classical Runge-Kutta method (3.5.10) to these problems with h = 0.1, h = 0.05, and h = 0.025 to see whether the 
numerical solutions settle down. Pick the interval of interest to be 2 units from the initial position at x = 0 and find 
approximations at x = 2. 

(a) y’=2?—-2cos(y), y(0)=1. (b) y’=siny+sinz, y(0)=0. 
(c) yi=a?ty?, y(0)=1. (d) yx =yVe?+y? +3, y(0)=1. 
(e) y’ =2re %+sin(ry), y(0)=0. (f) y’ =cos (x? + y’) , y(0) =1. 
(g) yi =y %-227, y(0)=1. (bh). Seve, 2G =. 

7. Redo the previous problem using the Kutta method (3.5.11) 

8. Redo Problem 5 using the Gill method (3.5.12). 

9. Using a second order implicit method 

h 
Yn+1 = Yn t 2 (ky t ko) ; (3.5.18) 
where a4 h = 8 
ky = tn + ——h, yn + — ki + — = hko), 
a ( we 7 a AaB ) 
V34+1 V3+2 h ) 
ko = In + ——=h, yn + — = hki+—ke), 
2=f ( a3 y rE it 7 he 


10. 


11. 


12. 


find a four-decimal-place approximation to y(1.0) with step size h = 0.1 of the following IVPs. 
(a) y’=yte", yO)=-1 (b) y=—4t y, y(0) =2. 
(c) y= 2ey-vVy, y(0)=1. (d) y= 2ty+1, y(0) = 2. 
Compare the numerical results with the exact solution. Express the exact solution to part (d) in terms of the error 
re ae 
function, erf(t) = = | e” dz. 
Vi Jo 


Redo Problem 5 using Butcher’s (1964) fifth order RK method 


Yer = ye + a (7ki + 32k3 + 12k4+32k5+7ke),  k=0,1,..., (3.5.19) 
where ky = f (rr, Yk); ko =f (rr + h/4, yr t hk /4), kg = f (tr + h/4, yr t kyh/8 t k2h/8), 
ka =f (tr h/2, yr koh/2 } k3h), ks = f (ae + 3h/4, ye + 3k1h/16 + 9ksh/16), 
ke = f (tx +A, yx — 3kih/7 + Qkoh/7 + 12kgh/7 — 12kah/7 + 8ksh/7). 


Select one of the following initial value problems and compare the numerical solutions obtained with the classical 
fourth-order RK formula (3.5.10) and the fourth-order Taylor series. Use different values of step size h = 27”, for 


n = 2,3,...,7, to compare with the true value at the point x = 2. 
(a) y’ =1-2/(4y), y) =1. (b)  y' = Say/(2* —y*), yl) = 2. 
(ce) ay =yt+V/e7?+y?, y(1) =1. (d) y’ =2(=4) » yAj=1. 


(e) sy =y+10Ve?-y, yI)=1. (fF) yo =y-vy yl) =1/4. 

Stiffness is a subtle, difficult, and important concept in the numerical solution of ordinary differential equations. It 
depends on the differential equation, the initial conditions, and the numerical method. A problem is stiff if the solution 
being sought varies slowly, but there are nearby solutions that vary rapidly. The following example, credited to Larry 
Shampine, shows that a practical model of flame propagation is stiff. 

If you light a match, the ball of flame grows rapidly until it reaches a critical size. Then it remains at that size because 
the amount of oxygen being consumed by the combustion in the interior of the ball balances the amount available 
through the surface. Let y(t) represent the radius of the ball at time t, and let y be its derivative with respect to t. 
The simple model is y = y? — y?, y(0) = 6, where 0 < t < 2/6, and terms y? and y® come from the surface area and 
the volume. The critical parameter is the initial radius 6. 

First, solve the given initial value problem analytically since the slope function is autonomous. Second, use a computer 
solver based on the Runge-Kutta method to solve this problem numerically for two different values, 6 = 0.01 and 
6 = 0.00001, to observe the stiffness. For instance, MATLAB has a special subroutine ode23s to solve some stiff problems: 
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Summary for Chapter 3 


delta = 0.01; F = @(t,y) y°2 - y*3; 


opt=odeset (’stats’,’on’);figure;title({’ode45, \delta = ’,delta});hold on 
disp(’**Statistics for ode45**’) ;ode45(F,[0 2/delta] ,delta, opt) ; 
figure;title({’ode23s, \delta = ’,delta});hold on; 


disp(’**Statistics for ode23s**’);ode23s(F,[0 2/delta] ,delta, opt) ; 


Solve the initial value problem y’ = y,/y? — 4 with y(0) = V5 by the Runge-Kutta method (3.5.10) on the interval 
[0, 0.55], and account for any difficulties. Then, using negative h, solve the same differential equation on the interval 
with the initial value y(0.55) = 280. 


Consider the IVP: y' = 27+ y?, y(0) = 0. To solve the problem numerically, apply the Runge-Kutta methods of order 
4: (3.5.10), (3.5.11), and (3.5.12). Calculate the number of arithmetic operations required per step. 


Summary for Chapter 3 


1. 


13. 


14. 


15. 


16. 


17. 


18. 
19. 


A recurrence is an equation that relates different members of a sequence of numbers {yo, y1, y2, 
...}, where the values of yn of the sequence are unknown quantities that we wish to determine. 


The order of a recurrence relation is the difference between the largest and smallest subscripts of the members of 
the sequence that appear in the equation. The general form of a recurrence relation (in normal form) of order p is 
Yn = f (Yn-1; Yn—2;-++;Yn—p) for some function f. A recurrence of a finite order is usually called the difference equation. 


A constant coefficient recurrence relation of the first order yn = f(yn—1) has an equilibrium solution y = yn (which 
does not depend on n) if y = f(y). 


A constant coefficient linear nonhomogeneous difference equation yn+1 = pyn + dn has the general solution (3.1.6). If 
Qn = aq”, its exact solution is given by formulas (3.1.9), (3.1.10). 


The first order variable coefficient difference equation Yyn+1 = PnYn + dn has a solution (3.1.12). 


If a small change in the initial value has produced a large change in the solution, we call such a problem (recurrence) 
ill-conditioned. 


The Euler method, or the tangent line method, is the simplest numerical method for solving the initial value problem 
y’ = f(x,y), y(xo) = yo. For computing approximate values yn of the solution $(x,) at the mesh points x, = xo + nh, 
(n = 0,1,2,...), the Euler approximation is defined by Eq. (3.2.5) (recall that fn = f(&n,Yn))- 


The backward Euler method is defined in Eq. (3.2.12). 
The trapezoid rule (3.2.14) is another example of an implicit one-step method. 


The improved Euler formula, or the Heun formula, or the average slope method, is defined by Eq. (3.2.15). 


. The modified Euler formula is given in Eq. (3.2.16). 


. The round-off error is the difference between an approximation, yn, and its computed numerical value using the 


floating-point presentation, Y;,. 


The global truncation error (also called accumulative error) at the n-th step is the difference En = $(@n) — Yn 
between the true value ¢(#,) and its approximation. 


The local truncation error for the difference one-step method yn+1 = yn th ®(an, yn; ynti;h), (n = 0,1,2,...) is the 
error that the increment function, h ®, causes during a single iteration, assuming perfect knowledge of the true solution 
at the previous iteration (called the reference solution). 


If in the IVP y’ = f(z, y), y(a) = ya, the function f(z, y) is sufficiently differentiable, then calculate its derivatives by 
means of the recurrence 

FO =f FEV SAP + APF, b= 0,1,2,.... 
We define the operator T,(x,y;h) by Eq. (3.3.7) to advance from yn to yn41 = Yn +ATp(az, y;h). 


Taylor’s algorithm of order p to find an approximate solution of the differential equation y’ = f(a, y) subject to 
y(a) = Ya over an interval [a,b] is outlined on page 163. 


The methods associated with the names of Runge, Kutta, and others are based on the following approximation of the 
initial value problem y’ = f(x,y), y(xo) = yo: 


yng = Yn + hlaof (an, yn) + arf (an + pih, yn + bh) + a2f(an + pah, yn + beh) + +++ + apf (an + Mph, yn + bph)]. 
The third order Kutta method is given in Eq. (3.5.9). 


There are several fourth order Runge-Kutta methods. We present three of the most popular algorithms: the classical 
Runge-Kutta method (3.5.10), the Kutta scheme (3.5.11), and Gill’s algorithm (3.5.12). 
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Review Questions for Chapter 3 


Section 3.1 
1. Solve the given first order difference equations in terms of the initial value a9. Describe the behavior of the solution as 
n — oo. 
(a) ynt1 = —1.2 yn; (b)  yn41 = SE yn; (Cc) Yntt = 4/ SH an; 
(4) ynt1 = (0.5)" 7 yns (eC) nt = 1.8 yn — 8; (fF) yng = —0.4 yn +7. 


2. An investor deposits $2,500 in an account paying interest at a rate of 4% compounded quarterly. She also makes 
additional deposits of $50 every quarter. Find the account balance in 5 years. 


3. A man takes a second mortgage of $250,000 for a 30-year period. What monthly payment is required if the interest 
rate is 5.75%? 


4, Find the effective annual yield of a bank account that pays interest at a rate of 4% compounded weekly. 


5. A college student borrows $40,000 for a flashy car. The lender charges annual interest at a rate of 6.0%. What monthly 
payment is required to pay off the loan in 10 years? What is the total amount paid during the term of the loan? 


6. If the interest rate on a 15-year mortgage is fixed at 7.0% and $700 is the maximum monthly payment the borrower 
can afford, what is the maximum mortgage loan possible? 


7. A man wants to purchase a yacht for $300,000, so he wishes to borrow that amount at the interest rate 6.0% for 10 
years. What would be his monthly payment? 


8. A home-buyer wishes to finance a mortgage of $300,000 with a 15-year term. What is the maximum interest rate the 
buyer can afford if the monthly payment is not to exceed $2,500? 


Section 3.2 of Chapter 3 (Review) 


In all problems, use a uniform grid of points rx, = ro + kh, k = 0,1,2,...,, where h = (an — xo) /n is a fixed step size. 
A numerical method works with a discrete set of mesh points {x;,} and the sequence of values yo, y1, ---,; Yn, such that each 
Yk is approximately equal to the true value of the actual solution ¢(z) at that point x,, k =0,1,...,n. Throughout, primes 


denote derivatives with respect to x, and a dot stands for the derivative with respect to t. 


1. Consider the following autonomous differential equations subject to the initial condition y(0) = 1 at the specified point 


£=, 
(a) y=e%/?; (d) y=y(1—y/5); (g) y= 1/(4y? + 3y”); 
(b) y= 44+; (e) gy=(y? + 1)/y’; (h) y= (y/2-1)°; 
(c) g=y*"; (f) y = 3y? —y/2; (i) y = 5y (1 — y/3). 


Use the Euler rule (3.2.5), Heun’s formula (3.2.15), and modified Euler method (3.2.16) to obtain an approximation at 
t = 1.2. First use h = 0.1 and then use h = 0.05. Then compare numerical values with the true value. 


2. In each problem (a) through (i), use an appropriate method to find the actual solution y = ¢(x) to a given differential 
equation subject to the initial condition y(0) = 1. Calculate the true value $(1.5) of the actual solution at the point 
x = 1.5. Then apply the average slope algorithm (3.2.15) and the midpoint rule (3.2.16) to determine its approximate 
value using a uniform grid with a constant step size h. Obtain at least a four-decimal place approximation to the 
indicated value. First use h = 0.1 and then use h = 0.05. Which of these two numerical methods gives the better 


approximation? 

(a) y! = (w— 4y +5)”; (d) y/ = 2a (y—«? — 0.5)’; (g) y' = Vera; 
(b) y= 4a +y; (e) y' =(y—2)/(x +1); (h) yy! =6x+6+y; 
(c) y =5+(y—-2)’; (f) y' = (4x — y)?; (i) y’ =cos(y + 2). 


3. The initial value problem y = \/y, y(0) = 0 has two distinct solutions for z > 0: yi(x) = 0 and yo(x) = 27/4. 
Which of the five numerical methods (Euler, backward Euler, trapezoid, Heun, and midpoint) may produce these two 
solutions? 


4. Answer the following questions for each linear differential equation (a) — (f) subject to the initial condition y(0) = 1 


at x = 0. 
(a) y =3-y; (c) y’—5y+8a = 10; (e) (4+27)y' = 4ay — 82; 
(b) y’ =2y+8r+4; (d) y’ + 2y + 8a = 0; (f) y’ =a(a? +y). 


(i) Find the actual solution y = ¢(z). 
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Review Questions 
(ii) Use Euler’s method yz41 = yx + hf, to obtain the approximation yz at the mesh point xx. 
(iii) Use the backward Euler method to obtain the approximation y, at the mesh point xx. 
(iv) Use the trapezoid rule to obtain the approximation y;, at the mesh point xx. 
(v) Use the improved and modified Euler formula to obtain the approximation yx at the mesh point xx. 
) 


(vi) For h = 0.1, compare numerical values ys found in the previous parts with the true value ¢(0.6). 


Consider the following separable differential equations subject to the initial condition y(xo) = yo at the specified point 


(a) ec? y'+cos*y=0, y(0)=1. (b) y’ =e" (cosx—2xr), y(0) =e. 
(c) «?(2—y)y’=(e@—2)y, y)=1. (d) y= 3e7y—y/e, y(1)=1. 

(ec) y=er4, y(0)=1. (f)  y'+3a?(y+1/y)=0, y(0) =4. 
(g) y’=ysinz, y(0)=1. (h) t?y+secy=0, y(1)=0. 


Use the Euler method (3.2.5), Heun formula (3.2.15), and modified Euler method (3.2.16) to obtain a four-decimal 
approximation at zo + 1.2. First use h = 0.1 and then use h = 0.05. Then compare numerical values with the true 
value. 


Consider the following initial value problems for the Bernoulli equations. 

(a) y'=y(8e—y), yO)=1. (b) yi =4e°y—27y?, yO) =1. 

(c) y+3y=ay, y)=1. (a) y'+6ay=y™*, y(0)=1. 

(e) 2a—-y*)y’=y, yO)=4 (f) (y-a)y’=y, y(0)=1. 
First find the actual solutions and then calculate approximate solutions for y(1.5) at the point « = 1.5 using the 
Euler method (3.2.5), trapezoid method (3.2.14), Heun formula (3.2.15), and modified Euler method (3.2.16). First use 
h = 0.1 and then h = 0.05. 


Consider the following initial value problems for the Riccati equation. 
(a) yy! =2-2y/ety?/e*, yA)=7/3. (b) y= 30° +y/et3ey’, y(1)=1. 
Use the Euler rule (3.2.5), trapezoid method (3.2.14), Heun formula (3.2.15), and midpoint rule (3.2.16) to obtain a 
four-decimal approximation at x = 2.0. First use h = 0.1 and then use h = 0.05. 
Consider the following initial value problems for which analytic solutions are not available. 
(a) y’=sin(y’—2), y(0)=1. (b) yaa? ty? yO)=1L 
(c) y=(y—a)x*y’,  y(0) =1. (d) y=y?/l0—a2*, yO)=1. 
(ce) y'=y°/l0+sin(x), yO)=1 (f) y’=27/? —cos(y),  y(0) =1. 
Find approximate solutions for y(1.5) at the point « = 1.5 for each problem using the Euler rule (3.2.5), trapezoid 
method (3.2.14), Heun formula (3.2.15), and midpoint rule (3.2.16). First use h = 0.1 and then h = 0.05. 
Consider the initial value problem y’ = (4+.2?)~1 — 2y?, y(0) = 0, which has the actual solution y = ¢(x) = 2/(4+27). 


a) Approximate ¢(0.1) using Euler method (3.2.5) with one step. 


(c 
(d 


(a) 

(b) Find the error of this approximation. 
) Approximate $(0.1) using Euler method with two steps. 
) 


Find the error of this approximation. 


Repeat the previous problem using the improved Euler’s method (3.2.15). 
Repeat Problem 9 using the modified Euler’s method (3.2.16). 


Which of two second order numerical methods, improved Euler (3.2.15) or modified Euler (3.2.16), requires fewer 
arithmetic operations performed at each step when applied to the IVP y/ =a? +47, y(0) = 0? 


Show that Euler’s method can be run forward or backward, according to whether the step size is positive or negative. 


Apply the Heun method with iteration (see Example 3.4.3, page 172) to integrate y’ = 5e°°” —y from x =O tor=1 


with a step size of h = 0.1. The initial condition at x = 0 is y = 2. Employ a stopping criterion of 0.000001% to 
terminate the correct iteration. 


Can Euler’s method be used to solve the initial value problem 
g=4ty’, — y(0)=0 


over interval [0,1]? 
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16. In psychology, the Weber—Fechner law was first published in 1860 by a German philosopher, physicist, and experimental 
psychologist Gustav Theodor Fechner (1801-1887), a student of Ernst Heinrich Weber (1795-1878) from Leipzig Uni- 
versity, one of the founders of experimental psychology. This law states that the rate of change dR/dS of the reaction R 
is inversely proportional to the stimulus S. Use Heun’s method with step size h = 0.1 to solve the initial value problem 


a =3,  R(01)=0 
over interval [0.1, 5, 2]. 

Section 3.3 of Chapter 3 (Review) 

1. Solve the IVP y’ = 1 — y?, y(0) = 0 by the Taylor algorithm of order p = 3, using the steps h = 0.5 and h = 0.1, and 
compare the values of the numerical solution at x, = 1 with the true value of the exact solution. 


2. This exercise addresses the problem of numerical approximations of the following irrational numbers e % 2.718281828, 
In2 © 0.6931471806, 7 ¥ 3.141592654, and the golden ratio (1+ V5)/2 & 1.618033988. In each case, apply the second 
and the third order polynomial approximation with the step size h = 0.01 to determine how many true decimal places 
you can obtain with such approximations. 

(a) The number e = y(1), where y(zx) is the solution of the IVP y’ = y, y(0) =1. 
(b) The number In2 = y(2), where y(z) is the solution of the IVP y’ = 271, y(1) =0. 
(c) The number 7 = y(1), where y(2) is the solution of the IVP y’ = 4/(1 +27), y(0) =0. 


EAE art) is the solution of the IVP y’ = jy. 4 — ‘ 


(d) The number (1+ V5)/2 = y(2), where y(t) = 


TO /3=t 9-02 | Vaat 
=A 19 
y(0) = 5 +2V3— Se. 
3. Consider Taylor series approximation of order n = 4: 
3 k k 
h O 0 
Init = tn +hPilen unit) =n th De (A412) f(t,y) ; 
k=0 : y cee 


where h is step length and an41 =2%n +h (n =0,1,...) are the mesh points. 

The Richardson improvement method can be used in conjunction with Taylor’s method. Let the interval of interest be 
[0, b], where b = 1 is the right end. If Taylor’s method of order n = 4 is used with step size h, then y(b) © yn + Ch* 
for some constant C’. If Taylor’s method of order n = 4 is used with step size 2h, then y(b) © yen + 16C h*. The terms 
involving C'h* can be eliminated to obtain an improved approximation for y(b): 


16 yn — Yor 
b) & ————_.. 
y(b) 5 
Use this improvement scheme to obtain better approximation to y(1) with h = 1/2 and h = 1/4, where y(z) is the 
solution of the initial value problem 
y = 9a + 3y with y(0) = 1. 


Section 3.4 of Chapter 3 (Review) 
1. For the initial value problem y’ = 2x — y, y(0) = 0, 


(a) find the actual solution y = ¢(2); 
2 3 4 


(b) using the Maclaurin expansion of the exponential function e* = 1+ 24 a | = | a ----, expand ¢(x) ina 


power series with respect to 2; 


(c) for a uniform grid with step size h, verify that the Euler method and the backward Euler method have the local 
truncation error O(h?), and the Heun formula, the modified Euler method, and the trapezoid rule all have the 
local truncation error O(h?). 


2. Numerically estimate the step size that is needed for the Euler method (3.2.5) to make the local truncation error less 
than 0.0025 at the first step. 

(a) y=2t?-3y", y0)=1 (b) y=t-2Vy%, (0) =1; 
(c) g=vtity, yO=1 (d) yat-e™, y(0)=1. 

3. For each of the following problems, estimate the local truncation error, T,(h), for the Euler method in terms of the 
reference solution y = ¢(t). Obtain a bound for T;,(h) in terms of ¢ and ¢(t) that is valid on the closed interval 0 <t <1. 
Find the actual solution and obtain a more accurate error bound for T;,(h). Compute a bound for the error T,(h) in 
the fourth step for h = 0.1 and compare it with the true error. 


(a) y=1-2y, y0)=2; (b) 2=tt+y, yO=h (Cc) y=t—-y, y(0)=0. 
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4. Redo Exercise 3 using the Heun formula (3.2.15) instead of the Euler method (3.2.5). 
5. Estimate the largest stepsize for which Euler’s rule (3.2.5) remains stable on the interval [0,1] for the initial value 
problem: y’ + 3a?y = x”, y(0) = 2. 


6. Some efficient algorithms for integration of initial value problems may produce instability. When such unstable schemes 
are employed, we may observe numerical solutions that are qualitatively different from the true solutions. Such solutions 
are called “ghost solutions.” 


Consider the initial value problem for the logistic equation 


y=y(l-y), —y(0) = yo, 
where yo = 0.5 is given. In order to integrate the logistic equation, employ the central difference scheme 


Yn+1 — Yn-1 


= Yn (1 — Yn 
oh Yn (1 — yn) 


with initial conditions yo (given exactly to be 0.5) and yi = yo + hyo (1 — yo), computed by Euler’s algorithm. 
Compute a numerical solution by the central difference scheme using fixed time-mesh length h = 0.1 and make at least 
500 iterations. Then plotting the results, observe a ghost solution. 


Section 3.5 of Chapter 3 (Review) 


1. For the given initial value problems, use the classical Runge-Kutta method (3.5.10) with h = 0.1 to obtain a four- 
decimal-place approximation to y(0.5), the value of the unknown function at x = 0.5. Compare your result with the 
actual solution. 

(a) y =(y+4e-1)?, y(0)=1. (b) y!/ =3y+272?, y(0) =0. 
1 


(c) yf =y7/?, y(0) =1. (4) y=a/y, y(0)= 
2. Consider the following initial value problems. 
(a) yl =yt20?, y(1) =2. (b) y=(e? +y")/(e-y), y(3)=1. 
(c) y’=3y—ae+5, y(lI)=2 (d) y! =4ay/(e?—y*),  y(1) =3. 
(ec) y=a?t+ Vy, y(0)=4. (f) y= (ay +4y*)/(y? +4), y(0) =1. 
(g) y’ =2y+27, y(0) =0. (h) y' =2y—5r+3, y(3)=4. 
Using the classical Runge-Kutta method (3.5.10) with step size h = 0.1, compute approximations at 2 units from the 


initial point. 
3. Find the actual solutions to the following Bernoulli equations subject to specified initial conditions. 
(a) 2y'=y—a/y, y(1) =2. (b) y= 3y—3ry, y(0) = 1. 
(ce) y + 2ay + 2ay fai, y= (d) ay tye ay yl) = 1/9. 
(ce) yt+ytanc=y"/2, y(0)=1 (ff) sy -y=ar'y", yl) =1/9. 
(g) y+2ey=2ey°, y(0)=2 9 (bh) ay’ ty=27y?Inz, y(1) = 1/2. 
Compare the true values evaluated at two units away from the initial position with approximations found with the aid 
of the following Runge-Kutta methods with fixed step size h = 0.1. 


(a) The fifth order Butcher’s scheme (3.5.19). 


(b) The second order implicit method (3.5.18). 
(c) The fourth order classical method (3.5.10). 
(d) The fourth order Gill’s method (3.5.12). 
(e) The fourth order Kutta method (3.5.11). 


4. Consider the first-order integro-differential equation subject to the initial condition: 


y (x) =2.3y—0.01y” o1y [ y(x)dx, —_y(0) = 50. 
0 


Use the Heun method with h = 0.001 over the interval [0, 2.0], and the trapezoidal rule to find an approximate solution 
to the problem. 


Chapter 4 


Response for square pulse Filter output for periodic input 


Second and Higher Order Linear 
Differential Equations 


Ordinary differential equations (ODEs) may be divided into two classes: linear equations and nonlinear equations. 
The latter have a richer mathematical structure than linear equations and are generally more difficult to solve in 
closed form. As we saw in Chapters 1 and 2, first order differential equations have an unusual combination of 
features: they are important in applications and some of them can be solved implicitly or even explicitly. Unfortu- 
nately, the techniques applicable for solving second order (and higher order) nonlinear ODEs are not available at an 
undergraduate level. Therefore, we consider only linear equations. 

This chapter is an introduction to the elegant theory of second and higher order linear differential equations. 
There are two main reasons for concentrating on equations of the second order. First, they have important ap- 
plications in mechanics, electric circuit theory, and physics. First and second derivatives have well-understood 
interpretations: velocity for order one and acceleration for order two. On the other hand, derivatives of order more 
than two are usually not easy to interpret conceptually. Second, their theory resembles that of linear differential 
equations of any order, using a constant interplay of ideas from calculus and analysis. Therefore, we often formulate 
the definitions and theorems only for second order differential equations, leaving generalizations for higher order 
differential equations to the reader. Furthermore, a substantial part of the theory of higher order linear differential 
equations is understandable at a fairly elementary mathematical level. 

The text is very selective in presenting general statements about linear n-th order differential equations because 
it has a different goal. The theorems are usually proved only for second order equations. For complete proofs and 
deeper knowledge about this subject, the curious reader is advised to consult more advanced books (see, for instance, 


[9]). 
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188 Chapter 4. Second and Higher Order Linear Differential Equations 


4.1 Second and Higher Order Differential Equations 


Recall from calculus that derivatives of functions u(a) and y(t) are denoted as u/(a) or du/dx and y’(t) = dy/dt 
or y, respectively. Newton’s dot notation (y) is usually used to represent the derivative with respect to time. The 
notation x or t stands for the independent variable and will be widely used in this chapter. Higher order derivatives 
have similar notation; for example, f” or d*f/da? denotes the second derivatives. 

A second order differential equation in the normal form is as follows: 


dy =F(a,y ay or y" =F (a,y,y') (4.1.1) 
dxr2 9 dx 4 y] = 


where F(x, y, p) is some given function of three variables. If the function F'(, y, p) is linear in variables y and p (that 
is, F(x, ay: + by2, p) = aF (a, y1,p) +b F (2, yo, p) for any constants a, b, and similar for variable p), then Eq. (4.1.1) 
is called linear. For example, the equation y” = cos x +5y? +2 (y’)” is a second order nonlinear differential equation, 
while the equation y” = (cos) y is a linear one. 

A function y = ¢(2) is a solution of (4.1.1) in some interval a < x < b (perhaps infinite), having derivatives up 
to the second order throughout the interval, if ¢(a) satisfies the differential equation identically in the interval (a, b), 
that is, 


da? dr 
For many of the differential equations to be considered, it will be found that solutions of Eq. (4.1.1) can be 
included in one formula, either explicit 


2 
ee F (x.0(2 =) for all x € (a, 0). 


Y= o(a, C1, C2) 
or implicit 
B(x, y, Ci, C2) = 0, 
where C and C2 are arbitrary constants. Such a solution is referred to as the general solution of the differential 
equation of the second order in either explicit or implicit form. Choosing specific values of the constants C; and 
C2, we obtain a particular solution of Eq. (4.1.1). All solutions can be so found except, possibly, singular and/or 
equilibrium solutions. 

Second order differential equations are widely used in science and engineering to model real world problems. The 
most famous second order differential equation is Newton’s second law of motion, mij = F (t,y, y), which describes 
a one-dimensional motion of a particle of mass m moving under the influence of a force F’. In this equation, y = y(¢) 
is the position of a particle at a time t, y = dy/dt is its velocity, 7 = d?y/ dt? is its acceleration, and F is the total 
force acting on the particle. 

For given two numbers yo and yi, we impose the initial conditions on y(«) in the form 


y(to) = yo, -y'(z0) = y- (4.1.2) 


The differential equation (4.1.1) with the initial conditions (4.1.2) is called the initial value problem (IVP for 
short) or the Cauchy problem. Now we can state the following theorem, which is a direct extension of Picard’s 
Theorem 1.3, page 23, for first order equations. 


Theorem 4.1: Suppose that F, OF /Oy, and OF /Oy’ are continuous in a closed 3-dimensional domain Q in 
xyy’-space, and the point (9, yo, yo) belongs to 2. Then the initial value problem (4.1.1), (4.1.2) has a unique 
solution y = ¢(a) on an z-interval in 2 containing zo. 


The general linear differential equation of the second order is an equation that can be written as 


d? d 
a2(x) 55 + a(x)" + ao(x)y(x) = g(a). (4.1.3) 
The given functions a9(x), ai (x), @2(a), and g(x) are independent of variable y. When Eq. (4.1.1) cannot be written 
in the form (4.1.3), then it is called nonlinear. If the function a2(x) in Eq. (4.1.3) has no zeroes on some interval, 
then we can divide both sides of the equation by a2(x) to obtain its normalized form: 


y" (x) + p(x) y'(a) + a(x) y(@) = f(a). (4.1.4) 
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The points where the coefficients of Eq. (4.1.4) are discontinuous or undefined are called the singular points of 
the equation. These points are usually not used in the initial conditions except some cases (see Exercise 10). For 
example, the equation («7 — 1) y’ + y = 1 has two singular points z = 1 and x = —1 that must be excluded from 
consideration. If in opposite, the initial condition y(1) = yo is imposed, then the differential equation dictates that 
yo = 1; otherwise, it has no solution. 


Theorem 4.2: Let p(x), g(x), and f(a) be continuous functions on an open interval a < x < b. Then, for each 
xo € (a,b), the initial value problem 


Y pay ely = f(2), ) ytol=vo, 4 (Go) =n 


has a unique solution for arbitrary specified real numbers yo, 1. 


Since every linear second order differential equation can be reduced to the first order (nonlinear) Riccati 
equation (see Problem 17), Theorem 4.2 follows from Picard’s theorem (page 23). | 
Equation (4.1.3) is a particular case of the general linear differential equation of the n-th order 


n n-1 


An (x) on + An—1(2x) 


y 


qgnct + 11+ + Go(a)y(a) = g(a). (4.1.5) 


If g(a) is identically zero, equations (4.1.3) or (4.1.5) are said to be homogeneous (the accent is on the syllable 
“ge”): if g(a) is not identically zero, equations (4.1.3) and (4.1.5) are called nonhomogeneous (or inhomogeneous 
or driven) and the function g(x) is referred to as the nonhomogeneous term, which is variously also called the input 
function or forcing function. The functions ao(x), a1(@),...,@n(x) are called the coefficients of differential equation 
(4.1.5). If all the coefficients of Eq. (4.1.5) are constants, then we speak of this equation as a linear, n-th order 
differential equation with constant coefficients. 

The existence and uniqueness theorem for the initial value problem for linear differential equations of the form 
(4.1.5) is valid. This theorem, which we state below, guarantees the existence of only one solution for the initial 
value problem. Also, Eq. (4.1.5) does not have a singular solution (i.e., a solution not obtained from the general 
solution). Therefore, the initial value problem for a linear equation always has a unique solution. 


Theorem 4.3: Let functions ao(), ai(%), ..., @n(x) and f(x) be defined and continuous on the closed interval 
(n 


a<axz<bwith a(x) £0 for x € [a,b]. Let x be such that a < xp < band let yo, yp, --., Yo = he any constant. 
Then in the closed interval [a,b], there exists a unique solution y(x) satisfying the initial value problem: 


an(x)y™ + an—1(x)y") +--+ + a1(2)y! + ao(x)y = 9(2), 


y(xo) = yo, y' (a0) = yh, 2-3 YP (a0) = of”. 


Is a similar theorem valid for boundary value problems? No, as shown in §10.1, where a boundary value problem 
can have many, one, or no solutions. 

According to Theorem 4.2, any initial value problem for the second order differential equation (4.1.4) has a unique 
solution in some interval |a, b| (where a < b). In particular, if a solution and its derivative vanish at some point in 
the interval |a,b|, then such a solution is identically zero. Therefore, if a nontrivial (not identically zero) solution 
y(x) is zero at some point xo € |a,b|, then y’(zo) 4 0, and hence the solution changes its sign at 29 when it goes 
through it (which means that the graph of y(x) crosses the abscissa at x0). 


Example 4.1.1: Let us consider the initial value problem 


2_A)y" + (ex+2)y't+a°y=sinz,  y(xo)=yo, y'(t0) = yp. 


x(a 
To determine the interval of existence, we divide both sides of the differential equation by «(a? — 4) = x(a —2)(a +2) 
to obtain y” + p(x)y’ + q(a)y = f(x) or 
ji 1 j x _ sing 1 


y Tago) 
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The coefficient p(x) = 1/a(x — 2) is not defined at two singular points x = 0 and x = 2. Similarly, the functions 
q(x) = «/(z? — 4) and f(x) = S22. GIDE fail to be continuous at singular points x = +2. So we don’t want to 
choose the initial point 2 as 0 or +2. For example, if x9 = 1, the given initial value problem has a unique solution 
in the open interval (0,2). If a9 = 3, the given initial value problem has a unique solution in the interval 2 < x < oo. 


The behavior of a solution at these singularities requires additional analysis. 


4.1.1 Linear Operators 


With a function y = y(x) having two derivatives, we associate another function, which we denote (Ly)(x) (or L[y] 
or simply Ly), by the relation 


(Ly) (x) = aa(a) y" (a) + a1(x)y'(@) + ao(x)y(@), (4.1.6) 


where a(x), a1(x), and ag(x) are given functions, and a2(x) 4 0. In mathematical terminology, L is an operator®® 
that operates on functions; that is, there is a prescribed recipe for associating with each function y(x) a new function 
(Ly)(x). An operator L assigns to each function y(x) having two derivatives a new function called (Ly)(«). Therefore, 
the concept of an operator coincides with the concept of a “function of a function.” 


Definition 4.1: By an operator we mean a transformation that maps a function into another function. A 


linear operator L is an operator such that Liaf + bg] = aL f +bLg for any functions f, g and any constants a, b. 


In other words, a linear operator is an operator that satisfies the following two properties: 

Property 1: L{cy|] = cL{y|, for any constant c. 

Property 2: L{y + y2] = L[yi] + Lye] - 

All other operators are nonlinear. another function. Analysis of operators, their properties, and corresponding 
techniques are called operator methods. 


Example 4.1.2: The operator L[y] = y” + xy is a linear operator. If y(x) = sing, then 
(Ly)(x) = (sinz)” + 2? sina = (x? — 1)sinz, 
and if y(x) = x*, then 
(Ly)(x) = (a*)” + a?24 = 1227 + 2°. 
Example 4.1.3: An operator 
Ly=y?+y" 


is a nonlinear differential operator because for any constant c we have 


Licey] = (cy)? + cy” = ey? + cy" 4 cLy = cy? +y"). 


Differentiation gives us an example of a linear operator. Let D denote differentiation with respect to the inde- 
pendent variable (x in our case), that is, 


d d 
Dy =y' = > ( Se) p=). 


Sometimes we may use a subscript (D,.) to emphasize the differentiation with respect to x. Then D is a linear operator 
transforming a function y(«) (assumed differentiable) into its derivative y’(«). For example, 


D(z) = 327, De?” = 2e?*, D(cosx) = —sinz. 


Applying D twice, we obtain the second derivative D(Dy) = Dy’ = y”. We simply write D(Dy) = Dy, so that 
lg d? 
Dy=y', Dy=y", Dey=yl",-- ("= D? canals 
x 


Note that D° is the identity operator (which we omit). With this in mind, the following definition is natural. 


°8The word “operator” was introduced in mathematics by the famous Polish mathematician Stefan Banach (1892-1945) who published 
in 1932 the first textbook Théorie des opérations linéaires (Theory of Linear Operations) on operator theory. 
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Definition 4.2: The expression 


L[x, D] = an(x)D™ + @n—1(x)D"—* +--+ + a1(x)D + ao(z) (4.1.7) 


is called an n-th order linear differential operator, where ag(x), ai(x), ..., @n(x) are real-valued functions. 


Then homogeneous and nonhomogeneous equations can be rewritten as L[x,D]jy = 0 and L[z,Djy = f, re 
spectively. Note that we write the operator in the form where the coefficient a,(2) precedes the derivative op- 
erator. For instance, 2D operates on a function f as x7Df = 2? f’(x), while Dx? produces a different output: 
Da? f(x) = 2x f(x) +a? f(x). If all coefficients in Eq. (4.1.7) are constants, we drop x and denote such an operator as 
L{D]. It is a custom, which we follow throughout the book, to drop the identity operator D° (that is, D°y(x) = y(x) 
for any function y(x)) in expressions containing the derivative operator D. 


Theorem 4.4: [Principle of Superposition for homogeneous equations] If each of the functions y1, y2, 
..-; Ym is a solution to the same linear homogeneous differential equation, 


an(x)y™ + an—1(x)y"Y +--+ +a1(2)y’ + ao(z)y = 0, (4.1.8) 
then for every choice of the constants c), C2, ..., Cm the linear combination 
Y= C1Y1 + Coya2 + +++ + CmYm 


is also a solution of Eq. (4.1.8). 


Since the left-hand side of this equation is a linear operator, we have 
Ly = Licey + coya Fees + CmYm| = ci Lyi + coLy2 apes eb CmLYm =0 


because Ly; = 0, 7 =1,2,...,m. 


4.1.2 Exact Equations and Integrating Factors 


We define a certain class of differential equations that can be reduced to lower order equations by simple integration. 


Definition 4.3: An exact differential equation is a derivative of an equation of lower order. If this equation 


is written in the operator form Ay = f, then it is exact if and only if there exists a differential operator B such 
that A = DB, where D stands for the derivative operator. 


d d 
In particular, a linear exact second order differential equation in [P(x)y’] + ae (Q(x)y] = f(x), where P(x) and 
x x 


Q(z«) are given continuously differentiable functions, can be integrated immediately: 
P(x) y' + Q(2)y = / f(a) de +C. 


Since this equation is a linear first order differential equation, the function y(a) can be determined in an explicit 
form (see §2.5). 


Theorem 4.5: A linear differential equation a2(z) y’’ +a1(x) y’ +aoy = f(x) is exact if and only if its coefficients 
satisfy the equation 
Gv) G5 (2) + ag(x). (4.1.9) 


For the equation az y” +a, y' + aoy = f(x) to be exact, there should exist functions P(x) and Q(z) 
such that 
a(x) = P’, ay =P'+Q, ay = Q(z). 


By eliminating Q, we obtain the necessary and sufficient condition (4.1.9). Oo 
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We may try to reduce the differential equation L[x,D]y = f(x), where L[x,D] = a2(x)D? + ai(x)D + ao, to an 
exact one by multiplying it by an appropriate integrating factor p(x): 
pagy” + wary’ + pagy = pf. 
Using the condition (4.1.9), we conclude that y should satisfy the following equation 
(1a1)' = (uaz)” + pao. 
Reducing parentheses, we see that yz is a solution of the so-called adjoint equation 
L'n = 0, where L' =D7a9 — Day + ao. 
Example 4.1.4: Find the general solution of the equation 
zy” + (2+ a7)y' + 32y = 0. 
Solution. An integrating factor w = p(x) is a solution of the adjoint equation xy” — x?! + ap = 0, which has a 
solution pp = x. So, after multiplication by uw, we obtain an exact equation re [27y/' + xy] = 0, or after integration, 


we get ry! + xy = C,. Division by x? yields y’ + xy = C,/x?. Upon multiplication by an integrating factor, we 
get ly ail = C\2~? e*’/?, Next integration gives us the general solution: 


yer /2 =, fare" Par+ Cy. 


The definition of exactness can be extended for nonlinear equations in a more general form: 


bs ai(e,u)(y)'| o” +d / by(@,w)(u')? = F(a). (4.1.10) 
i=0 j=0 

‘ m / 
For the first set of terms, we use the relation uv’ = (uv)! — vu’. Letting u = a; and v’ = (y’)'y” = (4 (y/)'*7) ; 


we reduce the above equation to the following: 


* ai(x,y(yytt\) Sh (y')#! fda; , aa; ,) a 
(pen) ae E at By | +2 ew) = f(z). 


i=0 i=0 


Since the first term is exact, the remainder must also be exact; but the first order exact equation is necessarily linear 
in y’. Hence, the terms of higher degree in y’ must vanish. Collecting terms of order 0 and 1 and equating them to 


zero, we get bo + diy’ — se y’ =0. Using the exactness criterion (2.3.5), page 74, we obtain 
x 
0 Oag Obo 
— |b, -— } = —. 4.1.11 
Ox ( . | Oy ( ) 


The necessary and sufficient conditions of exactness for Eq. (4.1.10) consist of the exactness condition (4.1.11) and 


the conditions for the vanishing of higher powers: 
1 Oag-1 1 Oax_2 


ko Ox k-1 Oy’ 


where we let a; = 0 and b; = 0 for i > n and j > m, respectively. 


by 


2<k<max(m,n+ 2), (4.1.12) 


Example 4.1.5: Let us consider the equation 
(a?y + 2a? y')y” + y? + ayy’ + 427 (y')? = 2a, 


which is of the form (4.1.10). Here ag = x7y, a, = 273, n = 1 and bo = y’, bi = 4zry, bo = 4x7, m = 2. It is 
easily checked that conditions (4.1.11), (4.1.12) are satisfied. We have f(x?y) y da = x?yy' — [(a?y! + 2zy)y’ da, 
J (a?)(2y'y" dz) = a3 (y')? — 3 f 2?(y')? dx, and fy? dx = y?x — f 2yy'xdx so that, upon multiplying by dz and 
integrating, the given equation is reduced to 


xyy’ + 23 (y’)? + xy? = [= dg=27+C. 


4.1. Second and Higher Order Differential Equations 193 


Let us consider the general second order differential equation in normal form: 
R(x, y,y')y" + S(x,y, y") =0. 
Introducing a new variable p = y’ = dy/dz, we rewrite the given equation in differential form: 
R(x, y,p) dp + S(x,y, p) dx = 0. (4.1.13) 


Treating x, y as constants, we define the quadrature 


[Re y,p) dp = y(2x, y, p). 


Then, using the equation dy(z,y,p) = R(z,y,p)dp + vx(z,y,p) dx + py(x,y,p) dy, we replace R(z,y,p)dp = 
dy — pz dx — py dy in Eq. (4.1.13) to obtain 


This equation is exact if and only if the first order differential equation 
[S(x, y,p) — px] dx — py dy = 0 (4.1.15) 


is exact. If this is the case, there exists a potential function ~(a,y) such that dw = [S(x,y,p) —y2] dx — gy dy. 
Therefore, the first integral of Eq. (4.1.14) has the form 


p(x,y,p) + ¥(z,y)=Ci with p=y/, (4.1.16) 


which is a first order differential equation. So we reduced the second order nonlinear differential equation (4.1.13) 
to a first order equation, which may be solved using one of the known methods (see Chapter 2). 


Example 4.1.6: Integrate the nonlinear differential equation 
(3y? — 1) y" + 6y(y')? + y' — 3y?y’ +1-2? =0. 
Solution. In Eq. (4.1.13), we have R(x, y,p) = 3y? — 1 and y(z, y, p) = (3y? — 1)p, and hence Eq. (4.1.15) becomes 
[6yp* +p—3y’p+1-— 327] da — 6yp dy = 0. 
Replacing p by dy/da, we obtain the equation 
(1 — 3y*) dy + (1 — 327) da = 0, 


which is exact and has the potential function (2, y) = y — y? + x — x3. From Eq. (4.1.16), we get the first order 
equation 
(39? —1)y +y—yt+e-—27=C). 


The change of variable u = y? — y transforms the left-hand side into u’ — u + a — x?, which is linear. The general 
integral of the given equation is found to be 


y—y= Ce * —C, —5—S2— 32" — 2", 


4.1.3. Change of Variables 


We introduce some transformations that reduce the homogeneous second order equation 
y" + p(x)y’ + a(z)y = 0 (4.1.17) 


to other canonical forms. Let us start with the Bernoulli substitution y = y(x)v(x), where y(a) is some known 
nonvanishing function. The transformed equation for v is then 


y(x)u" + (2'(x) + pp)u’ + (p” + py’ + ayp)u = 0. 
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This new equation may be integrated for some choice of y(xz). However, there is no rule for such a fortunate choice. 
The substitution 


y = v(2) exp { —5 [ ar} (4.1.18) 


0 


reduces Kq. (4.1.17) to the form v” + (q— p?/4—p’/2) v = 0, which does not contain the first derivative of an 
unknown function. If g — p?/4 — p’/2 is equal to a constant divided by x”, the equation for v can be integrated 
explicitly. If this expression is a constant, then it will be shown how to integrate it in §4.5. For a second order 
constant coefficient differential equation ay’ + by’ + cy = 0, substitution 


y =e 92/(24) wy(z) (4.1.19) 


reduces the given equation to the following one: 
1 2 
vl += (« _ x) e=0. (4.1.20) 
Ga a 


Theorem 4.6: The second order homogeneous linear differential equation 


y" — g(x) y =0, (4.1.21) 


where g(a) is a continuous function for x > 0 and q(x) > qo > 0, has a solution that is unbounded and another 
one that is bounded. 


The given equation (4.1.21) with positive initial conditions y(0) = yo > 0 and y/(0) = y; > 0 has the 
unbounded solution y(a) that also has an unbounded derivative because its graph is concave up. Indeed, integration 
of Eq. (4.1.21) yields 


x 


(0) -v'0) = [ as)uls)ds>a f uses = v@) > +a f uls)as. 
0 0 0 
Therefore, y’(x) is positive and the solution is growing. Taking the lower bound, we get 


y'(a) > yi + xqoyo > 1 => y(x) > yt t+ yo. 


Using this unbounded solution, we construct a new bounded solution 


ua) * Cy(x) — y(e) f° y(s)Pas (c= [e<[ ays). 


Since C is finite, we have u(0) = Cy(0) = Cyo > 0 and u(x) is always positive by our construction. At infinity, u(x) 
tends to zero as the following limit shows, evaluated with the aid of l’Hopital’s rule, 


. © ds / 1 . ~y-2(z) 4 
a aes ( | Pal a5) = ae =e 


With a little pencil pushing, it can be checked that u(x) is also a solution of Eq. (4.1.21). oO 
The substitution y’ = yw reduces the equation y” + Q(2)y = 0 to the Riccati equation 


w +w* + Q(z) =0 


BInGE / ” Pl 2 2 
d = = = 
wit (¥)\_wyoy _ Oy = yw)" _ a Q(z). 
dx \y y? y? 
In §2.6.2 we discussed the reduction of the Riccati equation to a second order linear differential equation. Now we 


consider the inverse procedure. Namely, the second order linear differential equation 


ag(a)y" + ar(a)y! + ao(x)y = 0 
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ie Sexe { [ over ach . 


where v(z) is any solution of the Riccati equation 


C2) + (28 o'(#) 


°F @)aa(a) 


has a solution 


on an interval upon which a2(x) # 0 and o(x) 4 0. 


Example 4.1.7: Solve the initial value problem: 


y’+2y'/e+y=0, ylj=1, y/0)=0. 


ere exp {- f° +} ave iar 


Then its derivatives are P r ; 
, Uv v ny Uv 2v' Qu 


¥ og Y x | 
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(4.1.22) 


(4.1.23) 


This yields the equation for v: v”+v = 0. It is not hard to find its solution (consult §4.5): v(a) = C; cosa+C> sina, 


where C; and C% are arbitrary constants. Thus, the general solution of the given equation becomes 


_ C, cos” + C2 sin x 


y(x) 


x 


After substituting this expression into the initial conditions, we get 


_ cosa (cos 1 — sin 1) + sina (cos 1 + sin 1) 


y(x) = 


x 


Problems| In all problems, D denotes the derivative operator, that is, Dy = y’; its powers are defined recursively: 
y=y y 


D” = D(D"—'), n = 1,2,..., and D® is the identical operator, which we usually drop. 


1. Classify the differential equation as being either linear or nonlinear. Furthermore, classify the linear ones as being 


homogeneous or nonhomogeneous, with constant coefficients or with variable coefficients, and state the order. 


(a) y"+2°y=0; (b) y”+ay=sinaz; (ce) y”+yy'=1; 
(d) yy 4y! = 207 4+3; (ce) y+ yy =1; (f)  y"” +a2y =cosha; 
(g) (cosx)y’+y e® =sinhz; (h) y”+a2y=coshz; (i) y-y =1; 
(ij) (sinh.x)(y')? + 3y = 0; (k) 5y’—ay =0; (I) y?y'/? =sing; 
(m)  2y"4+3y' +4a?y =1; (n) y/”-1=0; (0) 2? y” —y=sin’ 2. 


2. For each of the following differential equations, state the order of the equation. 


(a) yl=2?4+y; (b) y”+ay"-y=sing; (c) (y’)?+yy'ey’ =Ing; 


(d) sin(y”)+ yy =1;  (e) (sinha) (y’)? +y"=2y; (fF) yy" =1; 
(g) y® —ay =0; (hb) (y’”)? +41 =sina; (i) y”+y?% =2. 


3. Using the symbol D = d/dt, rewrite the given differential equation in the operator form. 


(a) yy” +4y'’+y=0; (b) y"—5y”+y'-y=0; (c) dy” — 3y’ — 24y = 0; 
(d) 3y% —2y”+y'=0; (e) yl —(sinz)y”+y=2; (f) Ty +8y'” —9y" =0. 


4. Write the differential equation corresponding to the given operator. 
(a) D?—-2D477?+1; (b) (D+1)?; (c) D?+3D—10; 
(d) 4D*—8p*—7D+6; (e) 3D?+2D+1; (f) (D+1)%. 


5. In each of the following initial value problems, determine, without solving the problem, the longest interval in which 


the solution is certain to exist. 

(a) (©—3)y"+(Ina)y=2?, y(1)=1, y/(1) =2; 

(b) y+ (tanax)y’ + (cotx)y=0, y(r/4) =1, y'(n/4) = 0; 
(c) (a? +1)y"+(e—-Ly'+y=0, y(0)=0, y(0) =1; 
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19. 


20. 


Chapter 4. Second and Higher Order Linear Differential Equations 


(d) sy” + 22?y’+ysine =sinhe, y(0)=1, y/(0)=1; 

(e) sinazy” +2y'’+7y=1, y(1)=1, y'/(1) =0; 

(f) y"—(a@- Dy +a?y=tane, y(0)=0, y/(0) =1; 

(g) (x? —4)y"” +ay'+(x+2)y=0, y(0)=1, y/(0) =-1. 


. Evaluate (the symbol D stands for the derivative d/dx) the following expressions. 


(a) (—2)(a2 +22); (b) (D—1)(D+2)(sin2x); (c) (D?—5D?+11D—1)(a? — 2); 
(d) (D—2)(2?—2z); (e) (D—2)(D*?+4) cos2z; (f) (D? —D* +2D) sin3z. 


. A particle moves along the abscissa so that its instantaneous acceleration is given as a function of time t by 2 — 32”. 


At times t = 0 and t = 2, the particle is located at x = 0 and x = —10, respectively. Set up a differential equation 
and associated conditions describing the motion. Is the problem an initial or boundary value problem? Determine the 
position and velocity of the particle at t = 5. 


. Let y = ¥i(x) and y = Yo(x) be two solutions of the differential equation y” = 2? + y?. Is y = Yi(x) + Yo(a) also a 


solution? 
d?y da dz \* a 
(a) Show that aa ~ aya (=) . Hint: Differentiate both sides of dy/dx = 1/(da/dy) with respect to x. 


3 
(b) Use the result in (a) to solve the differential equation a +2 (4) =0. 


Show that y1 = x and y2 = e” are solutions of the linear differential equation (x — 1)y” — xy’ + y = 0. 


Find the general solution 


d 3 dy 1d du 
ss —)=0: b -— —}]=0. 
(a) dx (« *) pee) r dr (« *) e 


2 
Replace r in the Bessel equation aad + ue tv?u=O0byr=e’. 


Show that a°/9 and («?/? + 1)?/9 are solutions of the nonlinear differential equation (y’)? — xy = 0 on (0,00). Is the 
sum of these functions a solution? 


Show that a nontrivial (not identically zero) solution of the second order linear differential equation y” +p(x)y'+q(x)y = 
0, x € |a, b|, with continuous coefficients p(x) and q(x), cannot vanish at an infinite number of points on any subinterval 
[a, 8] C Ja, b]. 


Given two differential equations 


Prove the Green formula 


Hint: Derive the Lagrange identity:  [P(x)(uv’ — u'v)]' = Q(x)u(x) — f(x)v(2). 


Using the substitution y = y(a)u(x), transfer the differential equation a? y” — 4a? y’ + (a? +1) y = 0 to the equation 
without wu’. 


Show that the change of the dependent variable u = P(x)y’/y transfers the self-adjoint differential equation (P(ax)y’)’ + 


2 
Q(x)y = 0 into the Riccati equation u’ + oa + Q(x) = 0. 

1 /25k x 
Show that for any constant C, the function y(x) = ¢ + = 7) is a solution to the nonlinear differential 


equation y" +key *=0 (k>0). 


Using the substitution y’ = uy, show that any linear constant coefficient differential equation ay” + by’ + cy = 0 has an 
explicit solution. 


For a given second order differential operator LD = a,D? + a,D + ao, the operator L’ = D?a, — aiD + ao is called the 
adjoint operator to L. Find an adjoint operator to the given one. 


(a) aD? +2°D+1; (b) 2D? +a2D—1; (c) D?+22D; (d) (1—2”)D* — 22D +3. 


4.1. 
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Find the adjoint of the given differential equation (where a is a real number and n is a positive integer). 


(a) y’ -—y'’-y=0, Fibonacci equation; (b) (1 x”) y” —2xy'+n(n4+1)y=0, Legendre’s equation; 

(c) y’-—ay=0, Airy’s equation; (d) (a? 4a) y" + xy’ n*y =0, Dickson’s equation. 
Solve the following linear exact equations. 

(a) yy" + 2ay’ + 2y = 0; (b) ay" + (sina)y’ + (cosx)y = 0; 

(c) yl" + 2a7y' + dry = 2a; (d) (1-2? )y" + (1—a)y' +y =1- 22; 

(e) y+ 4ary' + (2+ 427)y = 0; (f) "4 tay! 4 ay wjy=0; 

(zg) y +a2y' + 2xy = 2a; (bh) y” In(a? +1) 4+ +t y +y ao = 0; 

(i) avy” +a7y' + 2ay =0; (j) y + (sin x)y’ + (cos x)y = cos 7; 

(k) y"+y'cotz—yese?x=cosz; (1) y’elne4+2y'—y/r#=1. 


Solve the following nonlinear exact equations. 


(a) xy” + (Gry? + Ly’ + 2y? +1 =0; (b) (l+y)cy” +yy'— ay’)? +y' =(Lt+y)?e sing; 
(c) yy” sine + [y’sine+ycosz]y’=cosxz; (d)  (xcosy+sin x)y" ~ 2(y! ? sin y + 2(cosy +cosx)y’ = ysinaz; 
(e) (1—y)y" — (y')? = 0; (f) (cosy —ysiny)y” — (y’)?(2siny + ycosy) = sina. 


Find an integrating factor that reduces each given differential equation to an exact equation and use it to determine 
the equation’s general solution. 

(a) + ey — ae y= (b) Qe tay" + (10 +a + 2?)y! = (25 — 6a)y; 
(c) y+ “- wry y = 0; (d) (a? —a)y” + (22? + 4x — 3)y’ + 82y = 1; 

(e) aa y" a5 Bethy’ 4+ 4% = 3a; (f) (2sinxz — cosxz)y” + (7sinz + 4cosx)y’ + 10ycosx = 0; 

(g) o t2ty teehee (h) yt Qet5)y! + (e+ Bye. 
Use the hit ae method to reduce the second order (pendulum) equation @$ = g siny to the first order 
equation. Note: the reduced first order equation cannot be solved in elementary functions. 


By substitution, reduce the coefficient of y in y” — x” y = 0 to negative unity. 


Differential equations may have stationary solutions, also called equilibrium solutions. Find all equilibrium solutions 
of the given differential equations. 


(a) j+y=y"; (b) j+4y =8; (c) 7+ 4y =0; (d) j+y=1. 


Formulate a differential equation governing the motion of an object with mass m = 1 kg that stretches a vertical spring 
6cm when attached and experiences a resistive forth whose magnitude is one-sixteenth of the object’s speed. 


Let u be a function of the variables x and y. Show that the operator A defined by 


Au= [ a [ dnu(é,n) V@— 2? + — a 


is a linear operator. 


Derive a differential equation whose solution is a family of circles (x — a)? + (y— b)? = 


What differential equation does the general solution (ax + b)y = C1 + C2 xe%*/° satisfy? 


Show that the change of the dependent variable y’ = uy transfers the differential equation y” + p(x) y’ + q(x) y = 0 into 


the Riccati equation u’ + u? + p(x)ut q(x) = 0. 


Find a necessary condition for a differential operator L = a2D? + a,D + ao to be self-adjoint. 


By changing the independent variable x = x(t), rewrite the differential equation y” + p(x)y’ + g(x)y = f(x) in new 
variables, y and t. 


In the differential equation y” + p(x)y’ + g(x)y = f(x), make a change of independent variable, x = x(t), so that the 
equation in the new variable does not contain the first derivative. 


Prove that the second order variable coefficient differential equation y” +p(x) y’+q(x) y = 0 sometimes could be reduced 
to a constant coefficient equation by setting t = w(x) if W(x) = [vee dx, where k £ 0 is a constant. Note that 


this is only a necessary condition. Derive the sufficient condition on the coefficients p(x) and q(x) under which the 
transformation t = 7(x) reduces the given equation to a constant coefficient differential equation. 
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4.2 Linear Independence and Wronskians 
For a finite set of functions fi (x), fo(x), ..., fm(x), its linear combination is defined as 
ay f(x) + O2 f2(x) +... + Om fm(2), 


where a1, Q2, ..., @m are some constants. We begin this section with a very important definition: 


Definition 4.4: A set of m functions fi, fo,..., fm, each defined and continuous on the interval |a, b|, is said 
to be linearly dependent on |a, | if there exist constants a1, Q2,...,Qm, not all of them zero, such that 


a1 fi(a) + ao fe(e) +--+ +Omfm(z) = 0, z € |a, Ql, 


for every x in the interval |a,b|. Otherwise, the functions fi, fo,..., fm are said to be linearly independent 
on this interval. 


A set of functions fi, fo, ..., fm is linearly dependent on an interval if at least one of these functions can be 
expressed as a linear combination of the remaining functions. Two functions f1(2) and f2(z) are linearly dependent 
if and only if there exists a constant k such that fi(2) = k fo(a). 

The interval on which functions are defined plays a crucial role in this definition. The set of functions can be 
linearly independent on some interval, but can become dependent on another one (see Example 4.2.3). 


Example 4.2.1: The n (n > 2) functions fi(x) =1= 2°, fo(x) =2,..., fn(x) =2"~" are linearly independent on 
the interval (—oo, co) (and on any interval). Indeed, the relation 


ay-ltaor+:--:+an,rz”'=0 


fails to be valid because a polynomial cannot be identically zero. 


Example 4.2.2: Any two of the four functions 
fi(z) =e", fo(x) =e, f(z) =sinha, f4(x) = coshe 


are linearly independent, but any three of them are linearly dependent. The last statement follows from the formulas 


sinh x = (e* +e"). 


Nle 


(e* - e*) , cosha = 
The equation 
aye” +age" =0 
with nonzero constants a, and a2 cannot be true for all x because, after multiplying both sides by e”, we obtain 
22 _ 
aye" +a9 = 0. 


The last equation is valid only for « = 4 In (-22). but not for all 2 € (—oo, 00). 


Example 4.2.3: Consider two functions f;(%) = x and f2(#) = |x|. They are linearly independent on any interval 
containing zero, but they are linearly dependent on any interval |a, b| when either 0 <a<bora<b<0. 


Example 4.2.4: The functions f;(x) = sin? x, f(x) = cos? x, and f3(x) = 1 are linearly dependent on any finite 
interval. This follows from the identity 


sin? z + cos? zx —1=0. 


Recall that a matrix is a rectangular array of objects or entries, written in rows and columns. The properties 
of square matrices and their determinants are given in §6.2 and §7.2. 
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Definition 4.5: Let f1, fo,..., fm be m functions that together with their first m—1 derivatives are continuous 
on an interval |a,b| (a < b). The Wronskian or the Wronskian determinant of f1, fo,..., fm, evaluated at 
x € |a,b| is denoted by W[fi, fo,..., fm](a) or W(fi, fo,.--,; fm; x) or simply by W(x) and is defined to be the 
determinant 

fi fo 


fi fs 
1 


" 
1 2 


Wf» fay +++9 fm] (@) = det 


iii FEP-D _ 4 


Each of the functions appearing in this determinant is to be evaluated at x € |a, b]. 


For the special case m = 2, the Wronskian®’ takes the form 
fil) fo(x) | / / 
Wifi, r)= = fila x) — fila a). 
[fi fal( ) fi(a) f5(x) fil ) fo( ) fi ) fal ) 
In the practical evaluation of a Wronskian, the following lemma may be very helpful. We leave its proof for the 
reader (see Exercise 15). 
Lemma 4.1: For gg = 1 and arbitrary functions f, 91, ---, gn—1, the Wronskian determinants satisfy the equation 


a (f 9x) _ ¢n dg, 
j,k=0,1,....n—1 j,k=0,1,...,.n—1 


, 


In particular, 


g 


— f2,/ _ £2 
ee ee 


1 
0 


= 5 fg _ £2 
WIf, fg] = det i | =f 
fo f92 
WIf, foi, fg2] = ’ (fg1)' (f92)' = f? W(1, 91, 92]. 
f" (fon) (faa)” 


Example 4.2.5: Let us find the Wronskian for the given functions: fi(#) =a, fo(#) =a cosa, f3(a%) = sina. 
From the definition, we have 


x x cos(x) x sin(x) 
Whi, fa, f3l(z) = det | 1 cos(x) — xsin(x) sin(z)+axcos(z) | = 2°, 
0  -—2sin(a) —acos(x) 2cos(x) — xsin(x) 


which could be verified after tedious calculations without a computer algebra system. On the other hand, using 
Lemma 4.1, we get 


1 cos(z) sin(x) . 
_ 3 ee - _ 3 —sin(z)  cos(x) igs 
Wifi, fa, fal(x) = 2° det : - ee Bea Mee cos(x) —sin(x) 


All computer algebra systems (Maple, Mathematica, Maxima, Sage, SymPy, and MuPad from MATLAB) have a 
dedicated command to calculate a Wronskian: 

with(VectorCalculus): Wronskian([x,x*sin(x) ,x*cos(x)],x) # Maple 

Wronskian[{x,x Sin[x], x Cos[x]},x] (* Mathematica *) 

load(functs)$  wronskian([f1(x), £2(x) , £3(x)],x); /* Maxima, Sage, MuPad, and SymPy */ 


39Wronskian determinants are named after the Polish philosopher Jésef Maria Héené-Wronski (1776-1853). He was born Héené to a 
Czech emigrant, but in 1792 Jésef ran away from home and changed his name; he served in the Russian army (1795-1797) and later 
worked mostly in France. The term “Wronskian” was coined by the Scottish mathematician Thomas Muir (1844-1934) in 1882. 
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Theorem 4.7: Let fi, fo, ..., fm be m functions that together with their first m—1 derivatives are continuous 
on an open interval a < x < b. If their Wronskian W[f1, fo,..., fm|(o) is not equal to zero at some point 
xo € (a,b), then these functions f1, fo,..-, fm are linearly independent on (a,b). Alternatively, if fi, fo,..-, fm 
are linearly dependent and they have m — 1 first derivatives on the open interval (a,b), then their Wronskian 
W (fi, fe,---; fm|(a) = 0 for every x in (a, b). 


We prove this theorem only for the case m = 2 by contradiction. Let the Wronskian of two functions 
fi and fg be not zero, and suppose the contrary is true, namely, functions fi(a) and f(a) are linearly dependent. 
Then there exist two constants a, and ag, at least one not equal to zero, such that 


a1 fi(x) + a2 fo(x) = 0 
for any x € (a,b). Evaluating the derivative at 29, we obtain 
an fi (wo) + a2 f3(x0) = 0. 


Thus, we have obtained a linear system of algebraic equations with respect to a ,,a@2. The right-hand side of this 
system is zero. It is known that a homogeneous system of algebraic equations has nontrivial (that is, not identically 
zero) solutions if and only if the determinant of the corresponding matrix is zero. The determinant of coefficients 
of the last two equations with respect to a1,a2 is precisely W[f1, fa](xo), which is not zero by the hypothesis. 
Therefore, aj = a2 = 0 and this contradiction proves that functions f(x) and f2(a) are linearly independent on 
(a, b). 

The second part of this theorem follows immediately from the first one. Thus, if functions fi(#) and f2(a) are 
linearly dependent on (a, 6), then f2(a) = 8fi(2) for some constant {. In this case, the Wronskian becomes 


_| file) Bfi(a) 
| Ai@) BF (a) 


=o. 


Wih.fal(e) = | 2 | 
2) 
) 


I 
@ 


Example 4.2.6: Show that the functions f(x) = 2” an 
interval (a,b) that does not contain zero. 
Solution. The Wronskian W() of these two functions, 


g(x) = a” (n 4 m) are linearly independent on any 


is not equal zero at any point x 4 0. So, from Theorem 4.7, it follows that these functions are linearly independent. 
It is also clear that there does not exist a constant k such that «” = ka2™. 


Theorem 4.7 gives us only the necessary condition of linear dependency, but not the sufficient one, as the following 
example shows: linearly independent functions may have an identically zero Wronskian! 


Example 4.2.7: Consider two functions 


x, for x20, _ 0, for x20, 
fle) ={ 0, for «<0, falc) = { x*, for «<0. 


Then W(x) = 0 for all x, but the functions are linearly independent. If this were not true we would have two nonzero 
constants a ; and a2 such that 


a,z*, for x>0, vat a, for «20, 


ay f(x) + 02 f2(x) = { ax’, for x <0, = a2, for «<0, 


Since this relation holds only when constants a, and a2 are both zero, we get a contradiction and the claim about 
linear dependency of functions f; and fo is void. 
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Theorem 4.8: A finite set of linearly independent holomorphic (represented by convergent power series) func- 
tions has a nonzero Wronskian. 


Theorem 4.9: Let y1,y2,...,Yn be the solutions of the n-th order differential homogeneous equation 


y + dns(a)y"") +--+ + ar (a)y! + ao(a)y = 0 


with continuous coefficients ag(a#), ai(a), ..., @n—1(x%), defined on an open interval a < x < b. Then functions 
Y1,Y2;--+;Yn are linearly independent on this interval (a,b) if and only if their Wronskian never vanishes on 
(a,b). Alternatively, these functions y1,y2,...,Yn are linearly dependent on this interval (a,b) if and only if 


W|y1,Y2,-++;Yn|(x) is zero for all x € (a,b). 


We prove the theorem only for the case n = 2. We wish to determine the sufficient conditions in 
order for the solutions y;, y2 to be linearly dependent. The necessary conditions were found in Theorem 4.7. These 
functions will be linearly dependent if we can find constants a; and az to be not all zero, such that 


ayi(x) + a2ye(x) = 0. 
Suppose the Wronskian W[y1, y2|(x) is zero at some point « = zo. Then the system of algebraic equations 
Crys (xo) + Cay2(xo) = 0, Ciyi(wo) + C2yo(xo) = 0 
has a nontrivial solution C; = a, and C) = ag. In fact, the corresponding matrix 
yi(%o0) Yy2(xo) 
¥i(@o) Yo(#o) 
is singular, that is, its determinant is zero. Only in this case the corresponding system of algebraic equations has a 
nontrivial (not identically zero) solution (see §7.2.1). Then the function 
¥ = ary (x) + azyao(x) 
is a solution of the differential equation y” + a,y’ + aoy = 0. Moreover, ¥ also satisfies the initial conditions 


7 dy 
y(xo) = 0, daz = 0. 


However, only the trivial function 7(x) = 0 satisfies the homogeneous linear differential equation and the homoge- 
neous initial conditions (Theorem 4.2 on page 189). 


Theorem 4.10: (Abel) If yi(x) and y2(«) are solutions of the differential equation 
y" + p(x)y' + q(x)y = 0, (4.2.2) 


where p(a) and g(x) are continuous on an open interval (a,b), then the Wronskian W(x) = W[y1, y2|(x) is given 
by 


x 


Wlys, y2](z) = W[y1, y2](zo) exp \-/ p(t) at} : (4.2.3) 


0 


Here po is any point from the interval (a, b). 


Note: The formula (4.2.3) was derived by the greatest Norwegian mathematician Niels Henrik Abel (1802-1829) 
in 1827 for the second order differential equation. In the general case, namely, for the equation y“”) +pn_—1 (x)y"—) + 
--++pi(x)y’ + po(x)y = 0, Joseph Liouville (1809-1882) and Michel Ostrogradski (1801-1861) independently showed 
in 1838 that 


W (x) = W(ao) exp {- a Pn—1(t) at => W' (x) + Pn—-1(4)W (a) = 0. (4.2.4) 
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Each of the functions y;(%) and y2(x) satisfies Eq. (4.2.2). Therefore, 
yi + p(x) + a(x)y1 = 0, yy + p(z)y2 + a(x)y2 = 0. 


Solving this system of algebraic equations with respect to p and q, we obtain 


W W 7 ae LOA 
yy — Yi Ye W'(x) Vis — YiYe 1 a 
p(x = —  _COU — : Ct) = SC Wilyt.y 5 4.2.5 
(2) Y1Ys — Vive W (x) ( yiys — yiye W(x) ivi. val v2 


where W(x) = yiy5 — yj ya is the Wronskian of the functions y1, y2. The first equation can be rewritten in the form 
W'(x) + p(x)W (ax) = 0. (4.2.6) 


This is a separable differential equation of the first order and its solution is given by the formula (4.2.3). Abel’s 
formula shows that the Wronskian for any set of solutions is determined up to a multiplicative constant by the 
differential equation itself. 


Corollary 4.1: If the Wronskian W (a) of solutions of Eq. (4.2.2) is zero at one point 7 = 2 of an interval 
(a, b) where coefficients are continuous, then W (a) = 0 for all x € (a,b). 


Corollary 4.2: If the Wronskian W (2) of solutions of Eq. (4.2.2) at one point « = po is not zero, then it is not 
zero at every point of an interval, where coefficients p(a) and q(a) are continuous. 


Example 4.2.8: Given the differential equation 
y —y=0. 
Find the Wronskian of solutions 
yi(z) =e”, yo(x)=e 7, y3(a) =sinhaz, ya(x) = cosha. 


Solution. We calculate derivatives of these functions to obtain 


yi(x) = y(@), yo(x) = —Yyo(x), y3(a) = cosh x, y4(x) = sinh e, 
nee yee ™, y3 (x) = sinha, yi (x) = cosha, 
yi (x) = y1(2), yo (x) = —y2(z), y3 (x) = coshe, yy (x) = sinh x. 


1 1 01 1 1 01 

1 -1 10/7] _—/1 -1 1 0 

WO)=det}s 4 9 1/=|1 1 01 

1 -1 1 0 1 -1 1 0 

We add the second column to the first one to obtain 

2 1 0 1 1 1 01 
0 -1 1 0 0 -1 1 0 
W (0) = det 2 1 01/7 2 det 1101 
0 -1 1 0 0 -1 1 0 


Since the latter matrix has two equal columns (the first and the fourth), its determinant is zero. Hence, at any 
point, the Wronskian W (a) = W(0) = 0 and the given functions are linearly dependent. To check our answer, we 
type in Mathematica: 

y = {Exp[x], Exp[-x], Sinh[x], Cosh[x]} 

w = fy, Dly, x], Dly, {x,2}], Dly, {x, 3}]} Det [w] 


Since the output is zero, the functions are linearly dependent. 
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The Wronskian unambiguously determines whether functions are linearly dependent or independent if and only 
if these functions are solutions of the same differential equation. But what should we do if we don’t know whether 
the functions are solutions of the same differential equation? If these functions are represented as convergent power 
series, then Theorem 4.8 assures us that evaluating the Wronskian is sufficient. If they are not, there is another 
advanced tool to determine whether or not a set of functions is linearly independent or dependent, called the Gramm 
determinant, but it is out of the scope of this book. 


1. 
2. 
3. 


10. 


. Find the Wronskian of the two solutions of the given differential equation without solving the equation. 


17. 


Show that the system of functions {f1, fo,..., fn} is linearly dependent if one of the functions is identically zero. 


Which polynomials can be expressed as linear combinations of the set of functions {(a — 2), (« — 2)”, (a —2)?}? 


Determine whether the given pair of functions is linearly independent or linearly dependent. 


(a) Aes », (a) = Coe (b) ey a(x) =a? +1; 
(c) fiw) = 2a? + 8a, fa(x) = 2x? - 32; (d)  fi(x) = 2/21, ee )= 2°, : 
(e) fila) =1+ cosa,  fo(x) = cos* a; (f) fi(x)=cosa,  fe(a) =sin (x — “); 
(g) fi(z) =sin ae fo(x) =cosxsinz; (h) fi(x)=tanz, fo(x) = cota; 
(i) fix) =e", f(x) = e**; G) fiz) =e*, fal) =e?"'; 
(k) fila) =e%, fol) =2e*; Q) file) =e*, fox) =1+e* 

. Obtain the Wronskian of the following three functions. 


(a) file)=e*, fale) =me®, and fale) = (20-1) 
(b) fi(x) =e”, fe(x)=e” sinz, and f3(x) =e” cosa. 
(c) file) =a, fe(v)=axe”, and f3(x) =2xre”. 
(d) fi(x) =e" cosa, fo(x) =e?” cosa, and fs(ax) = e®” cosa. 
Suppose the Wronskian W|f,g] of two functions f and g is known; find the Wronskian W|[u,v] of u = af + bg and 
v=af + 6g, where a, b, a, and § are some constants. 
Are the following functions linearly independent or dependent on [1, co)? 
(a) a2? and «2? Ing; (b) 2?Inz and 2? In2z; 
(c) Ina and In(a*); (d) Ine and (Inz)?. 
Are the functions f1(«) = a|z| and f(a) = x? linearly independent or dependent on the following intervals? 
(a) O<ax<l1; (b) -1l<a<0; (c) -l<a<il. 
Determine whether the following three functions are linearly dependent or independent. 
(a) fi(z) = /r+2, fol(x)=VJVe4+22, and  fs(x)=a2—-1,; 
(b) 1, sinz, and cosa; (c) 1, cos*z, and cos3x+3cosz. 
B 


Let a, 6, and y be distinct real numbers. Show that x, x 
positive x-axis. 


, and x” are linearly independent on any subinterval of the 


The Wronskian of two functions is W(x) = z?—1. Are the functions linearly independent or linearly dependent? Why? 


(a) (sinx)y” + (cosx)y’ + tan zy = 0; (b) wy” —(Qx4+1)y’+sin2ry =0; 
(c) ay" +y'+ay =0; (d) (1—2°)y" — 2xy! + 2y = 0; 
(e) ay” —a(a@+1)y'+(a4+1)?y=0; (f) sin(2x)y” — 2y’ + ey = 0. 


. Derive the Abel formula for the self-adjoint operator DP(x)D + Q(«), where D = d/dz. 


Prove that for any real number a and any positive integer n, the functions e®”, xe®”, ..., 2” te are linearl 
y y ef | ? ? y 


independent on any interval. 


For what coefficients of the second order differential equation a2(x)y” + ai(x)y’ + ao(z)y = 0 is the Wronskian a 
constant? 


. Prove Lemma 4.1 on page 199. 


Prove the following claim: 

If two solutions of the differential equation y” + p(x)y’ + q(x)y = 0 on some interval |a, | vanish at a point from |a, db], 
then these solutions are linearly dependent. 

The same claim is valid when their derivatives are zero at a point from |a, |. 


Let f and g be two continuously differentiable functions on some interval |a, b|, a < b, and suppose that g never vanishes 
in it. Prove that if their Wronskian is identically zero on |a, b|, then f and g are linearly dependent. Hint: Differentiate 


f(x)/g(2). 
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4.3. The Fundamental Set of Solutions 
Let us consider a homogeneous linear differential equation of the second order 


y” + p(x)y' + g(a)y = 0 (4.3.1) 


in some interval x € |a,b|. Any two solutions of Eq. (4.3.1) are said to form a fundamental set of solutions of 
this equation if they are linearly independent on the interval |a, b|. This definition can be extended to the n-th order 
differential equation (D = d/dz) 


Lix,Djy=0, where Ly = y™ + an_i(x)y"-) +--+ + a1(x)y! + ao(x)y. (4.3.2) 


Namely, any set of solutions y1, y2,---,Yn is a fundamental set of solutions of Eq. (4.3.2) if they are linearly 
independent on some interval. Note that the number of functions in the fundamental set of solutions coincides with 
the order (the highest derivative in the equation) of the equation. 

According to Theorem 4.9, the functions y, and yg form a fundamental set of solutions of Eq. (4.3.1) if and only 
if their Wronskian W[y1, y2](a) is not zero. 

The family y = y(a, C1, C2) of solutions of Eq. (4.3.1), which depends on two arbitrary constants C; and C2, is 
called the general solution of Eq. (4.3.1) because it encompasses every solution. If we know {y1, yo}, a fundamental 
set of solutions of Eq. (4.3.1), then we can construct the general solution of the homogeneous equation (4.3.1) as 


y(x) = Ciyi(x) + C2y2(z), (4.3.3) 


where C and C2 are arbitrary constants. The function (4.3.3) is also a solution of Eq. (4.3.1) as it follows from the 
Principle of Superposition (Theorem 4.4, page 191). 


Theorem 4.11: Let y1, y2, ...,Yn be n linearly independent solutions (a fundamental set of solutions) of the 
linear differential equation of the n-th order (4.3.2). Let y be any other solution. Then there exist constants 
Cy, Co,...,Cy such that 

y(x) = Cry (a) + Coya(z) +--+ + Cryn(2), (4.3.4) 


As usual, we prove the theorem only for n = 2. Let y(x) be any solution of Eq. (4.3.1). We must show 
that the linear combination C1 y1 (x) + Coy2(x) is equal to y(x) for some choice of coefficients C and C2. Let x be a 
point in the interval (a,b). Since the solutions y; and y are linearly independent, their Wronskian W(y1, y2;x) 4 0 
for any x € (a,b). Consequently W(a9) 4 0. We know that the solution of Eq. (4.3.2) is uniquely defined by its 
initial conditions at any point of interval (a, b), so the values yo = y(%o) and yf = y’(ao) uniquely define the function 
y(a). After differentiation, we obtain the system of algebraic equations 


yo = Ciyi(to) + eae} 
Yo = Cry\(to) + C2 y5(zxo). 


This system has a unique solution because the determinant of the corresponding system of algebraic equations is 
the Wronskian W (a9) 4 0. Thus, the function y is the general solution of Eq. (4.3.2). Note that Theorem 4.11 can 
be proved by mathematical induction (Exercise 9) for any n > 2. 


Corollary 4.3: Let y1, y2, ...,; Yn be solutions to the linear differential equation (4.3.2). Then the family of 
solutions (4.3.4) with arbitrary coefficients C,,C2,...,C,, includes every solution if and only if their Wronskian 
does not vanish. 


Theorem 4.12: Consider Eq. (4.3.1) whose coefficients p() and g(x) are continuous on some interval (a,b). 
Choose some point 2 from this interval. Let y; be the solution of Eq. (4.3.1) that also satisfies the initial 
conditions 

y(to) =1, y"(a0) = 0, 


and let yp be another solution of Eq. (4.3.1) that satisfies the initial conditions 


og) Os may ag) — le 


Then y; and ya form a fundamental set of solutions of Eq. (4.3.1). 
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To show that functions y; and yz are linearly independent, we should calculate their Wronskian at one 
point 
yi(to) y2(xo) 1 0 
W (y1, Y2} Xo) = - = 1 
(4,250) =| a4 (ao) yhoo) |=] 0 1 
Since their Wronskian is not zero at the point xo, it is not zero anywhere on the interval (a, b) (see Corollary 4.2 on 
page 202). Therefore, these functions are linearly independent (by Theorem 4.9, page 201). | 
This theorem has a natural generalization, which we formulate below. 


Theorem 4.13: There exists a fundamental set of solutions for the homogeneous linear n-th order differential 
equation (4.3.2) on an interval where coefficients are all continuous. 


Theorem 4.14: Let yi, y2,..-,Yn be n functions that together with their first n — 1 derivatives are continuous 
on an open interval (a,b). Suppose their Wronskian W(y1, y2,.--,Yn;z) # 0 for a < « < b. Then there 
exists the unique linear homogeneous differential equation of the n-th order for which the collection of functions 
Y1,Y2;---,Yn forms a fundamental set of solutions. 


We prove the statement only for a second order differential equation of the form (4.3.1). Let y, and 
def 


y2 be linearly independent functions on interval (a,b) with W(y1, yo;7) = W(x) £0, x € (a,b). Then we have 
Wi + pl@)y, +a(@)y1 =0, ys + p(@)y2 + a(@)y2 = 0. 


Solving this system of algebraic equations for p(x) and q(x), we arrive at the equations (4.2.5), which determine the 
functions p(x) and q(a) uniquely. The formula for the general case is given in #12 of Summary, page 255. a 

To solve an initial value problem for the linear n-th order differential equation (4.3.2), one must choose the 
constants C; (j = 1,2,...,n) in the general solution (4.3.4) so that the initial conditions y(zo) = yo, y'(@1) = 1, 
..., YY) (x9) = Yn—1 are satisfied. These constants are determined upon substitution of the solution (4.3.4) into 
the initial conditions, which leads to n simultaneous algebraic equations 


Since the determinant of the above system is the Wronskian evaluated at the point x = xo, the constants are 
determined uniquely. Does it mean that a nonvanishing Wronskian at a point implies a unique solution of the 
corresponding initial value problem? Unfortunately no because it is a necessary but not a sufficient condition. (For 
instance, the Wronskian of y; = x~! and y2 = x”, the fundamental set of solutions for the equation xy!’ — 2y = 0, 
is constant everywhere, including the singular point z = 0.) On the other hand, if the Wronskian is zero or infinite 
at the point where the initial conditions are specified, the initial value problem may or may not have a solution or, 
if the solution exists, it may not be unique. 


Example 4.3.1: Let us consider the differential equation 
yi +y=0. 


The function y:(x) = cosa is a solution of this equation and it satisfies the initial conditions y(0) = 1, y’(0) = 0. 
Another linearly independent solution of this equation is y2(x) = sin x, which satisfies y(0) = 0, y’(0) = 1. Therefore, 
the couple {y1, y2} is the fundamental set of solutions of the equation, and the general solution becomes 


y(x) = Cy yi(x) + C2 yo(x) = Ci cosa + C2 sing. 


To find the values of constants C;, C2 so that the function satisfies the initial conditions y(7/4) = 3/2 and y'(m/4) = 
—2,/2, we ask Mathematica to solve the system of algebraic equations 


C, cos(m/4) + Cg sin(1/4) = 3, —C\ sin(m/4) + C2 cos(m/4) = —2. 
y[x_] = cl Cos[x] + c2 Sin[x]; y’’[x] + y[x] == 0 (* to check solution *) 
eqns = {y[Pi/4] == 3*Sqrt[2], y’ [Pi/4] == -2*Sqrt[2]} 


c1c2 = Solve[eqns, {c1, c2}] soln[x_] = ylx] /. ctc2[(1]] 


206 Chapter 4. Second and Higher Order Linear Differential Equations 
This yields the solution y = 5 cosx + sina. 


Example 4.3.2: Given two functions y;(2) = 2° and yo(x) = x?. These functions are linearly independent on the 


interval (—o0,0o). Their Wronskian W(x) = —2x* vanishes at x = 0 because x? and x? are tangent there. Using 
formulas (4.2.5), page 202, we find 
4 6 
p(“) = —-, q(z)=—=, «r#0. 
x x 


Therefore, the corresponding differential equation is 
-=ytsy=0 or ay” — day’ + 6y =0. 
x 


The coefficients of this equation are discontinuous at « = 0. Hence, functions y;(a2) and y2(x) constitute a funda- 
mental set of solutions for this equation on each of the intervals (—oo,0) and (0, 00). 

An initial value problem has a unique solution when the initial conditions are specified at any point other than 
x = 0. When these initial conditions are replaced with y(0) = 2, y’(0) = 3, the problem has no solution. However, 
with the initial conditions y(0) = 0, y’(0) = 0, the problem has infinitely many solutions y = Cia? + Cx”, with 
arbitrary constants C and C2. 


Example 4.3.3: For a positive integer n and a real number a, let us consider n functions y; = e*”, yg = re, ..., 
Yn = x”—'e%, Using Lemma 4.1 on page 199, the Wronskian of these functions becomes 


Wy, Y2s--+5 Unie) =e wil, Ty... aes x) =e Il (k!) = 0, (4.3.5) 


where ii Gas a, denotes the product a9a1a2---@n—1, similar to 5> for summation. Each function yz41 = gk er, 
k =0,1,...,n—1, is annihilated by the operator (D — a)*+! and, therefore, by any operator (D — a)™ for m > k, 
where D = d/dz. This claim can be shown by induction and it is based on the formula (D — a)x* e* = ka*~! e2”, 
So every application of the operator D — a to x* e®* reduces the power x* by 1. This observation allows us to 
conclude that the given functions y1, ye, .-., Yn form the fundamental set of solutions to the differential equation 
(D—a)"y =0. 


Example 4.3.4: Find the value of the Wronskian, W[y1, y2](2), at the point x = 2 given that its value at x = 1 is 
W([y1, y2|(1) = 4, where y; and yg are linearly independent solutions of xy" + 2y’ + xysinz = 0. 
Solution. First, we reduce the equation to the normal form (4.2.2): 


2 
y+ —y' + (sina) y = 0, 
with p(x) = 2/x and q(x) = sinz. From Eq. (4.2.3), we find 


W(2) 


W(1) exp {-f p(t) av} = W(1) exp {—2(In2 — In1)} 
= W(1) exp{In2-7} = W(1).2-7 =4-2-7 =1. 


1. Verify that yi(x) = cos3xe?* and yo(x) = sin 3x?” both satisfy y” — 4y’ + 13y = 0 and compute the Wronskian of the 
functions yi and ye. 


2. Verify that the functions yi (2) = 2? and y2(x) = 2~* form a fundamental set of solutions to some differential equation 
on an interval that does not contain x = 0. 


3. For the given pairs of functions, construct the linear differential equation for which they form a fundamental set of 
solutions. 
(a) W=rl, Y= x; (b) Yyi=sinz, yo =cosz; (c) yi=sing, yo =tanz; 
@) yer’, w=2; (©) wae? wer, ©) wosinde, w= oy; 
@): wee, pee ©) wee’ wae GCG) we tane, y=00te; 
G)  wm=2, p=l/e, (kK) w=rt+l, pw=rt-l (I) we=rt+l, y=e’. 


4.3. 


11. 


12. 
13. 


14. 


15. 


16. 


ares 


. Let ¢i1(@) and ¢2(x) be nontrivial solutions to the linear equation (4.3.1), where the functions p(x 
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. Prove that if two solutions yi (x) and y2(x) of the differential equation y” + p(x)y’ + q(x)y = 0 are zero at the same point 


in an interval (a,b), where the coefficients p(«) and q(x) are continuous functions, then they cannot form a fundamental 
set of solutions. 


. Answer true or false for each of the following statements. 


(a) If yi(a) and y2(a) are linearly independent solutions to the second order linear differential equation (4.3.1) on 
interval |a, b|, then they are linearly independent on a smaller interval |a, 8| C |a, bj. 


(b) If yi(a) and y2(x) are linearly dependent solutions to the second order linear differential equation (4.3.1) on 
interval |a,b|, then they are linearly dependent on the larger interval |a, | D |a, bj. 


and q(a) are 


) 
continuous at zo. Show that there exists a constant K such that ¢1(@) = Kd2(z) if (xo) = 65(xo) = 0. 


. Prove the Sturm Theorem: 


If t; and tz (t1 < tz) are zeroes of a solution x(t) to the equation %(t) + Q(t)a(t) = 0 and 
Q(t) < Qi(t) for all t € [t1, te], 


then any solution y(t) to the equation ij(t) + Qi(t)y(t) = 0 has at least one zero in the closed interval [t1, t2]. 


. Find the longest interval in which the initial value problem is certain to have a unique (twice differentiable) solution. 


(a) t(t—5)y+ 3ty=3, y(3) =y(3) = 1; (b) t(t—5)y+3ty=3, y(6) =y(6) = 1, 
(c) t(t—5)9+4y=3, y(-1)=y/(-1)=1; (qd) (P—ly+yty=3, y(0)=9(0) =1; 
(e) (P-Dg+y+y=3, y(2)=9(2)=1; (f) (P+le+yt+y=3, y(0)=9(0) =1; 
(g) (P+ 4)9+4y=4, y(-2) =9(-2) =1; (h) sinty+ty=t?, y(1) =y(1) =2; 

(i) (P+t-6)j+y=t, y(0) =9(0) =1; (j) (t? 4+ 2¢-3)gt+y=2, y(0)=9(0) = 


. Prove Theorem 4.11 (page 204) by mathematical induction. 
10. 


Show that the functions 7%, 2°, and «7 form a fundamental set of solutions to the differential equation 


wy” + (83-a-B-y)y"+(l-a-B-y+aB+ay+ By)cy’ - abyy =0 
if a, 8, and y are distinct real numbers. 
If yi and ye are linearly independent solutions of ry” + y’ + xycosx = 0, and if W[yi,y2](1) = 2, find the value of 
Wy, 2] (2). 
Does the initial value problem 2?y” — 2y = 0, y(0) = y’(0) = 1 have a solution? 


Verify that each of the sets, {a~?, 279} and {2a 2_ 5a 3 5a 74+2¢ =. is a fundamental set of solutions to the 
differential equation x?y” + 6a y’ +6y = 0. 
-1/2 


Show that the functions u(#) = x and v(a) = a? are solutions of the nonlinear differential equation «?yy” + 
(xy’ — y)” = 3y?, but no linear combination of them is a solution, although w(x) = Vci@—! + 27 is a solution for 
arbitrary constants c1, C2. 


Verify that sin® 2 and sin x — z sin 3a are solutions of y” + (tana —2cotx)y’ = 0 on any interval where tana and cot x 


are both defined. Are these solutions linearly independent? 
From the given functions, choose a fundamental set of solutions to the differential equation. 
(a) 2e-*, cosha, — sinha on (—oo, co) for the equation y” — y = 0. 

(b) «3, 32° Inz, x? (Ina — 5) on (0,00) for the equation x? y” — 5ay’ + 9y = 0. 

(c) 2 sin2x, —cos 2x, cos(2x + 7) on (—oo, 00) for the equation y” + 4y = 0. 

(d) 4x, $ n 2 — 2, = on (—1,1) for the Legendre equation (1 — x*)y” — 2xy’ + 2y = 0. 


1-2 


(e) e”, e” Inz, xe®* on (0,00) for the equation ry” + (1—2)y’+xy=0. 
/ 


(f) x, xe”, z nz on (0,00) for the equation #?y” — #(2+-2)y’ +(a+2)y=0. 
(g) x, 4 (x? — 2) Vx? +1+ # arcsinh(x), x nz on (0,00) for the equation (x? +1) y” —3ry'+3y=0. 


2 
(h) x? — 32, ae =a (x* — 3x), x°V/4— a? on |2x| < 2 for the Dickson equation (x? — 4) y+ ry’ —9y =0. 


For the given pairs of functions, construct the linear differential equation for which they form the fundamental set of 
solutions. 
(a) yw=2, yon’: (b) yi =sin2z, yo = cos 2a; (ec) yaa, waa: 
(d) w= gil? Yo = gi/3. (e) y=sin'a, y=sin-?a2; (f) y=2z, y=ln|z|; 
¢) nee, wee () wee", wae @ we, yw =sing; 
G) wm=2?, w=2" (K+) m=2?t+lw=2?-1, 0) n=bi[el, w=en; 
(m) y=2%, y=e"; (n) yw=1, yw=1/z; (0) y=e", y=oy. 
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4.4 Homogeneous Equations with Constant Coefficients 
Consider a linear differential equation L[D]y = 0, where L[D] is the linear differential constant coefficient operator 
LID] =a,D” + @p—i~D") +:++++a,D+ a0, An x 0, (4.4.1) 


containing the derivative operator D = d/da, and some given constants az, k = 0,1,...,n. As usual, we drop D°, 
the identical operator. This differential equation L[D]y = 0 can be rewritten explicitly as 


d"y d”™ty dy 
On soe $Onat Gog to Ha 5 tao y(2) = 0. (4.4.2) 


In particular, when n = 2, we have the equation 
ay” + by’ + cy =0, (4.4.3) 


whose coefficients a,b, and c are constants. No doubt, such constant coefficient equations are the simplest of all 
differential equations. They can be handled entirely within the context of linear algebra, and form a substantial 
class of equations of order greater than one which can be explicitly solved. Surprisingly, such equations arise in a 
wide variety of physical problems, including mechanical and electrical applications. A remarkable result about linear 
constant coefficient equations follows from Theorem 4.3 on page 189, which we formulate below. 


Theorem 4.15: For any real numbers a, (k = 0,1,2,...,n), an £0, and y; (i = 0,1,...,n —1), there exists a 
unique solution to the initial value problem 


any” +an-1y") +--+ + aoy(z) =0, y(%0) = yo, y/(to) = 41, ---, YY (@0) = Yn-1- 


The solution is valid for all real numbers —oo < x < @. | 


The main idea (proposed by L. Euler) of how to find a solution of Eq. (4.4.3) or Eq. (4.4.2) is based on a property 
of the exponential function. Namely, since for a constant and a positive integer k, 


k 


dak 


peer = er = Me 


it is easy to find the effect an operator L{D] has on e*”. That is, 


L[Dje** = 9,02 44, 40° (6 +t ee + ae" 
= pA e* +og\ eo fo +aghe™ tape = Live®. 


In other words, the differential operator L[D] acts as an operator of multiplication by a polynomial on functions of 
the form e**. Therefore, the substitution of an exponential instead of y(x) into Eq. (4.4.3) or Eq. (4.4.2) reduces 
these differential equations into algebraic ones, which are more simple to solve. 
For the second order differential equation (4.4.3), we have 
2 
a ee + jo +ee7 = (aD? + bD +c) e*™ = (ad? + bA+ cc) e™. 
da? dx 


Hence, the exponential function e*” is a solution of Eq. (4.4.3) if \ is a solution of the quadratic equation 


—b+ Vb? — 4ac 
2a ; 


ar? +bA+c=0 = 12 = (4.4.4) 


The quadratic equation (4.4.4) is called the characteristic equation for the differential equation of the second 
order (4.4.3). This definition can be extended for the general linear differential equation L[D]y = 0 (where L[D] is 
defined in Eq. (4.4.1)) using the relationship 


e-** L[Dje** = a,” + dn_1A%* +--+ + ag. 
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Definition 4.6: Let L/D] = a,D" + a,_1D"~!+.---+aoD° be the differential operator of order n with constant 
coefficients. The polynomial L(\) = apy A” + an—1\"~ 1 +--+ +0 is called the characteristic polynomial, and 
the corresponding algebraic equation 


An XA” + And"! +++» tap = 0 (4.4.5) 


is called the characteristic equation. 


Assuming for simplicity that the roots of the characteristic equation (4.4.5) are distinct, let them be denoted by 
A, Ag, -.+;An- Then the n solutions 


niee”, wajeor™, «. . Zee (4.4.6) 


are linearly independent because their Wronskian is not zero. In fact, after pulling out all exponentials from the 
columns, the Wronskian becomes 


Wyn, Yas +++. YniZ) = Wa) = eOrtretAe)® V0, Aas... An), 


ee 1 i. woe 
M1 d2 Xn 

V(Ai, Aa; An) MM r2 an 

A ao eee 


is the so-called Vandermonde“? determinant. It can be shown that V equals 


Vr Az. An) = TP Qs A) =P? TYP Qi), 


l<i<j<n l<i<j<n 


which is the product of all differences A; — A; with 7 > 7. For instance, when n = 3 we obtain V = (Az — A1)(A3 — 
A2)(A3 — Ar) = —(A1 — A2) (At — A3)(A2 — A3). This shows that the Wronskian of exponential functions is not zero if 
and only if all roots of the characteristic equation are different. 

The general solution of Eq. (4.4.2) is a linear combination of the functions from the set (4.4.6), that is, 


y= Cie" A. C202” Sener Ge, 
in which C1, Co,...,C,, are arbitrary constants. In particular, the general solution for Eq. (4.4.3) is 
y= Cre” + Cre”, (4.4.7) 


where \, and \z are distinct roots of the characteristic equation a\? +b’ +c =0. Actually, this result follows from 
the statement below. 


Theorem 4.16: If m1, m2, ..., Mn are all distinct numbers, and P,, P2,..., P, are polynomials, then 
Py(x)e™* + Po(a)e™” +--+ + P,(x)e™” (4.4.8) 
cannot vanish identically unless all the P;(x) (¢ = 1,2,...,m) vanish identically. 


We use mathematical induction to prove this claim. The basis case n = 1 is obviously true. Assuming 
that this is true for a given n; we prove it is true forn +1. Let m1, mo, ..., Mn, Mn+1 be all distinct numbers, and 
P,, Po, ..., Pn, Pn4i be polynomials such that 


P, (a)e™ + P2(x)e™” aero o Pi (aje™* af Pri (x)e™n+1® =0. 


If Py4i(#) = 0, then all the other polynomials are zero by the inductive hypothesis. If not, we can divide the 
equation by P,41(x)e™"*!*, obtaining 


Ry(a)e™” + Ro(a)e™* +---+ Ry(a)e™” +1=0, 


40Named after the French musician, mathematician, and chemist Alexandre-Théophile Vandermonde (1735-1796). 
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where the R; are rational, r; = m; — m,+1. Differentiating, we get 
(Ri tri Rie" +--+ + (Ri +91nRn)e™” = 0. 


If we clear fractions, the left-hand side becomes an expression of form (4.4.8), which vanishes identically. Since the 
common denominator does not vanish identically, the hypothesis of induction shows that for all i 


Ri+riRi =0, R,=Cye"™*, i=1,2,...,n. 
This contradicts the fact that all R;(x) are rational. Hence, all P;(x) are identically 0. 


Example 4.4.1: Solve the initial value problem y” —4y=0, y(0) =0, y’/(0) =3. 
Solution. The characteristic equation is A? — 4 = 0, with roots \; = 2 and Az = —2. Hence, the general solution 
of the differential equation is 
y = Ce** + Coe". 


It remains to enforce the conditions at « = 0. Now we have 
0=Cy+Co, 3 = 2C, —2Co. 
Simultaneously solving these equations for C, and C2, we conclude that C, = 3/4 and Cp = —3/4. Thus, 


3 22 - 3 20 or y= sinh(2z) 


y= 4 4 


since sinha = 5 (e* — e~®). 


d® dy d 
Example 4.4.2: Solve the equation ha ee aks + = + 6y = 0. 
as de? da?" de 

Solution. First we write the characteristic equation 


ead pA-6S 0. 


Its roots A = —1,2,3 may be obtained by synthetic division if one of the roots is known (or guessed). Suppose we 
find one root A = —1. Then the characteristic polynomial can be factored as \? — 4\? +A +6 = (A+1)(\?-—5A+6) = 
(A + 1)(A — 2)(A — 3). The general solution is seen to be y = Cye~* + Cge”* + C3e®*. We check our answer with 
Mathematica: 
L[z.y- yi) = yy? te) — 4 oy Ee) ey Ee) > 8 yi] 
CharPoly[lambda_] = Coefficient[L[x, Exp[lambda #] &], Exp[lambda x]] 
roots = lambda /. Solve[CharPoly[lambda] == 0, lambda] 
Solns = Map[Exp[# x] &, roots] 
Al1Solns[x_] = Solns/.{c1, c2, c3} 
Simplify[L[x, Al1Solns] == 0] 

Now suppose that we want to find a particular member of the set of solutions (4.4.7) that satisfies the initial 
conditions y(xo) = yo, y’(20) = yp. To answer this question, we substitute the general solution y = C; e*!* + C2 e*2* 
into the given initial conditions. This will lead to the following algebraic system of equations with respect to C1, C2: 


Cye™* + Cpe?" = yo, CyA1 e*1*9 + Coz "= yp. 


The determinant of these algebraic equations for C; and C2 is the Wronskian W eas eran; a] = (Agq—Az)eGQi trae # 
0. Therefore, we can find C; and C2 to be 


At — Ag At — Ag 


Problems| In all problems, D stands for the derivative, while D°, the identity operator, is omitted. The fourth 


derivative of the function y is denoted by y“, whereas all previous derivatives are denoted by primes; so y’” is the third 
derivative of y. The derivatives with respect to t are denoted by dots. 


bey Soa 
C= Yo ~ Yor2 e710 Co = Yori ~ Yo e720. (4.4.9) 


1. Would the function y(x) = e*” be a solution of the differential equation x? y” + 2x y' + y = 0 for some value ? 
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10. 


11. 


12. 


13. 


. Write out the characteristic equation for the given differential equation. 


(a) yy +2y=0;  (b) 2” +y'-4y=0; (c) yO -y=0; 
(d) 2y”’+y'-5y=0; (e) y”—y=0; (f) yO +24” +y =0; 
(g) 3y”+4y'+y=0; (hb) 3y”—-y"+y=0; (i) yO —2y!” +y"—2y' =0. 


. The characteristic equation for a certain differential equation is given. State the order of the differential equation and 


give the form of the general solution. 
(a) +A-6=0; (b) A?—-3A4+2=0; (ce) A? +2A\—-3=0; 
(d) \?-1=0; (e) A2-3A7-2=0; (f) 3A°4+2\?-1=0; 
8-27 -A+2=0; (h) 247-d?-1=0; (i) 4\?-1=0. 
(g) ; ‘ 


. Write the general solution to the given differential equations (D stands for the derivative). 


(a) 8y"—6y’+y=0; (b) xy" +y"—-2y'=0; (cc) 4y” —21y’ — 10y=0; 


(d) y+ 30y = 19y’; (e) y”—14y'+8y=0; (f) (0° +3D?—4D)y =0; 
(g) (D?+2D)y=0; (h) (D?+D-—6)y=0; (i) (D> — 3D? — 10D)y = 0; 
(j) (D? +. 2D)y = 3y; (k) (D?—5D+6)y=0; (1) y’” +6y" + 1ly’ + 6y = 0; 


(m) 4y”—4y'-—3y=0; (n) 4y”-—4y’—5y=0; (0) y’” +3y" —4y’— 12y =0. 


. Solve the initial value problems. 


(a) y” —2y’—3y=0, y(0)=0, y/(0) = -4. 
(b) y”—y'—6y=0, y(0) =0, y/(0) =5. 
(c) 4y” —4y’—3y=0, y(0)=0, y'(0 

(d) y" +3y’—10y=0, y(0) =2, y’(0 

(e) y” — 6y'+ 8y=0, y(0) = 2, y/(0) =6. 
(f) 2y” —5y’+2y=0, y(0) =3, y’(0) =3. 


. Find the solutions to the given initial value problems for equations of degree greater than 2. 


(a) y’—4y'’=0, y(0) =0, y’(0) =1, y"(0) =8. 

(b) y” — 2y” —5y'+6y=0, y(1)=0, y(1)=1, y”(1) =-1. 
(c) 4y/" — 4y" — 23y’ + 30y =0, (0) =0, y/(0) = —4, y"(0) = 4. 
(d) y” —2y"—y'+y=0, y(0)=0, y'(0) =5, y"(0) =3. 

(e) y!” — 3y" — 25y’—21ly=0, y(0)=5, y’(0) = —2, y(0) =8. 


. Construct the general form of the solution that is bounded as x —> oo. 


(a) y”-y=0; (b) y"=0; (c) y"+y'—6y=0; (d) dy” +3y’—2y=0; 
(e) y’+y=0; (f) y"=9; (g) yy” +108y=9y'’; (h) 3y’+17y' —6y=0. 


. Consider a constant coefficient differential equation y” +a, y’ + a0 y = 0, whose characteristic equation has two distinct 


real roots A; and A2. What conditions on the coefficients a; and ao guarantee that every solution to the given ODE 
satisfies limz—co y(x) = 0? 


Find the fundamental set of solutions for the following equations specified by Theorem 4.12, page 204. 

(a) g#=y; (b) F+y-6y=0; (c) F-3y+2y7=0; (d) BWty—y=O. 
Let D denote the derivative operator. Find the fundamental set of solutions of the following third order differential 
equations by extending Theorem 4.12. 


(a) (3D°+5D?—2D)y=0; (b) (D®>—3D?+4)y=0; (c) (D+1)(D+2)Dy =0. 
Find the solution of the initial value problem 
y"—4y=0, (0) =3,  y"(0) = 2. 


Plot the solution for 0 < t < 1 and determine its minimum value. 


Find the solution of the initial value problem 
2y" —5y'+2y=0, y(0)=1, y/(0)=-1. 


Then determine the point where the solution is zero. 


Solve the initial value problem 2’ + 5y’ — 3y = 0, y(0) = 1, y’(0) = 8. Then find the smallest positive value of 6 for 
which the solution has no minimum point. 
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4.5 Complex Roots 


Consider an equation L[D]y = 0 for the linear constant coefficient differential operator (4.4.1), page 208. Suppose 
that the corresponding characteristic equation has complex roots. To construct the fundamental set of solutions, 
we need to lay down a definition of exp(z) for the complex values of z. First, we recall some basic properties of 
complex*! numbers. 


Definition 4.7: A complex number is an ordered pair of two real numbers, often written as z = (a, y). The first 
coordinate (abscissa) x is called the real part of the complex number z. Historically, the real number (ordinate) 


y is called the imaginary part of the complex number z. For these components, the following notations are 
usually used: Rz=Rez=2 and S¥z=Imz=y. 


Two complex numbers 21 = (21, yi) and zg = (#2, y2) are equal if and only if their real and imaginary components 
are separately equal, that is, 7] = x2 and y; = yg. The set of all ordered pairs can be visualized as the set of points 
on the plane R? or the set of vectors on the plane starting at the origin. However, we distinguish the set complex 
numbers from two former sets by introducing the following arithmetic operations: 


1° Addition: 2, + 22 = (a1 +: 22, y1 + ye). 
2° Subtraction: =z, — 22 = (#1 — %2, y1 — y2). 
3° Multiplication: 2,22 = (x1 22 — yiy2, 21 y2 + Lay1). 


# Division: 2 = (Betws, a ae) 


7,2? 22 
T+ YS Ly + Y3 


Definition 4.8: The set of all ordered pairs with such operations of addition, subtraction, multiplication, and 


division is called the field of complex numbers and it is denoted as C. 


Unfortunately, there is no unique notation for geometric representation of complex numbers—actually, there are 
two of them. Euler suggested to denote by 1 = (1,0) and i = (0,1) the unit vectors of abscissa (z-axis) and ordinate 
(y-axis), respectively. Mathematicians still follow this genius. On the other hand, in engineering and computer 
science, it is a custom to use another notation, j = (0,1), for the unit vector in the positive vertical direction, while 
keeping i = (1,0) for the unit vector in the horizontal direction. Historically, the x-axis is called the axis of reals 
and the y-axis is called the axis of imaginaries. Such representation of an arbitrary complex number z in terms 
of its projections x = Rz and y = Sz, 


z=al1l+t+yi or z=arit+yj, (4.5.1) 


is called the Cartesian (or rectangular) form of a complex number. It is a custom to drop the unit vector in the 
horizontal direction and use only one vector along the ordinate, either i in mathematics or j in engineering. Hence- 
forth, we shall denote the complex number (a,0) simply as a. Since addition and multiplication are commutative, 
we have «+ yj = yj +a and yj = jy; for brevity, we write x instead of x + jO and jy instead of 0+ jy. The former 
complex number is said to be purely real and the latter is said to be purely imaginary. 

The representation in polar coordinates proves a useful alternative to the Cartesian form (4.5.1). We take the 
pole at the origin of the Cartesian axis and the polar axis along the positive axis of reals. Let r,@ denote the polar 
coordinates of the point z = (a, y) on the complex plane. Then 


x=rcosé, y=rsind 
and (4.5.1) becomes 
z=x2+jy =r(cosé+jsin6). 


The last relation gives the trigonometric form of the complex number z. The quantities r and @ are called the 
modulus (absolute value) and argument (or amplitude) of z, respectively. The modulus r = |z| = \/2? + y? is the 


41 Complex numbers were named by Carl Friedrich Gauss (1777-1855), but they were used many years previously, starting from the 
work on cubic equations by the Italian mathematician Gerolamo Cardano (1501-1576). The word “complex” means a union of two (real) 
numbers. 
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distance from the origin to the point z = x+jy. In polar coordinates, infinitely many values of # represent the same 
point on the plane, which depend on the number of whole revolutions. Therefore, the argument is not a function 
because it has an infinite number of values differing from one another by arbitrary multiples of 27. To single out 
one of them, we restrict the range of # to obtain a function @ = arg z, called the branch of the argument of z, if 
the range of arg z coincides with an interval of the length 27. If —a < @ < a, then such a value of @ is called the 
chief amplitude of z or the principal branch of the argument of z. 


Definition 4.9: The complex number zx — jy is called the conjugate of the complex number z = x + jy and is 
denoted by 7 = x — jy. Their product z-% =r? is modulus squared. 


Definition 4.10: The operation of raising the number e ~ 2.71828... to the power z = x + jy is defined by 
the equation 


et ty — e* el¥ = e® (cosy + jsiny). 


Putting xz = 0 and y = 9, we obtain Euler’s formula: 


ec — cos6 + jsind 


From elementary algebra, it is known that complex roots of a polynomial with real coefficients always come in 
conjugate pairs. Therefore, if \ = a + J is a root of the characteristic equation L(\) = 0, then X = a — jG is 
also a root of L(A) = 0. It must be kept in mind that this result is a consequence of the real coefficients in the 
equation L(A) = 0. Complex roots do not necessarily appear in pairs in an algebraic equation whose coefficients 
involve imaginary numbers. 


Example 4.5.1: Consider a differential equation y’” + y = 0, for which the characteristic equation becomes 
\? +77 = 0. Obviously, it has two complex conjugate roots: \ = +j, to which correspond two linearly independent 


solutions 1.2 = e~)"” = cos(rz) + jsin(7z). Some values of these functions are 
el"/? — cos > + jsin > = j. el” —cosa+jsina =—1. 
ei8n/2 = cos = + jsin > = —j. el?" — cos2n +jsin2nx = 1. 


eit/4 = cog" +jsin— es ig 

4 4°92 “V2 

Now we construct, in a usable form, solutions of the homogeneous equation L[D]y = 0, where L[D] is a constant 

coefficient differential operator (4.4.1), page 208, whose characteristic equation has a complex root. Let us for 
simplicity consider the second order differential equation 


ay” + by’ + cy =0 (4.5.3) 


with real constant coefficients. Suppose now that the discriminant b? — 4ac is negative. Then, the roots of the 
characteristic equation, a\? + b\ + ¢ = 0, are complex conjugate numbers; we denote them by 


b V4ac — b? 
M=atj8, 2=a-j8 [a=-—, B=, dacdP). 
2a 2a 
The corresponding complex conjugate functions 
yi(x) = een, yo(x) = eras 


form the fundamental set of solutions for Eq. (4.5.3) because their Wronskian W[y1, y2|(x) = —2j8 e?°* 4 0 for any 
x. According to Theorem 4.11 on page 204, the linear combination 


y(x) = Crys (x) + Caye(x) = Cy e°* 14° + Cy erie (4.5.4) 


is also a solution of Eq. (4.5.3). The constants C; and C2 in the above linear combination cannot be chosen 
arbitrary—neither real nor complex. Indeed, if C; C2 are arbitrary real numbers, then the solution (4.5.4) is a 
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complex-valued function. However, we are after a real-valued solution. Since y; and yz are complex conjugate 
functions, the coefficients C; and C2 must also be complex conjugate numbers C, = C’2. Only in this case their 
linear combination will define a real-valued function containing two real arbitrary constants. Using Euler’s formula 
(4.5.2), we obtain 
y(x) = C, e* (cos Bx + jsin Bx) + C2 e™ (cos Bx — jsin Bx). (4.5.5) 
We can rewrite the last expression as 
y(x) = (Ci, + C2) e*” cos Ba + j(C1 — C2) e® sin Bx. 


Finally, let C,; + Cp = C, + Cy = C3 and j(C, — C2) = j(C, — Ci) = Ca, where C3 and C4 are new real arbitrary 
constants. Then the linear combination (4.5.5) is seen to provide the real-valued general solution 


y(a) = C3 e°” cos Ba + Cye™ sin Ba = e%” (C3 cos Ba + Cy sin Ba) , (4.5.6) 


corresponding to the two complex conjugate roots \; = a+ jG, and Ax = a—j8, (8 # 0) of the characteristic 

equation. We would prefer to have a real-valued solution (4.5.6) instead of the complex form (4.5.5). In order to 

guarantee a real value of the expression (4.5.5), we must choose arbitrary constants C; and C2 to be conjugate 

numbers. It is more convenient to use real numbers for C3 and C4 to represent a general solution as a real-valued 

function. Whenever a pair of simple conjugate complex roots of the characteristic equation appears, we write down 

at once the general solution corresponding to those two roots in the form given on the right-hand side of Eq. (4.5.6). 
We can rewrite Eq. (4.5.3) in the operator form 


a[(D— a)? + 67] y=0, (4.5.7) 
where a = 2, B= sae and D stands for the derivative operator. This is obtained by completing the squares: 
b fee 
ow tite = «(0 +2-p.—4(—)-{—) +5 
2 2a 2a a 


II 
is) 


pa oe: aa | * , dac — 
2a a 4a2| 2a 4a? 
The general solution of Eq. (4.5.7) has the form (4.5.6). If in Eq. (4.5.7) we make the substitution: y = e®’ v, then 
v will satisfy the canonical equation 


vu" + B?v=0 —— v(x) = Cy cos Bx + Cy sin Ba. (4.5.8) 
Example 4.5.2: Find the general solution of 
i 
y +4y=0. 
Solution. The characteristic equation is \? + 4 = 0, with the roots \ = +2j. Thus, the real part of \ is RA = 0 and 
the imaginary part of A is SA = +2. The general solution becomes 


y(x) = Cy cos 2a + C2 sin 2a. 


Note that if the real part of the roots is zero, as in this example, then there is no exponential term in the solution. 


Example 4.5.3: Solve the equation 
y” — 2y' + By = 0. 
Solution. We can rewrite this equation in operator form as 
[D?-2D+5)y=0 or [(D-1)*+4]y=0, 
where D = d/dx. Then, the general solution is obtained according to the formula (4.5.6): 
y(x) = Ce” cos 2x + Coe” sin 2x = e” (Ci cos 2a + C2 sin 2x). 


Note that the same answer can be obtained with the aid of substitution (4.1.19), page 194, y = e*v(a), where v 
satisfies the two-term (canonical) equation v” + 4v = 0. 
If the initial position is specified, y(0) = 2, we get Ci = 2, so the solution becomes 


y(x) = 2e” cos2a + Coe” sin2a. 


This one-parameter family of curves is plotted with the following Mathematica script: 
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DSolve[{y’’ [x] - 2 y’[x] + 5 yx] == 0, y[0] == 2}, ylx], x] 


soln[x_] = Expand[y[x]] /. %[{[1]] /. 
curves = Table[soln[x], {c1, -2, 2}] 


C[1] -> ci 


Plot [Evaluate[curves], {x, -1, 2.5}, PlotRange -> {-16, 15}, 
PlotStyle -> {{Thick, Blue}, {Thick, Black}, {Thick, Blue}, {Thick, Black}}] 


Figure 4.1: Example 4.5.3. The family of solutions 
subject to the initial condition y(0) = 2, plotted with 
Mathematica. 


Note that all solutions meet at the discrete number of points « = 4 +n7,n =0, 


Figure 4.2: Example 4.5.3. The family of solutions 
subject to the initial condition y’(0) = 1, plotted 
with Mathematica. 


1, 
eter family of solutions (see Fig. 4.2) 


2,.... If the initial velocity 


y(x) = (1 — C2) e* cos 2x + Cz e” sin 2z. 


is specified, y’(0) = 1, we obtain another one-param 


Figure 4.3: Two solutions of the equation y/’+0.2y/+ 
4.01 y = 0, plotted with Mathematica. 


-10k7 


Example 4.5.4: Solve the equation 


600 F 


400 


200 F 


Figure 4.4: Example 4.5.4. Three solutions, plotted 
with Mathematica. 


y” — 3y" + Gy’ + 18y = 0. 


Solution. The corresponding characteristic equation 


dM? — 347 +91 +13 =0 


has one real root, A; = —1, and two complex conjugate roots A2,3 = 24 


Mathematica or Maple, as well as MuPad, Mazima, 
the general solution of the differential equation is 


y(a) =Cle "7 + 


or 


(A+ 1) ((A- 2)? +9) =0 


t 3j. They can be found with the aid of the 
Sage, SymPy commands Solve or solve, respectively. Hence, 


Cp e?” cos 3a + C3 e?” sin 3a. 


Problems| In all problems, D stands for the derivative operator, while D°, the identity operator, is omitted. The 


derivatives with respect to t are denoted by dots. 


1. The characteristic equation for a certain homogeneous differential equation is given. Give the form of the general 


solution. 
(a) %+1=0; (b) A? —21+4+2=0; (c) %+2\+5=0; 
(d) 7? —4+5=0; (e) A? —- 2149 =0; (f) 9\? — 6X + 37 = 0; 
(g) 44?4+44+65=0; (h) 9\74+6\482=0; (i) A*2—2A4 26 =0; 
(j) 4)? +16A\+17=0; (k) 9\? +544 +82=0; (Il) 4\°?—48\ 4145 =0. 


216 Chapter 4. Second and Higher Order Linear Differential Equations 


2. Write the general solution of the following differential equations. 


(a) yy" + 9y =0; (b) yy” + 6y’+13y=0; (ce) yy” + 4y" + 5y’ = 0; 
(d) y” — 6y’ + 25y = 0; (e) y’ +10y’+29y=0; (f) Ay” — 4y’ + 5y = 0; 
(g) 2y” + 10y’ + 25y=0; (h) y” +2y' + 10y = 0; (i) Ay” — dy’ + 17y = 0. 


3. Find the general solution of the differential equation of order 3, where D = d/dz. 
(a) (D°+8D°+25D)y=0; (b) 16y” —16y”+5y’=0; (c) (D?—3D—2)y=0. 


4. Find the solution of the initial value problems. 


(a) y"+2y'+2y=0, y(0)=1, y/(0)=1 
(b) y+ 4y'+5y=0, y(0) =0, y/(0) =9 
(c) y” —2y'+2y=0, y(0) =—3, y'(0) =0. 


(d) 4y” + 8y’ + 5y = 0, y(0) = 1, y/(0) =0. 

(e) €+2Vk?—Ba+k?x=0,k>b>0; wx =Oand ¢ =da/dt = vo when t =0. 

(f) y” + 2ay'+ (a? +1)y=0, (0) =1, y'(0) =0. (g) y"+49y=0, y(0) = 2, (0) =1. 
5. Find the solution of the given [VPs for equations of degree greater than 2. 

(a) y'” — 2y" —5y' + 6y=0, y(1) =5, yA) =0, y"(1) =0. 

(b) y”+y=0, yA) =1, y'0)=1, y") =0. 

(c) y” — 8y" + 9y' + 18y=0,  y(0) = 1, y/(0) =0, y"(0) =5. 

(d) y+ 5y" + 17y' + 13y=0,  y(0) = 2, y’(0) = 0, y"(0) = -15. 

(e) yy" +y'—-y=0, y(0) =0, y/(0) =1, y"(0) =2. 

(f) 4y’” + 28y” + 6ly’+37y=0, y(0) =1, y/(0) =0, y”(0) = —5. 

(g) 2y% + 11y’” — 4y” — 69y’ +34y=0, y(0) =1, y'(0) =3, y’(0) = —4, y’”(0) = 55. 

(h) y — 2y" + 16y’—15y=0, (0) = 0, y'(0) =0, ¥"(0) = 3, (0) = —11. 


d?y 2 dy 

6. Replace y = u/x in the equation —> + — — 
. - / AO dz? - x dx 

7. Consider a constant coefficient differential equation y’” + aiy’ + aoy = 0; its characteristic equation has two complex 
conjugate roots A; and Az. What conditions on the coefficients, ai and ao, guarantee that every solution to the given 


ODE satisfies limz-—o. y(x) = 0? 


+ y =0 and solve it. 


8. Find the solution to the initial value problem 


and determine the first time at which |y(¢)| = 3.4. 
2 


‘ . 3 2 4 . 3 5 
9. Using the Taylor series e* =1+a2+3+5+---,cosr=1—3+4—-:--, andsinx =x—4+4,—---, prove the 


BI 
Euler formula (4.5.2). 


10. As another way of obtaining Euler’s formula (4.5.2), consider two functions y: (0) = e!? and y2(0) = cos@+jsin 6. Show 
that these two functions are both solutions of the complex-valued initial value problem 


y=, yO)=1. 


11. Using a computer solver, plot on a common set of axes over the interval —5 < x < 5 some solutions to the differential 
equation subject to one initial condition y(0) = 2. At what point do all solutions meet? 


(a) yy” — 2y'+10y=0; (b) y” —4y’+ 13y=0. 


12. Solve the initial value problems. 


(a) 4y”—12y'+13y =0, y(0)=0, y'(0) =1. 
(b) 4y” —20y'+4ly =0, y(0) =2, y'(0) =5. 
(c) y” —10y' + 29y = 0, y(0)=0, y'(0) =2. 
(d) 9y" —12y'+ 40y =0, y(0) =3, y'(0) =2. 
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4.6 Repeated Roots. Reduction of Order 
Suppose that in a constant coefficient linear differential equation 
L[Dly = 0, (4.6.1) 


the operator L[D] = a,D" +an_1D"-!+---+a ,D+ao, with D = d/dz, has repeated factors. That is, the characteristic 
equation L(A) = 0 has repeated roots. For example, the second order constant coefficient linear operator L[D] = 
aD? + bD +c has a repeated factor if and only if the corresponding characteristic equation a\? + b\+c=0 has a 
double root 

v1 = dQ = —b/(2a) . 


In other words, the quadratic polynomial can be factored A? + b\ + ¢ = a(A — 1)? if and only if its discriminant 
b? — 4ac is zero. In this case we have only one solution of exponential form: 


Yj (x) = ce ot/ (2a) 


for the differential equation 
ay +by'+cy=0, b? = 4ac. (4.6.2) 


To find another linearly independent solution of Eq. (4.6.2), we use the method of reduction of order (see §2.6), 
credited to Jacob Bernoulli. Setting 


y = v(x) yi 2) = ufe) e/29), (4.6.3) 
we have 
y= v(e) nla) + o(0)vh(@) = v'(@) ule) ~ P(e) nla), 
"= Maule) - 20m) +(F) ven) 


By substituting these expressions into Eq. (4.6.2), we obtain 


aoe) nla) -2@)n(o) + (=) vente) 


+b Ge yi(x) — 5 0(2) yi(x)} + cv(x) yi (x) = 0. 


After collecting terms, the above expression is simplified to 


av" (x) y1(x) = 0. 


The latter can be divided by the nonzero term ay;(2) = ae~°*/?%, This yields v(x) = 0. After integrating, we 


obtain v’(a) = Cy and v(x) = Cia + C2, where C; and Cp are arbitrary constants. Finally, substituting for v(«) 
into Eq. (4.6.3), we obtain the general solution of Eq. (4.6.2) as 


y(x) = (Cia + Cae ee, 
Functions e?” and xe?” are linearly independent since their Wronskian 


ey rei 


W(x) = yer (ay+1)e”* 


=e Ss O. 
Therefore, these functions form a fundamental set of solutions for Eq. (4.6.2), whatever the constant ¥ is. 


Theorem 4.17: Let ao, a1, ..., @n be n real (or complex) numbers with a, #4 0, and y(a) be a n times 
continuously differentiable function on some interval |a,b|. Then y(a) is a solution of the n-th order linear 
differential equation with constant coefficients 


def 


L[Dly = S- apD*y = any™ + an—1y"—) +---+aoy(z) =0, D=d/dz, 
k=0 


218 Chapter 4. Second and Higher Order Linear Differential Equations 


if and only if 


m 
y(e) = ) oe P,(2), (4.6.4) 
j=l 
nm 
where Aj, 2, .--; Am are distinct roots of the characteristic polynomial Sia = 0 with multiplicities mj, 
k=0 
Mm, .--, Mm, respectively, and P;,(x) is a polynomial of degree mz — 1. 


We prove this statement by induction. For n = 1, the characteristic polynomial L(X\) = a1\ + ao has 
the null \; = —ao/a. So the general solution is y = C e*!”, where P;(x) = C is a polynomial of degree 0 = m; — 1. 

Now, let n > 1 and assume that the assertion of the theorem is valid for the differential equation of order 
k <n. If Ay is a root of L(A) = 0 with multiplicity mi, then L(A) = P(A)(A — A1), where P(A) is a polynomial 
of degree n — 1 whose nulls are Ay, Ag, ..., Am with multiplicities m, — 1, m2, ..., Mm, respectively. Then the 
equation P[D]y = 0 has a solution of the form (4.6.4) according to the induction hypothesis. Therefore, the equation 
L{[D]y = P(D)(D— A1)y = (D— A1) P(D)y = 0 is equivalent to (D— A1)y = YD, e*** Q(x) by the induction hypothesis, 
where Q,(zx) are polynomials in x with degree m,—1 if k > 1 and degree m;—2 if k = 1. Since (D—A1)y = D (e"*y), 
we get the required result. 


Example 4.6.1: Let us consider the differential equation 
y” +4y' + 4y = 0. 


The characteristic equation \? + 4\ +4 = 0 has a repeated root \ = —2. Hence, we get one exponential solution 
yi(x) = e~ 7". Starting with y(x) = v(x)yi (x) = u(x) e~2”, we have 


y (x) = v' (x)e~?* — 2u(x)e** 


y" (x) = v" (x)e—** — 4u'(x)e~** + 4u(x)e~**. 
Then, 


y” + 4y! + 4y = v" (x)e~ 2” — 40’ (x)e~ 7” + 4u(x)e~2* + Av! (xe 2” 


— 8v(x)e~2” + 4u(x)e~2* = v" (x)e~** = 0. 


Therefore, v’ = 0 and its solution is u(x) = Cia + C2, Thus, the general solution of the equation becomes 


y(x) = (Ciz + C2)e**. 


In general, suppose that the linear differential operator D[D] in Eq. (4.6.1) has repeated factors; that is, it can be 
written as 
L[D] = 1p] D+)”, 


where /[D] is a linear operator of lower order, and m is a positive integer. This would lead us to the conclusion that 
any solution of the equation 
(D+7)"y=0 (4.6.5) 


is also a solution of Eq. (4.6.1). The corresponding characteristic equation L(\) = 0 would have a repeated factor 
(A+/7)™. Equation (4.6.5) would have an exponential solution 


le) =e. 
Calculations show that 

(D+ y)[a* 77] = kak} eW 7% — yar e- 7? 4 yar e-71® = ka} 7, 
Then, 


(D+ y)*[2* e—7] = «(D+ y)[2*1 e-7] = k(k — 1)a*-? e™. 
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Repeating the operation, we are led to the formula 


= k(k—-1)---(kK-n+l)e%=k®e%, ifn<k 
nik yr] _ ’ ’ 
(D+ 7)"[x" e \={ 0, eee (4.6.6) 


Here, k® = k(k—1)---(kK—n+1) abbreviates the n-th falling factorial. We know that the function e~7* is a solution 
of the equation (D + y)y = 0; therefore, for all n > k, 


(D+y)"|e* e-%]=0, &=0,1,2,...,(%—1). (4.6.7) 
From Eq. (4.6.7), we find the general solution for Eq. (4.6.5) 
y(x) = (C1 + Con +... + O,2" Ye-™ 
because functions e~7, xe~7, ..., «*~1 e~7” are linearly independent (Example 4.3.3, page 206). 
Example 4.6.2: Let us consider the differential equation 
(p* — 7D? + 18D? —20D+8)y=0, D=d/dz. 


Since the corresponding characteristic polynomial is L(A) = (A — 1)(A — 2)8, the operator can be factored as 


L{pD] = p* — 7D* + 18D? — 20D + 8 = (D— 1)(D — 2)? = (D— 2)3(D— 1) 


because constant coefficient linear differential operators commute. Then the general solution of the equation is a 
sum of two general solutions corresponding to (D — 1)y = 0 and (D — 2)°y = 0, respectively. Therefore, 


y(a) = (Ci + Cort C3x7)e~ 2” +Cye*. 


4.6.1 Reduction of Order 


So far we have discussed constant coefficient linear differential equations. In this subsection, we show that the 
Bernoulli method and Abel’s theorem are applicable to reduce the order of equations with variable coefficients. 
Let us consider (for simplicity) a second order linear differential equation 


y" + p(x) y' + a(x) y =0, (4.6.8) 


where p(x) and q(x) are continuous functions on the x-interval of interest. Suppose that one solution y:(x) of 
Eq. (4.6.8) is obtained by inspection (which is just a dodge to hide the fact that the process was one of trial and 
error). According to Bernoulli, we seek the second unknown member of the fundamental set of solutions, y2(a), in 
the product form 

yo(a) = v() yi (a). 
In order y; and yz to be linearly independent on some interval, the ratio y2/y; must be nonconstant. Differentiating 
y2 = vy twice with respect to x yields 


Y=Vyi troy, and yy =v"y, + Qv'y, + vyy. 
Substituting into Eq. (4.6.8) gives vy, + 2v’y, + vy! + p(x)(u'yi + vy) + qvyi = 0 or 
oy + 0'(2y, + pyr) + oy + py + gyi) = 0. 


The coefficient of v in this expression vanishes because y1(x) is a solution of Eq. (4.6.8). Therefore, the product 
y2 = vy. solves Eq. (4.6.8) provided v(x) satisfies 


vy, + v'(2y; + py) = 0, (4.6.9) 


which is a first order separable (and linear) differential equation for u = v’. Separating the variables in Eq. (4.6.9), 
we get 
/ 


we yt py Hh 
u Y1 Y1 


—p=-—2(Iny1)' —p. 
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Formal integration yields ; 
Inu = —2Iny; — i p(t) dt + nc), 


0 


wo exp {- iar}. 
Uy xo 


where C| is a constant of integration provided that y;(2) # 0 on some interval |a, b| (a < b). One more integration 
results in 


which, upon exponentiation, gives 


v= a fu exp{- v(t) ae} dz + Co, x € |a, bj. 
xo 


Since yg = vy,, another linearly independent solution becomes 


yo(x) = nla) f vio) exp \-f p(t) ar dz, yi(%) £0 on ja, dB]. (4.6.10) 

£0 
Actually, another linearly independent solution y2 can be found from Abel’s formula (4.2.3), page 201, and Exercise 12 
asks you to generalize it for n-th order equations. Let yi (2) be a known solution of Eq. (4.6.5), then for another 
solution we have Abel’s relation (4.2.3) 


x 


yiys — Yiy2 = Wo exp{- | p(t) ar} ' 


(0) 


where Wo is a constant. If ye is linearly independent from y,, then this constant Wo is not zero. The above 
differential equation can be solved with respect to yo using the integrating factor (2) = y,7(a) to obtain the exact 


equation 
3 82] - oof fwom 


Since we are looking for just one linearly independent solution, we can set Wo to be any number we want. For 
instance, we can choose Wp = 1, and the next integration leads us to the formula (4.6.10). The following result 
generalizes the reduction of order for n-th order equations. 


Theorem 4.18: If y;(x) is a known solution of the linear differential equation 
Gay ani" +>  ai@)y’ + aole)y = 0, 
which has the property that it never vanishes in the interval of definition of the differential equation, then the 
change of dependent variable y = v y; produces a linear differential equation of order n — 1 for v. 
Example 4.6.3: For a positive integer n, let us consider the differential equation 
zy’ —(x+n)y'+ny=0. (4.6.11) 


By inspection, we find an exponential solution, y; = e”. To determine another solution, we use the Bernoulli 
substitution y = v(x) e*. Substituting this function into Eq. (4.6.11) and solving the separable equation (4.6.9) for 
u = v', we obtain u = v'! = x" e~* and the fundamental set of solutions for Eq. (4.6.11) consists of 


yi =e" and Yo = ef ate dz. 


Let w = v"/v' = u'/u; we rewrite Eq. (4.6.9) in the following form: 


1 yl! 
yt a ( + =) yi = 0. 


Assuming w = v”/v' is known, we obtain the fundamental set of solutions for Eq. (4.6.8) given by 


Wiss {-5 [ow Lai) a} snl aioe {-5 [ow ig a} . (4.6.12) 
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We substitute y; and its derivatives 


y= 5 (p+) exp {5 | 2) +) ac} =-S@+u)n, 
y= -5 (p' + w') ex {-5 fol) +w) de} + ; (p+w)? ex {-5 fol) + w) a} 


into Eq. (4.6.8) to obtain the Riccati equation (see §2.6.2 for details) w? — 2w’ = 2p’ + p? — 4q. If we know its 
solution, the fundamental set of solutions of Eq. (4.6.8) is given by the formula (4.6.12). In what follows, the 
function D(x) = 2p'(x) + p?(x) — 4q(z) is called the discriminant of Eq. (4.6.8). From §2.6.2, we know that the 
Riccati equation 

w* — 2w'’ = D(z) (4.6.13) 


has a solution expressed in quadratures when D(x) is a function of special form. For example, when D(x) = k or 
D(x) = kx~?, where k is a constant, the Riccati equation (4.6.13) is explicitly integrable. It is also known (see 
§2.6.2) that if D(x) = ka” when n = (—4m)/(2m +1) for some positive integer m, then the Riccati equation can be 
solved in terms of standard functions. 

We consider first the case when D(x) is a constant k. Then Eq. (4.6.13) becomes separable 


2 


and its integration depends on the sign of the coefficient k. Since v’’/v’ = w, the next integration yields 


+cat 
ie eee er 
cg + c4 tan(ax) 


Cy + Cov , 
a aes 
ci + cp etVi ifk>0 


>) 
c3 + C4 ervk 


where 1, C2, €3, and cy, are real constants with cyc4 # c2c3. 
Now we consider the case when D(x) = ka~? for some constant k 4 0. The Riccati equation w? — 2w’ = ka~? 
: 2(y + 1)a? : F ; F 
has a solution w = 2 + — where ¥ is a root of the quadratic equation y? + 2y = k and C, is a constant of 
av 5 ee 
integration. Solving v”/v’ = w, we obtain 


v(a) =| ef w4e de = Cy In(C, — 27+), 


Example 4.6.4: Consider the linear equation 
y+ 2y' + ce y=0 
Ax? 


Since the discriminant is D(x) = 3x~?, the equation for w becomes w? — 2w’ = 3a~?. It has a solution w = 1/s, so, 
according to Eq. (4.6.12), one of the solutions is 


1 1 
n=e{-5 | (2+=) ar} = V2 e-®, 
2 x 


We find another linearly independent solution yz as a product y2 = v(x)yi(x), where v is a solution of Eq. (4.6.9), 
ie., v"”/v' = a27!. Integration gives v = 2? and the second linearly independent solution becomes 
3/2 .-2@ 


y2= u've 
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4.6.2 Euler’s Equations 
An equation with variable coefficients 
az*y"+bay! +cy =0, x>O0, (4.6.14) 
where a, b, and c are real numbers, is called an Euler’s equation (it is also known as the equidimensional equation). 
It is a particular case of more general equation 
tany™ +2” an_ry—-) +--+» Farry! + apy =0 (4.6.15) 


of n-th order that is considered in 88.1.1. We present two approaches to deriving the general solution of the Euler 
equation: one is just guessing a solution in the form y = x’, and another one is based on the change of independent 
variable t = Inz. These two approaches are closely related. 

The Euler equation can be reduced to an algebraic problem if we seek its solution in the form y = x", where r 
is a parameter to be determined. Upon differentiation, y! = ra’! and y” = r(r —1)a"~?, we get from Eq. (4.6.14) 
that 

az’ r(r—1)a"-? + bara” + +ca2" =0, 
or collecting similar terms 
ar(r—1)a2"+bra’+cax" =0, x>0. 


Factoring out x", we get the algebraic equation 
ar’? +(b—a)r+c=0. (4.6.16) 

If this quadratic equation has two real distinct roots r; and rz, then we have two linearly independent functions 
y= a" and yo(x) =a 


that form the fundamental set of solutions to Eq. (4.6.14). 
When equation (4.6.16) has either repeated roots or complex roots, it is more convenient to make substitution 
4 = dae a 1d 1 d? 
= 11 ZG: a SS —_—_ =- - oo —_ . 
dx «dt dz? xu? dt «x? dt? 
Then the Euler equation (4.6.14) is converted into constant coefficient differential equation with respect to variable 
bs 
ayt+(b-—a)y+cy=0. (4.6.17) 


Example 4.6.5: Solve the Euler equation 
ay" —xy' +5y=0, x>0. 


Solution. In the given differential equation, we make substitution t = Inz, which yields the constant coefficient 
equation: 

y—-2y+5y=0. 
The corresponding characteristic equation A? — 2 +5 = 0 has two complex conjugate roots \ = 1 + 2j. Therefore, 
this differential equation has the general solution 


y(t) = e° [Ci cos 2t + C2 sin 2t] . 


In original variables, we have 


y(x) = «[C, cos (2 Ina) + C2 sin (2 Inz)]. 


Problems| In all problems, D stands for the derivative operator, while D°, the identity operator, is omitted. The 


derivatives with respect to t are denoted by dots. 


1. The factored form of the characteristic equation for certain homogeneous differential equations is given. State the order 
of the differential equation and write down the form of its general solution. 


(a) (A-2)?; (b) (A- DA + 2)*; (c) (A? —2d+2)?; 
(d) iN + 4); (e) (A-—1)?(A+3)%; (f) (- 3)(A° +1)’; 
(g) (A?4+1)?; (hb) (? +9)0° —2d+4+2)?; (i) (A+1)Q074+4)°; 
(Gj) (A? 416)?;) (k) (A? 4+:1)8Q? — 4445); (I) (A? +6 413)?. 
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10. 


aS 


12. 


13. 


14. 


. Determine the values of the constants ao, a1, and a2 such that y(x) = ao + aix + a2x 


Write the general solution of the second order differential equation 
(a) yy" —6y'+9y=0; (b) yy" —8y'+16y=0; (ce) dy" — dy ty =0; 
(d) y” —2y' +y =0; (e) Wy” —10y’+y=0; (f) l6y” + 8y' +y =0; 
(g) (2D + 3)*y =0; (h) = (D+1)*y = 0; (i)  (3D—2)?y=0. 


Write the general solution of the differential equation of order larger than 2. 


(a) y — 6y'”+9y" =0; (b)  y'” — By" + 16y’=0; (ce) 4y"” + 4y" +-y' =0; 
(d) yy" —2y" +y' =0; (e) (D*+18D°+81)y=0; (f) 9y'" + 6y” +y' =0; 
(g) y+ 6y’"+9y”=0; (h) (0? +1)?y =0; (i) (°+D-6)*y=0; 
(j) gy) a 6y"”" ab y" = 0; (k) Ay!" au 12y/"" st 9y’ = 0; (1) y ae Ay!" =0 


Find the solution of the initial value problem for the second order equation. 


(a) oy” +4y'+4y=0, y(0)=1, y(0)=1.  (b) 4y”— 20y'+ 25y=0, y(0) =1, y‘(0) = 


(c) 4y”—4y’+y=0, yA)=0, y()=1. (d) 4y"4 28y'+49y=0, y(0) = 


. Find the solution of the initial value problem for the third order equation. 


(a) y!” +3y" + 3y' +y =0, y(0) = y'(0) =0, y(0) =2. 

(b) y+ y" — 5y’ + 3y =0, y(0) = 2, y(0) =5, y"(0) =8. 
(c) y” + 5y” + Ty’ + 3y =0, y(0) = 2, y/(0) =0, y”(0) = §. 
(d) y+ 2y” =0, y(0) = y"(0) =0, y'(0) = 4. 

(e) y!” —3y'-2y=0, y(0)=1, y(0) =0, y"(0) =8. 

(f) 8y'" — dy” — 2y’+y=0, y(0)=4, y/(0) =0, y’(0) =3. 


. Consider the initial value problems. 
(a) 25y"” — 20y' + 4y=0, y(0)=5, y(O)=b. (b) yy” — 12y' + 9y=0, (0) = 2, (0) =. 
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Find the solution as a function of b and then determine the critical value of 6 that separates negative solutions from 


those that are always positive. 


. Find the fundamental set of solutions for the following differential equations. 


(a) y"+3y + 4(941/0?)y=0; (db) y"— Ay! tan(2r)— 5y = 0, 
eae ae (a) y’ +44) ¥ +4 b)u=0 
(ec) y+ 2ay' + 2%y =0; (f) ay” + (2— 3a)y! + (2x — 4)y = 0. 


2 


differential equations 


(a) (14+927)y”=18y; (b) (44+27)y"=2y; (ce) (@+3)y" + (Qx —5)y’ — 4y = 0. 


Use the reduction of order technique to find a second linearly independent solution. 


(a) (l—2*)y"”—2ey'+2y=0, y=ae.  (b) y” —2ey'+2y9=0, mae. 
2 


. One solution y1 of the differential equation is given, find another linearly independent solution. 


(c) y’ +tanazy’ =6cot? ry, yi =sin? x. (4d) 2(¢%)+ 4 y=0, n= ae. 
Sy. DS dP, = , a er: " = a , = 

(e) (L+a°)y"=ay'+2y/2", y=a". (f) wy ty=(@—l)y, w=l—-s. 

(g) (w+ 2)y" = (30+ 2)y’+12y, yw =e. (hb) y”—4y'+42?y=0, ys =sin(2”). 


Find the discriminant of the given differential equation and solve it. 
(a) wy” +2y'+y/(4e)=0; (b)  2*y” +a(2—2)y=0. 


The Legendre polynomial Po(x) = 1 is clearly a solution of the differential equation (1 


a) y 


uw 


is a solution to the given 


2ry’ = 0. Find 


another linearly independent solution of this equation. 


The Chebyshev polynomial To() = 1 of the first kind is clearly a solution of the differential equation (1 - @) y'—-ry' = 


0. Find another linearly independent solution of this equation. 


The Fibonacci polynomial F\(x) = 1 is clearly a solution of the differential equation (4 
another linearly independent solution of this equation. 


3xy’ = 0. Find 


The Lucas polynomial L(x) = 1 is clearly a solution of the differential equation (4 + x”) y’ +axy’ =0. Find another 


linearly independent solution of this equation. 
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4.7 Nonhomogeneous Equations 


In this section, we turn our attention from homogeneous equations to nonhomogeneous linear differential equations: 


d” qr-l 
an(@) Sr + Ona) Sa +++ + a0(z)y(2) = F(2), (4.7.1) 
da” da” 
where f(x) and coefficients ap(x), ..., ao(%) are given real-valued functions of x in some interval. The associated 
(auxiliary) homogeneous equation is 
an(x)y™ + dn—1(x) y"Y +--+ + ao(x)y(z) = 0, (4.7.2) 


where y‘™ stands for the n-th derivative of y(x). It is convenient to use operator notation for the left-hand side 
expression in equations (4.7.1) and (4.7.2): L[a,D] or simply L, where 


L{z,D] = an(x)D” + an_1(x)D"—' + --- + ag(zr), (4.7.3) 


and D stands for the derivative operator D = d/dz. To find an integral of the equation L[x,Djy = f, we must 
determine a function of x such that, when L/x,D] operates on it, the result is f(x). The following theorem relates a 
nonhomogeneous differential equation to a homogeneous one and gives us a plan for solving Eq. (4.7.1). 


Theorem 4.19: The difference between two solutions of the nonhomogeneous equation (4.7.1) on some interval 
|a, b| is a solution of the homogeneous equation (4.7.2). 

The sum of a particular solution of the driven equation (4.7.1) on an interval |a,b| and a solution of the 
homogeneous equation (4.7.2) on |a, b| is a solution of the nonhomogeneous equation (4.7.1) on the same interval. 


When the differential operator L[x,D] acts on a function y, we write it as L[x,D]y or L[y] or simply 
Ly. Let y; and y2 be two solutions of the nonhomogeneous equation (4.7.1): 


Ly = f(x), Ly. = f(z). 
Since L is a linear operator (see Properties 1 and 2 on page 190), we obtain for their difference 
Llyn — yo] = Ly — Lye = f(x) — f(z) = 0. 


Similarly, for any particular solution Y(a) of Eq. (4.7.1) and for a solution y(x) of the homogeneous equation 
(4.7.2), we have 
LY +y] = L[Y] + Lily] = f(z) +0 = f(z). 


Theorem 4.20: A general solution of the nonhomogeneous equation (4.7.1) on some open interval (a, b) can 
be written in the form 


y(x) = yr(&) + yp(z), (4.7.4) 
where 
Yn(x) = Crys (x) + C2y2(x) +++» + Cryn(2) 
is the general solution of the associated homogeneous equation (4.7.2) on (a,b), which is frequently referred to 
as a complementary function, and yp(z) is a particular solution of Eq. (4.7.1) on (a,b). Here C; represents 


an arbitrary constant for 7 = 1,2,...,n, and {yi(x), y2(x),-...,Yn(x)} is the fundamental set of solutions of 
Eq. (4.7.2). 


This theorem is a simple corollary of the preceding theorem. 


Theorem 4.21: [Superposition Principle for nonhomogeneous equations] The general solution of the 
differential equation 
Lx, Diy = file) + fo(z) +--+ fm(z), D=d/de, 
on an interval |a, b| is 
y(@) = yr(@) + Ypi(@) + Ypa(@) + +++ + Ypm(@), 
where L|z, D] is a linear differential operator (4.7.3), ypj;(a), 7 =1,2,...,m, are particular solutions of L[x, Djy = 
f;(z) on ja, b|, and y,(x) is a general solution of the homogeneous equation L[x,D]y;, = 0. 
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This follows from the fact that LC is a linear differential operator. a) 

Now we illustrate one of the remarkable properties of linear differential equations that, in some cases, it is possible 
to find the general solution of a nonhomogeneous equation Ly = f without knowing all the solutions of the associated 
homogeneous equation Ly = 0. Consider a nonhomogeneous second order linear differential equation 


y” + p(x)y' + q(a)y = f(x) (4.7.5) 


on some interval where the coefficients p(x), g(a) and the forcing function f(x) are continuous. Suppose that we 
know one solution yi(z) of the associated homogeneous equation y” + p(x)y’ + q(x)y = 0. Following Bernoulli, 
we seek a solution of Eq. (4.7.5) as a product y = u(x) yi(a#), with some yet unknown function u(x). Substituting 
y = uy, into Eq. (4.7.5) yields 

yu" + (24, +pyi)ul = f, 


which is a first order linear equation in u’. Therefore, it can be solved explicitly. Once wu’ has been obtained, u(x) 
is determined by integration. 


Example 4.7.1: Find the general solution of 2x?y” + 2xy’ — 2y = 4x? on the interval (0, 00). 

Solution. The associated homogeneous equation x?y” + 2xy’ — 2y = 0 clearly has y; = x as a solution. To find 
the general solution of the given nonhomogeneous equation, we substitute y = u(x) 2, with a function u(x) to be 
determined, into the given nonhomogeneous equation. Using the product rule of differentiation, we obtain 


x? (xu” + Qu’) + 2a (xu' + u) — Qau = 427, 


: d 
which reduces to x°u’ + 4a?u! = 4a. Since the latter becomes exact upon multiplication by «, we get — (x*u’) = 
2 dx 


4°. The next integration of xt u’ = «+ + C, yields the general solution u = « + C, x~* + Cy; multiplying it by 
Y1 =x, we obtain: y= 22 +C,a-2+Coa. 


4.7.1 The Annihilator 


The relation between homogeneous (4.7.2) and nonhomogeneous differential equation (4.7.1) is better understood 
when the language of differential operators is involved. Upon introducing the derivative operator D = d/dz, a 
differential equation (4.7.1) can be written in compact form L[x,D] y(x) = f(a). Correspondingly, the associated 
homogeneous equation (4.7.2) has the form L[x,D] y(x) = 0. It is natural to introduce the following definition. 


Definition 4.11: The linear differential operator L[z,D] is said to annihilate a function y(x) if 


Lx, D] y(z) =0 


for all x where the function y(a) is defined. In this case, L[x,D] is called an annihilator of y(z). 


Note that the annihilator is not unique and can be multiplied (from left) by any linear differential operator. For 
instance, if L is an annihilator for y(x), then LL = L? is also an annihilator of the function y(z). 

Now we turn our attention to constant coefficient linear differential operators, written as L[D] = AnD" +an—1D"~1+ 
-»-+4@ D+ ap. It is a custom to drop the identity operator D° and write the coefficient ap alone. From the equation 
De” = de**, it follows that D’e*” = \” e*” and 


Libje** = Lije*. (4.7.6) 


Eq. (4.7.6) shows that D—7 annihilates e?”. Therefore, if the operator L[D] annihilates the exponential function e7”, 
then L[D] has a factor (D — y). This means that its characteristic polynomial L(A) is divisible by A — +. 

A solution of the constant coefficient equation L[D] y = 0 contains a term like x”, with n = 0,1,2,..., only 
when the operator L[D] has a factor D”*!. Similarly, solutions of homogeneous equations with constant coefficients 
comprise terms such as e® cos Bx or e®” sin 8x only when the operator L[D] has a factor ((D — a)? + 87). Say for 
the function e** sin Bx, we have 


[(D — a)? + 8] e®* sin Ba = (D? — 2aD + a? + 87] e™ sin Bz. 
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Substitution yields 
[(D — a)? + 67] e®” sin Bx = De® sin Bx — 2aDe™ sin Bx + (a? + 6") e® sin Bax 
= (a? — 8”) e™ sin Bx + 2aBe™ cos Br 


— 2a*e sin Bx — 2aBe%* cos Bx + (a? + 6B?) e% sin Bx = 0. 


Let Pr(x) = pex® + pp_yx*-1 +--+ + po and Qz(x) = qea® + q,—12*—-! + ---+ 40 be polynomials in x of order k 
(p? + qZ > 0), then the function 


P, (x) e® cos Bx + Q;(x)e** sin Bx, pz +qz > 0, (4.7.7) 
is a solution of the constant-coefficient differential equation 
L[D] f(x) =0, D=d/dz, 
if and only if its characteristic polynomial L(\) has a multiple [(A — a)? + 6?]**1, that is, 
L(A) = [(A— a)? + BF" 11), 


where L1(A) is a polynomial of order n — 2k — 2 > 0. Now we consider a nonhomogeneous differential equation with 
constant coefficients 

L [D] y(«) = [a,D” ale ipcao ea a,D + ao] y(x) = f(z), (4.7.8) 
assuming that we know an annihilator ~[D] of the driving term f(x), that is, w [D] f(a) = 0 for all x. Then the driven 


equation (4.7.8) can be reduced to a homogeneous equation, as the following statements assure us. 


Theorem 4.22: Let p, g, and L be polynomials such that L(A) = p(A)q(A) and ¢ is relatively prime to 7, the 
annihilator of the forcing function f(a) in the right-hand side of Eq. (4.7.8). Then there exists a solution of the 
differential equation L[D]y = f, which is also a solution of 


yD] LD] y(x) = ¥ [D] f(x) = 0. (4.7.9) 


Such a solution can be obtained by applying a polynomial differential operator to any solution u of the equation 
p[Dju = f. 


Since g and w are relatively prime, there exist polynomials h and g such that 
gqt+ hy =1. (4.7.10) 


Now, let wu be any solution of the equation p[D]u = f and set y = g[D]u. To complete the proof, we need only to 
show that y is a solution of both (4.7.9) and (4.7.8). First, we multiply (4.7.10) by p and apply the corresponding 
differential operators to the function u. This gives 


g(D] q[D] p[Dju + h[D] b[D] pDju = pu, 
LD] gDjut+h[D] oP] pPju = plDlu, — since g[Dju = y, 
Lply = f, by WP]f=0 and pPju=f. 


Thus, y is a solution of Eq. (4.7.8) and we have 
YP] pDly = Y[D] PP] gD\u = gD] yD] f = 0 


since p[D]u = f and w[D]f =0. So the equation (4.7.9) is also fulfilled. 


Corollary 4.4: If polynomials L and w are relatively prime and 7(D) is an annihilator of f, then a solution of 
the equation L[D] y = f can be obtained by applying a polynomial differential operator to f. 
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Now we look into an opposite direction: given a set of linearly independent functions {y1, y2,..-,Yn}, find the 
linear differential operator of minimum order that annihilates these functions. We are not going to treat this problem 
in general (for two functions y1, y2, the answer was given in §4.2); instead, we consider only constant coefficient 
differential equations: 

LDly=f or any +anry 2) +---+ary! +a0y=f. (4.7.11) 


As shown in §§4.4—4.6, a solution of a linear homogeneous differential equation L[D|y = 0, for some linear constant 
coefficient differential operator L, is one of the following functions: exponential, sine, cosine, polynomial, or sums or 
products of such functions. That is, a solution can be a finite sum of the functions of the form (4.7.7). 

We assign to a function of the form (4.7.7) a complex number o = a + jf, called the control number. Its 
complex conjugate o = a — jG is also a control number. Actually, the control number of the function (4.7.7) is a 
root of the characteristic equation ~(A) = 0 associated with the annihilator ~[D] of f(x). Since we consider only 
differential equations with constant real coefficients, the complex roots (if any) of the corresponding characteristic 
equation may appear only in pairs with their complex conjugates. 

The control number guides us about the form of a solution for an equation (4.7.11). Here f(a) has a dual role. 
It is not just the forcing function of our nonhomogeneous differential equation L[D] y = f, but it is also a solution to 
some unspecified, homogeneous differential equation with constant coefficients w[D] f = 0. We use f(a) to deduce 
the roots (control numbers) of the characteristic equation w(o) = 0 corresponding to our unspecified, homogeneous 
differential equation w[D] f = 0. This gives us the form of a particular solution for the nonhomogeneous differential 
equation whose solution is what we are really after. Such an approach is realized in §4.7.2. 

Before presenting examples of control numbers, we recall the definition of the multiplicity. Let L(A) = anA” + 
an—1A\"~1 +++++ a9 be a polynomial or entire function and Xo is its null, that is, a root of the equation L(A) = 0. 
The multiplicity of this null (or the root of the equation L(A) = 0) is the number of its first derivative at which it 
is not equal to zero; that is, Ao is a null of the multiplicity m if and only if 


L(Ao) =0, L'(Ao) = 0,...,L°"-Y(Ag) = 0, but L°) (Ag) £ 0. 


From the fundamental theorem of algebra, it follows that a polynomial of degree n has n (generally speaking, 
complex) nulls, Az, A2,-+- , An, some of which may be equal. If all coefficients of a polynomial are real numbers, the 
complex nulls (if any) appear in pairs with their complex conjugate numbers. The characteristic polynomial can be 
written in the product form 

L(A) = an(A — Ar) (A = AQ)" (A= Ag)”, 


where Aj, A2,...,Ax are distinct roots of the equation L(\) = 0. The power m, in this representation is called the 
multiplicity of zero (or null) A; (j =1,2,...,&). 


Example 4.7.2: Find the control number of the given functions. 
(a) f(z)=241, (b) f(z) =a? +2e4+j (©) f(a) =e; 
(d) f(x) =2e4*; (e) f(x)=e-* cos2z; (f) f(x) = (x4 2)e* sin2z. 
Solution. In case (a), the function f(a) is a constant; therefore the control number is zero. 
(b) The function f(x) is a polynomial of the second order; therefore, the control number gd = 0. (c) o = —1. 
(d) The control number for the function 


e 73” — cos 2a —jsin2x2 (Euler’s formula) 
is o = —2j (or 2j, it does not matter because both are complex conjugate control numbers). 
(e) The control number for the function e~* cos 2” is 0 = —1 + 2j or o = —1 — 2j (it does not matter which sign is 


chosen). (f) The control number is o = 1 + 2j (or 1 — 2j). 


Example 4.7.3: Find a linear differential operator that annihilates the function x? e*” sin z. 


Solution. Since the control number of the given function is 3 + j, the operator that annihilates e®” sin x is 
(D —3)?+1=D? —-6D+10D° = D* —6D +10. 


The multiple x? indicates that we need to apply the above operator three times, and the required differential operator 
is 

~[D] = [(D — 3)? + I). 
To check the answer, we ask Maple for help. First, we verify that [(D — 3)? + 1]? = D* — 12D? + 56D? — 120D + 100 
does not annihilate the given function: 
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f :=x->x*x*exp (3*x)*sin(x) ; 
diff (f(x) ,x$4)-12*diff (f(x) ,x$3)+56*diff (f(x) ,x$2)-120*diff (f(x) ,x)+100*f (x) 


Since the answer is —8 e®” sin, we conclude that [(D — 3)? + 1]° annihilates the given function. 


Example 4.7.4: Find the linear homogeneous differential equation of minimum order having the solution y = 
C; cosx+ Cp sinz + C3x, where C), C2, C3 are arbitrary constants. 

Solution. To answer the question, we ask Mathematica for help. 
expr = (y[x] == C1 Cos[x]+C2 Sin[x]+C3 x +C4) 
equations=Table[D[expr,{x,k}],{k,0,4}] (* evaluate 4 derivatives *) 
the4deriv=Simplify[Solve[equations, y’’’[x], {C1,C2,C3,C4}]] 
de[x_, y_] = (y’’??? [x] == (y’’?? [x] /. the4deriv[[1]])) 


This gives the required equation y + y = 0 of fourth order. 


4.7.2 The Method of Undetermined Coefficients 


We know from previous sections 4.4—4.6 how to find the general solution of a homogeneous equation with constant 
coefficients. The task of this subsection is to discuss methods for finding a particular solution of a nonhomogeneous 
equation. Various methods for obtaining a particular solution of Eq. (4.7.1) are known; some are more complicated 
than others. A simple technique of particular practical interest is called the method of undetermined coeffi- 
cients. In this subsection, we present a detailed algorithm of the method and various examples of its application. 


The method of undetermined coefficients can only be applied to linear differential equations with real constant 
coefficients: 
any + iggy? +++-+ay, y’ + aoy = f(x), (4.7.11) 


when the nonhomogeneous term f(x) is of special form. It is assumed that f(a) is, in turn, a solution of some 
homogeneous differential equation with constant coefficients w[D|f = 0 for some linear differential operator wD]. 
Namely, the function f(a) should be of the special form (4.7.7), page 226, for the method of undetermined coefficients 
to be applicable. 

The idea of the method of undetermined coefficients becomes crystal clear when the language of differential 
operators is used. Denoting by L[D]y the left-hand side expression in Eq. (4.7.11), we rewrite it as L[D]y = f. If f 
is a solution of another linear constant coefficient homogeneous differential equation w[D]f = 0, then applying w to 
both sides of Eq. (4.7.11), we obtain 


v[D] L[Dly = YD] f = 0. (4.7.12) 


Since wD] L[D]y = 0 is a homogeneous linear differential equation with constant coefficients, its general solution has 
a standard form (Theorem 4.17, page 217) 


y(x“) = dee" Pilz); (4.7.13) 


where P;(x) are polynomials in x with arbitrary coefficients and A, are roots of the algebraic equation W(A) L(A) = 0. 
Now we extract from the expression (4.7.13) those terms that are included in the general solution of the homogeneous 
solution L[D] y = 0. The rest contains only terms corresponding to the roots of the equation (a) = 0: 


Y = de?" Qi(@), 
j 


which is your particular solution. To ensure that y, is a required solution, we determine the coefficients of Q,(x) 
from the equation 


ED] Se! Qj(z) = F(). 


Before presenting specific examples, let us review the rules for the method of undetermined coefficients. 
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Superposition Rule. If the driving term f(a) in Eq. (4.7.11) can be broken down into the sum f(x) = fi(x) + 
fo(z) +--+: + fm(a), where each term f;(x), 7 =1,2,...,m, is a function of the form (4.7.7), apply the rules 
described below to obtain a particular solution y,;(z) of the nonhomogeneous equation with each forcing term 
f;(z). Then the sum of these particular solutions y,(x) = ypi(@) +--+ + Ypm(x) is a particular solution of 
Eq. (4.7.11) for f(x) = fi(x) + fo(x) +--+ + fin (2). 


Basic Rule: When the control number is not a root of the characteristic equation for the associated homogeneous 
equation. If f(a) in Eq. (4.7.11) is the function of the form (4.7.7), page 226, and its control number is not a 
root of the characteristic equation L(A) = 0, choose the corresponding particular solution in the same form 
as f(x). In particular, if f(a) is one of the functions in the first column in Table 229 times a constant, and 
it is not a solution of the homogeneous equation L[D]y = 0, choose the corresponding function y, in the third 
column as a particular solution of Eq. (4.7.11) and determine its coefficients (which are denoted by Cs and 
Ks) by substituting y, and its derivatives into Eq. (4.7.11). 


Modification Rule: When the contro] number of the right-hand side function in Eq. (4.7.11) is a root of multiplic- 
ity r of the characteristic equation, choose the corresponding particular solution in the same form multiplied by 
x’. In other words, if a term in your choice for y, happens to be a solution of the corresponding homogeneous 
equation L[D]y = 0, then multiply your choice of y, by x to the power that indicates the multiplicity of the 
corresponding root of the characteristic equation. 


Table 229: Method of undetermined coefficients. o is the control number of the function in the first column. All C's 
and is in the last column are constants. 


Termin f(z) | o || Choice for y,(x) 
e Ce’ 
a” (n= Olde) Cra” + Ona" t+ +--+ Cla +Co 
x” er (Cy2z™ + Cp_12" 1 +--+ +Cyx + Coe 
cos ax j C;, cosax + Ko sinax 
sin ax j C, cosax + Ko sinax 
x” cosax j (Cyx” 4 = + Cia + Co) cosaaz+ 
+(Ky2” rm ---4+ Kya + Ko) sinax 
x” sinax j (C,x” 4 _ + Cra + Co) cosaxz+ 
+(Kya"+ Ky-12"* +-+-+ Kia + Ko) sinax 
e* cos Bx j e°* (C1 cos Bx + K2 sin Bx) 
e° sin Bx j e°* (C1 cos Bx + K2 sin Bx) 

x” e& cos Ba j e® cos Ba(Cy2z” + Cp_12™ 1 +--+ + Cx +Co)+ 
+e sin B2(Kya" + Kye" 1 +--+» + Kiz+ Ko) 
M 2% sin Bar j e®” cos Ba(Cyz” + Cp_1a™ 1 +++» + Cx +Cpo) 

+e% sin Bz(Kn2x" + Kn_i2" 14+---+ Kix + Ko) 


x 


The method of undetermined coefficients allows us to guess a particular solution of a specific form with the 
coefficients left unspecified. If we cannot determine the coefficients, this means that there is no solution of the form 
that we assumed. In this case, we may either modify the initial assumption or choose another method. 

Remark. The method of undetermined coefficients works well for driving terms of the form 


f(a) = Py(a) e®” cos(Ba + y) + Qz (x) e* sin(Bx + vy). 


Such functions often occur in electrical networks. | 
Let us consider in detail the differential equation 


a he ba ape e™ (4.7.14) 


where h and o are constants. If o is not a root of the characteristic equation L(,) SF an A” +On—1A7 1 +: -+ao = 0, 
then we are looking for a particular solution of Eq. (4.7.14) in the form 


y(«) = Ae” 
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with a constant A to be determined later. Since 


we have 
L[D] e°* = L(o)e** with D=d/dz. 


Hence, A = h/L(oc) and a particular solution of Eq. (4.7.14) is 


If o is a simple root of the characteristic equation L(A) = 0, then Table 229 suggests the choice for a particular 
solution as y,(x) = Axe®”. Its derivatives are 


y(t) = Ae™+oy,(z), 
yp(t) = Age? +oy) (x) =2Ace™ +07 yp(z), 
yp (z) = 2A eer Le y,(x) = 3A oe +07 y,(z), 
and so on; 
y)\(z) = nAo™*e +0" yp(z). 


Substitution of y,(x) and its derivatives into Eq. (4.7.14) yields 


A [an nAo”—! e7® + ano” Yp + Qn—1(n — 1)o”? e°* + a,-10"* Yp tert aoyp| 


= AL(o)y,(x) + A [an nAo”"! + an—i(n — 1)o"? +--+ +.a)] e 


d 

— AL(a)yp(z) +A (= x(0)) ect = AL'(a) ef* = her 
ol 

since L(o) = 0 and L'(c) 4 0. From the latter relation, we determine A to be h/L’[o], and a particular solution 


becomes 
hx 


Yp(2) = L'(c) € 


ort 


In general, if o is a root of the characteristic equation of multiplicity r, then a particular solution of Eq. (4.7.14) 
is 
hao 
Yp(x) = Lo) e* (4.7.15) 


This follows from the product rule 


dé a ae ; k k! 
sae ea) =F (Suen @), (0) = Ee 
and the equation 
L[D]x” e?* = L) (oc) e7”. | 


Now we present various examples to clarify the application of this method. 
Example 4.7.5: (Basic rule) Find a particular solution of the nonhomogeneous differential equation 
LD] y = (D? - 2D —3) y = 3e”” or y” — 2y' — 3y = 3e*”. 
Solution. The characteristic equation corresponding to the homogeneous equation L[D] y = 0, 
M-— 21-3 =0, 


has two real roots Ay = —1 and Ag = 3. The associated homogeneous equation has the general solution y, = 
Cyie~* + Cy e?* with some arbitrary constants C, and C2. Thus, the function f(x) = 3e?* is not a solution of 
the homogeneous differential equation—its control number, 0 = 2, does not match the roots of the characteristic 
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equation. Therefore, we are looking for a particular solution in the form y(x) = Ae?*, with unknown coefficient A. 
We find two derivatives of y,(x) that are used in the equation so that y’/(x) = 2Ae?* = 2y,, y”(x) = 4Ac?” = 4yp. 
Now we substitute the functions into the original equation to obtain 


A(4—2-2-—3)e”" = 3e*, 
Canceling out the exponent, we get —3A = 3, hence A = —1. Our general solution is, therefore, 
y(z) = —e?* + Che"? + Coe. 


We can find a particular solution of the given differential equation using Eq. (4.7.15). In our case, L[D] = 
D? — 2D —3, r =0 (since \ = 2 is not a root of the characteristic equation) and L(2) = —3. Therefore 


3 x 3 x Hi 
uel) = Tye aa “ 


=>—-€ 


Example 4.7.6: (Basic rule) Find a particular solution of the differential equation 
y” — 2y' — 8y = —32? + 2/3. 


Solution. The polynomial on the right-hand side is not a solution of the homogeneous equation y’’ — 2y’ — 3y = 0 
because its control number o = 0 is not a root of the characteristic equation \? — 2\ — 3 = 0. Since the right-hand 
side function —3x? +2/3 in the given differential equation is a polynomial of the second degree, we guess a particular 
solution in the same form, namely, as a quadratic function 


Yp(t) = Ax? + Br +C. 
Substituting the function y,(x) and its derivatives into the differential equation, we obtain 
2A — 2(2Ar + B) — 3(Ax? + Br + C) = —32? + 2/3. 


Equating the coefficients of the like powers of x, we have 


3A = -3, 4A—3B=0, 2A-2B-3C = 2/3. 


The solution of the last system of algebraic equations is A = 1, B = —C = —4/3. Thus, a particular solution of the 
nonhomogeneous differential equation has the form 


4 


Yp(x) = a g(t —1). 


Example 4.7.7: (Basic rule) Find a particular solution of the differential equation 
y” — 2y' — 3y = —65 cos 2x 


and determine the coefficients in this particular solution. 

Solution. The characteristic equation \? — 2\ — 3 = 0 corresponding to the homogeneous differential equation 
y” — 2y’ — 3y = 0 has two real roots A; = —1 and Ag = 3. The control number of the function f(x) = —65 cos 22 is 
o = +2j. Therefore, the right-hand side is not a solution of the homogeneous differential equation, so we choose the 
function 


Yp(x) = Acos 2a + Bsin 2x 


as a particular solution with undetermined coefficients A and B. Its derivatives are 
yp(v) = —2A sin2x + 2B cos2x, y),(x) = —4A cos 2a — 4B sin 2x = —4yp. 
Substituting the function y, into the given differential equation L[D] y = —65 cos 2z, we obtain 


4Asin2x — 4Bcos2x — 7Acos2a — 7Bsin 2x = —65 cos 2z. 
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Collecting similar terms, we get 
(4A — 7B) sin 2x — (4B + 7A) cos 2a = —65 cos 22. 


Functions sin2a and cos2z are linearly independent because they are solutions of the same differential equation 
y” +4y = 0 and their Wronskian equals 1, see Theorem 4.9 on page 201. Recall that two functions f(a) and g(x) are 
linearly dependent if and only if f(a) is a constant multiple of the other function g(x). Therefore, we must equate 
the corresponding coefficients to zero. This leads us to 


44—7B=0, 4B+7A=65. 
Solving for A and B, we obtain the particular solution to be 
Yp(x) = 7 cos 2x + 4sin 2a. 


We can derive the same result directly from Eq. (4.7.15). Since cos 2 is the real part of e743”, namely, cos 22 = Re¥*, 
a particular solution of the given differential equation is the real part of its complex solution: 


—65 94 —65 . 
xv) = R—— e2se =k ese 
wee) L(2j) (2)? — 2(2j) — 3 
Since 
— 65 85 HC CH(T=4j)_ 
we have 


Yp(x) = R(7 — 4j) eF* = R(7 — 4j) (cos 2x + jsin 2x) = 7cos 2x + 4sin 2z. 


Example 4.7.8: (Basic rule) Find a particular solution of the differential equation 
y — 2y' — 3y = 652 sin 2x — 7 cos 2z. 
Solution. Table 229 suggests the choice for a particular solution 
Yp(x) = (Ax + B) sin 2x + (Cx + D) cos 2x 


with coefficients A, B, C, and D to be determined because the control number of the right-hand side function 


go = +2j does not match the roots of the characteristic equation. The first two derivatives of the function y, are 
y,(z) = Asin 2a +2(Ax + B) cos 2x + C cos 2x — 2(Cx + D) sin 2x 
= (A-2Cz—2D)sin2z 4+ (2Ar +2B+4C) cos 22, 
y,() = —2C sin 2x + 2(A— 2Cax — 2D) cos2x + 2A cos 2x — 2(2Ar + 2B +C) sin 2x 


= —4(Ar+B+4+C) sin2a + 4(A— Ca — D) cos 2a. 
Substituting into the nonhomogeneous equation L[D] y = 65a sin 2x — 7 cos 2a yields 


L[D] y = —4(Ax + B+C) sin 2x + 4(A — Ca — D) cos 2x — 2(A — 2C'x — 2D) sin 2x 
— 2(2Ar + 2B+4C) cos 2x — 3(Ax + B) sin 2x — 3(Cx + D) cos 2x 


= 65x sin 2x — 7 cos 2x 
or, after collecting similar terms, 


LD] y = —|[(24 + 7B + 4C — 4D) + «(7A — 4C)| sin 2x 
— [(-44+4B+2C + 7D) + 2(7C + 4A)] cos 2a = 65x sin 2x — 7 cos 2x. 


Equating the coefficients of sin 22 and cos 2x, we obtain 


—(2A+7B +4C — 4D) — (7A — 4C) = 652, 
(-444+4B+2C4+7D)+2(7C +4A) =7. 
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Two polynomials are equal if and only if their coefficients of like powers of x coincide, so 
4C —7TA=65, 2A+7B+4C-—-4D=0, 4A4+7C=0, -4A4+4B4+2C04+7D=7. 
Solving for A, B, C, and D, we determine a particular solution 
Yp(x) = —(2 + 7x) sin 2x + (4x — 3) cos 2x. 


The tedious problem of coefficient determination can be solved using Mathematica: 

eq = y’’[x] - 2 y’[x] - 3 y[x] == 65*x*Sin[2 x] - 7*Cos[2 x] 

assume[x_] = (a*x + b)*Sin[2 x] + (c*#x + d)*Cos[2 x] 

seq = eq /.{y-> assume} 

system = Thread[((Coefficient[#1, {Cos[2 x], x*Cos[2 x], Sin[ 2 x], x*Sin[2 x]}]) &) /@ seq] 
cffs = Solve[system, fa, b, c, d}] 

assume[x] /. cffs 


Example 4.7.9: (Modification rule) Compute a particular solution of the differential equation 


y” — 2y' — 3y=4e”. 
Solution. As we know from Example 4.7.5, the right-hand side term 4e~* is a solution to the corresponding 
homogeneous equation. Therefore, we try the function y,(«) = Ave~* as a solution because A = —1 is a simple root 
of the characteristic equation \? — 2\ — 3 = 0. Its derivatives are 


Yp = Ae *—Ave™”, yo =—-2Ae*+ Aze™. 
We substitute the function y,(z) = Ax e~* and its derivatives into the differential equation to obtain 


A[-2+4—2+2x—- 32] e7” =4e™*. 


—x 


The xe~*-terms cancel each other out, and —44e~” = 4e-* remains. Hence, A = —1 and a particular solution 


becomes y,(x) = —xe~”. 


If we want to obtain a particular solution from Eq. (4.7.15), we set L(A) = \? — 2A—3. Since \ = —1 is a simple 
root (of multiplicity 1) of L(A) = 0, we set r = 1 in formula (4.7.15) and, therefore, 


L)(A) = L(A) = 24-2 = 2(A- 1). 


From Eq. (4.7.15), it follows that 
Aa 


Yp(x) = ey 


Example 4.7.10: (Modification rule) Find a particular solution of the differential equation 
zx 


y” — 2y' —3y=4ae”. 


Solution. It is easy to check that o = —1, the control number of the forcing function 4% e~”, is the root of the 
characteristic equation \? — 2A — 3 = (A — 3)(A+ 1) =0. Table 229 suggests choosing a particular solution as 


Yp(z) = a(A+ Ba)e™, 
where A and B are undetermined coefficients. The derivatives are 


Yp = —(Ar+ Bx*\e* + (A+ 2Br)e*, 
yy = (Ax + Ba?) e~* — 2(A+ 2Bar)e"* + 2Be™. 


Substituting y, and its derivatives into the equation, we obtain 


(2B —4A—8Bxr)e* =4re™. 
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Multiplying both sides by e”, we get the algebraic equation 
2B—4A—8Ba = 42. 
Equating like power terms, we obtain the system of algebraic equations 
2B-—4A=0, -8Bar = 4a. 


Its solution is B = —1/2 and A = —1/4. Hence, 


vp(#) =—F(1+2x)e-*. 


Example 4.7.11: (Modification rule) Find the general solution of the differential equation 
y” — Ay! + 4y = 2€?". 


Solution. The corresponding characteristic equation (A — 2)? = 0 has the double root \ = 2 that coincides with the 
control number o = 2 of the forcing function 2 e?*. Hence, the general solution of the corresponding homogeneous 
equation is y,(x) = (Cy + Cox) e?”, with arbitrary constants C; and C2. As a particular solution, we choose the 
function 

itn) = Age. 


The power 2 in the multiplier x? appears because of the repeated root \ = 2 of the characteristic equation. Substi- 
tuting this into the nonhomogeneous equation, we obtain 


o. _ Ay’, + 4y, = 2A ee, 


where 
Yp(e) = 2Age?” + 2Aa? e?” = 2Axe* + yp(x), 
yp(v) = 2A? + 4Are® + dy! (x) = 2Ac™ + 8Are*® + Ayp(z). 


The expression y/’ — 4y/, + 4y, should be equal to 2 e?”. Therefore, A = 1 and the general solution is the sum 
P P P 


y(x) = Yp(x) + yn (x) = 2? e?* + (Cy + Cox) €?*. 
We can obtain the same result from Eq. (4.7.15), page 230. In our case, the characteristic polynomial L(\) = 
d? — 4. + 4 = (A— 1)? has double root A = 2. Therefore, r = 2 and L‘")(\) = L’(X) = 2. From Eq. (4.7.15), it 
follows 7 
20° on 2 920 


Y(t) = Tippy ° =e 


Example 4.7.12: (Modification rule and superposition rule) Find a particular solution of 


L[Dly = y” — 2y' + 10y = 9e* + 26 sin 2x + 6e® cos 3a. 


Solution. The right-hand side function is the sum of three functions having the control numbers 1, +2j, and 1+ 3j, 
respectively. Thus, we split up the equation as follows: 


y” — 2y' + 10y = 9e", or L[D]y = 9e”, (a) 
y” — 2y’ + 10y = 26sin 22, or L[D]y = 26 sin 22, (b) 
y” — 2y' + 10y = Ge” cos 32, or L[D|y = 6e* cos 32. (c) 


The nonhomogeneous terms of the first two equations (a) and (b) are not solutions of the corresponding homogeneous 
equation y” — 2y’ + 10y = 0 because their control numbers o = 1 and o = 2j do not match the roots \1,2 = 1+ 3j 
of the characteristic equation \? — 2\ +10 = 0, but the latter one does. 
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We seek a particular solution of the equation (a) in the form 
yi(a) = Ae”, 
where the coefficient A is yet to be determined. To find A we calculate 
vile) =Ae*=yi(x), Yi (@) = (2) 
and substitute for yi, y,, and y// into the equation (a). The result is 
yi — 2y, + 10y, = 9e”, or 9Ae” = Ye”. 


Hence, A = 1 and y;() = e® is a particular solution of the equation (a). The value of the constant A can also be 
obtained from the formula (4.7.15) on page 230. 


Table 229 suggests the choice for a particular solution of the equation (b) to be 
yo(x) = Asin 2x + Bcos2z, 
where the coefficients A and B are to be determined. Then 
y5(x) = 2Acos2z—2Bsin2zx, ys(x) = —4Asin2x — 4B cos 2x = —4y2(z). 
Upon substitution these expressions for y2, ys, and yy into the equation (b) and collecting terms, we have 


L[D]y = —2(2A cos 2a — 2B sin 2x) + 6(A sin 2% + B cos 2x) 
= (6B — 4A) cos2x + (644 4B) sin 2a = 26sin 2a. 


The last equation is satisfied if we match the coefficients of sin 2x and cos 2x on each side of the equation 
6B—4A=0, 6A+4B=26. 
Hence, B = 2 and A = 3, so a particular solution of the equation (b) becomes 
yo(a) = 3sin 2x + 2 cos 22. 
To find a particular solution y3 of the equation (c) we assume that 
y3(x) = «(Acos3a + Bsin3z) e”, 
where coefficients A and B are to be determined numerically. We calculate the derivatives y, and yJ to obtain 
y3(x) = (Acos3z + Bsin 3x) e* + 32(—Asin3x + Bcos3z) e* + y3; 
y3(z) = 6(—Asin3xz + Bcos 3a) e* + (Acos3z + Bsin3z) e* — 
—9x(A cos 3x + Bsin 3x) e* + 3z(—Asin 3x + Bcos3z) e® + y3(z) 


= 6(—Asin3z + Bcos3z) e* + 2(Acos3z + Bsin3z) e” 
—9x(Acos3z + Bsin3x) e” + 62(—Asin 3a + Bcos 3x) e* + y3(z). 


Substituting these expressions into the equation (c) yields 


L[Djyz = —8y3 + 2(—3Asin3z + 3B cos 3x) e* + 2(Acos3z + Bsin3z) e® 
+2x(—3A sin 3x + 3B cos 3x) e” — 2(Acos3a + Bsin 32) e” 
—2x(—3A sin 3x + 3B cos 3x) e” — 2y3 + 10y3 = Ge” cos 32. 


Collecting similar terms, we obtain 


6(—3A sin 32 + 3B cos 3x) e” = 6e” cos 3a. 
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Equating coefficients of sin3a and cos3z yields A = 0 and B = 1. Hence, 
y3(“) = ve” sin3z. 


Since all these calculations are tedious, we may want to use Eq. (4.7.15), page 230, in order to determine a particular 
solution. Since the right-hand side of Eq. (c) is the real part of 6e“+%))”, we obtain a particular solution to be the 
real part of 
Or aitaie 
L'(1 + 3j) ; 


where L(\) = A? — 2\4+ 1 and L’(A) = 2(A— 1). Therefore, 


6x 


F 6x F 
= kK —————_—— (1+3j)e _ R L _3ju _ © sin 30. 
Up 1+ 3j-h° 6° e xe” sin 3x 


With this in hand, we are in a position to write down a particular solution y, of the original nonhomogeneous 
equation as the sum of particular solutions of auxiliary equations (a), (b), and (c): 


Yp(x) = yr (x) + yo(x) + y3(v) = e*® + 3sin 2x + 2cos 2x + re” sin3z. 


Example 4.7.13: (Modification rule and superposition rule) Solve the initial value problem 
LD] y = y" —2y’+y=6re™ +27, y(0) =0, y'(0) =1. 


Solution. The general solution y, of the corresponding homogeneous equation y’’ — 2y’ + y = 0 (also called the 
complementary function) is of the form 
yn(x) = (Ci + Cox) e” 


because the characteristic equation \? — 2A + 1 = (A— 1)? = 0 has the double root \ = 1. To find a particular 
solution of the nonhomogeneous equation, we split this equation into two 


y” —2y' + y = 6a e* or L[D] y = 6ze*, (d) 
where the forcing term, 62 e”, has the control number o = 1, and 
yy! +y=x* or LPly=2", (c) 


where «? has the control number o = 0. We determine a particular solution y; (a) of the equation (d) according to 
Table 229 as 
yi(z) = 2?(Ax + B)e* 


since \ = 1 is the double root of the characteristic equation (\ — 1)? = 0. That is why we multiplied the expression 
(Ax + B) e® by x”, where the power 2 of x indicates the multiplicity of the root 1 = 1. We calculate its derivatives 


yi(z) = (3Ax? + 2Br) e” + (Ax? + Bx?) e*, 
yi(z) = (6Ar+2B+ Ac? + Ba? + 6Ax? + 4Bz) e*. 
Substituting these expressions into the equation (d) yields 
(6Ax + 2B) ec” = Gare”. 
Hence, A = 1, B = 0, and a particular solution of the equation (d) becomes y; (x) = x e”. 


Let us consider the equation (e). The right-hand side function in this equation is not a solution of the corre- 
sponding homogeneous equation. Therefore, Table 229 suggests a particular solution yg in the same form: 


yo(z) = Ax? + Br +C. 


Its derivatives are 
yo(x) =2Ar+B, yy(x) = 2A. 
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Upon substitution of these expressions into the equation (e), we have 
2A —2(2Ae+ B)+Az?+Be+C=2? or Az? +(B-—4A)zr+2A—-2B+C =2". 
Equating the power like terms of x, we get the system of algebraic equations for A, B, and C: 
A=1, B-4A=0, 2A-2B+C=0. 


Thus, A= 1, B = 4, C = 6 and a particular solution of the equation (e) becomes y2(x) = x? +42 +6. Consequently, 
the general solution of the driven equation is the sum of particular solutions y; and y2 plus the general solution of 
homogeneous equation yp,(x), that is, 


y(x) = yn tyr t yg = 22 +40 +64 a2 e* + (Cy + Cox) e*. 


Substituting this expression into the initial condition, we obtain the relations allowing us to determine the unknown 
constants C; and C2: 
y(0) =6+C, =0, y'(0)=4+C,+C, =1. 
Hence, C, = —6, Cp = 3 and 
y(z) =a? + 4 + 6+ (-6+32+ 2°) e* 

is the solution of the given initial value problem. 

In all problems, D stands for the derivative operator, while D°, the identity operator, is omitted. The 
derivatives with respect to t are denoted by dots. 


1. Find the control number of the following functions. 


(a): (b) 2° +22; (ce) e; 
(d) x e*; (e) es (f) cos 22; 
(g) x’ sin3x;  (h) e?” sina; (i) (a + 1)? e® cos 3a; 
(j) a em. (k) sin2x+cos2x; (Il) e ” (cos3a@ — 2@ sin 3a). 
2. Obtain a linear differential equation with real, constant coefficients in the factorial form an(D—A1)™!(D— Az)? +--+ for 
which the given function is its solution. 
(a) y= 2e7+3e°7"; (b) y=5+4e-**; (c) y = 22+"; 
(d) yor +e”; (e) y=a?—1+4+cos3a; (f) y =2e * sin 2a; 
(g) y = (« +1) cos3a; (h) y = xe” cos 32; (i) y =x e”” + cosa; 
(j) y=a? +e sin2x;  (k) y = 3x e?* cos 5a; (1) y = 2? +e~* cos 2x. 


3. State the roots and their multiplicities of the characteristic equation for a homogeneous linear differential equation with 
real, constant coefficients and having the given function as a particular solution. 


(a) y= 2zxe?; (b) y= xe 4 Qa: (c) y= e 2” sin 3a; 
(dq) y=e™ sin2z; (e) y=4r+are""; (f) y=2+4cos 22; 
(g) y= 223 — e— 3; (h) y=1+ 2a? + e~?* sina; (i) y=cos3z; 

(ij) y=2cos3a — 3 sin3z; (k) y=2 cos3a — 8 sin 22; (l) y=e ” cos2z; 
(m) y=e"(¢+sin2x) +322; (n) y=cos*a; (0) y=ae?*; 

(p) y=a'e +3e"; (q) y=sin® a; (r) y=o? te. 

4. Find a homogeneous linear equation with constant coefficients that has a given particular solution. 

(a) Yp(x) = cos 2a; (b) yp(z)=e7+2e7";  (c)  yp(x) =e” sin 2a; 
(d) p(x) = 3x7; (e) yp(x)=sinz+e*; (f) yp(x)=-2 cosa; 
(g) op(@) =ae™*; (h)  yp(x) = sinha; (i) yp(x) = x cosha; 

Gi) yp(z)=ae* sin2a; (kk) yp(2) = 27e-7*; (1) yp(x) = (sinh x)?. 

5. Write out the assumed form of a particular solution, but do not carry out the calculations of the undetermined coeffi- 
cients. 

(a) y"+4y=a; (b)  y” +4y = (a +1) cos 2z; 

(c) y”+4y=cosz; (d) y” +4y = (a +1) sin 2x + cos 22; 

(e) y’+4y=asing; (f) yy” +2y' — 3y = cos 2x +sinz; 

(g) y” +4y =e; (h) yy” +2y' — 3y = 20e* cos 22; 

(i) y” +4y = (4-1); (j)  y” +2y' — 3y =a sin + cos 22; 

(k) y”+2y'-3y=(@4+1)?;) () y”+2y’ -—3y=e-* sing; 
(m) y” +2y’—3y =e"; (n) y” +2y' — 3y =e” +e-** cosa; 

(0) y+ 2y' — 3y = we"; (p)  y” — 2y' +5y = re” sin 2z; 

(q) y+ 2y! — 3y = 27 e"; (vr) y” — 2y’ + 5y = e* + cos 22; 

(s) y"—5y'+b6y= oe: (t) yy” — 2y' + 5y = xe® cos 2a 4 e® sin 22; 


(uu) y” — 2y’ + 5y = cos 22; (v) 2y” + 7y' —4y = en 42. 
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6. Find the general solution of the following equations when the control number of the right-hand side function does not 
match a root of the characteristic equation. 


(a) y”ty=2°; (b)  y +y! —6y = 27 — 22; 
Cc + — = 6e°": _ aa = Ae": 
(c) y+ 2y’ — 8y = 6e”*; (d) y"” —6y' + 9y = 4e”; 
Z S a ers “ + = 9sinhz; 
(ce)! — dy = de” (f) y+ 4y’ + 4y = 9sinh 
= 12e" + 6x" 4 x; 4 ae = 87°: 
(g) y+ 5y' + 6y = 12e" + 6x7 +102; (bh) y” + 3y' + 2y = 82°; 
(i) y” — 2y' + 2y = 2e* cos x, (j)  4y" —4y! + y = 827e"/?; 
(k)  y” — 4y’ + By = 242 e~** — 2c”; (1) (D?4+D+4 Dy = 32’; 
(m) + 5D)y = 17 + 10s; (n) (0? +D— 6)y = 50sina; 
Oo D* —D— = 40+ e.77: D2 —3D— = mer: 
(aq) (D' — ly = 4sina; (r) (0? -4D+4)y =e"; 
(s) (D8 +D?—4D—4)y=12e"—4e; (tt) (0? +D + 13/4)y = —7sin (V32). 


7. Find the general solution of the following equations when the control number of the right-hand side function matches 


a root of the characteristic equation. 


(a) 
(d) 


(D? + D—6) y=5e™, 
(D? — 3D — 4)y = 502 e*™, 


(b) 
(e) 


(D? —-D—2)y=18e™", 
(D? —-4D + 4)y = 2e?”, 


(D? — 10D + 26)y = e** sing, 
(D? — 6D + 13)y = 2e*” cos 2a. 


(c) 
(f) 


8. Solve the following initial value problems. 


(a) y” +4y’ +5y =8 sing, 
(b) y” —y=4ze’, 
(c) o! ty = 200", 


y(0) = 0, y'(0) =1; 
y(0) = 8, y/(0) =1; 
y(0) = 0, y'(0) = 2; 


(d) y” —y' — 2y = (16x — 40)e-** + 2a +1, (0) = —3, y/(0) = 20; 
(e) y" — Ty! — 8y = ~(140? — 182 + 2)e*, (0) = 0, (0) = 9: 

(f) y" + Qy’ + 5y — 4e~*, y(0) = 0, y' (0) = 2: 

(g) y” + 2y' — 3y = 25ae**, (0) = —6, y/(0) = -3; 


(h) y” +6y' + 10y = 4re*” sina, 
(i) y” + 8y' + 7y = 12sinha +7, 


(j) y” _ 3y" — 5y = 39 sin(2x) + xce**, 


(k) y” — 2y' + 10y = 6e” cos 3a, 
(1) y+ 2y' +y=2e™, 


y(0) =0, y'(0) = 1; 
y(0) = 3/8, y’(0) = 3/8; 
y(0) = -3, y/(0) = —27; 


y(0) = 1, y/(0) =1; 
y(0) =0, y/(0) =1. 


9. In the following exercises, solve the initial value problems where the characteristic equation is of degree 3 and higher. 
At least one of its roots is an integer and can be found by inspection. 


(a) y" +y' =2r+2cosx, y(0)=1, y’(0) =0, y”(0) = 2; 

(b) y’” + 3y” —4y = 18re", y(0) =0, y’(0) = —10/3, y”(0) = —5/3; 
(c) oy” —3y" + 3y'-y=27, y(0)=—3, y/(0) = -3, y”(0) =0; 

(d) y+y" —2y=sinx —3cosz, y(0) =3, y/(0) =0, y’(0) = 0; 
(e) yy” +y"—y'-y=—4e, (0) = 0, y’(0) = 3, y"(0) = 0; 

(f) y/” — 2y” —4y’+ 8y=8e7", —-y(0) = 0, y'(0) =5, y"(0) =6; 
(g) y!” —y" —4y' + 4y=3e", y(0)=1, y'(0) = —4, y"(0) = -1; 
(h) y”—2y"+y' = 2x, y(0) =0, y'(0) =4, y"(0) = 3; 

(i) yO —y” = 1227, y(0) =1, y/(0) =1, y"(0) = 24, y'"(0) = 2; 
(j) (D? — 3D? +. 4)y = 16 cos24+8sin2x, y(0)=1, y’(0) = —2, y”(0) = —3; 
(k) (D?-—3D—2)y=100 sin2x, y(0) =7, y’(0) =2, y”(0) = —27; 
(1) (D? +4D? +9D+10)y = 24e”, y(0) =4, y/(0) = —3, y’(0) = 39; 
(m) (D?+D—10)y=13e?”, y(0) = 2, y’(0) = 0, y”(0) =9; 


(n) y!” — 2y” —y' + 2y =e", 


y(0) = 2, y’(0) =4, y”(0) = 
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4.8 Variation of Parameters 


The method of undetermined coefficients is simple and has important applications, but it is applicable only to 
constant coefficient equations with a special forcing function. In this section, we discuss the general method (which 
is actually a generalization of the Bernoulli method presented in §2.6.1) credited?” to Lagrange, known as the 
method of variation of parameters. If a fundamental set of solutions for a corresponding homogeneous equation 
is known, this method can be applied to find a solution of a nonhomogeneous linear differential equation with variable 
coefficients of any order. Let us start, for simplicity, with the second order differential equation 


y” + p(x)y’ + q(x)y = f(a) (4.8.1) 


with given continuous coefficients p(x), q(x), and a piecewise continuous (integrable) function f(a) on some open 
interval (a,b). The method is easily extended to equations of order higher than two, but no essentially new ideas 
appear. 

In the language of linear operators, the problem of finding a solution of Eq. (4.8.1) is equivalent to finding a 
right inverse operator for L = D? + p(x)D + q(x), where D = d/dz is the operator of differentiation. In other words, 
y(x) = L~*[f](x) and the existence of the inverse operator L~' is guaranteed by Theorem 4.3 on page 189. The 
only problem is how to go about finding such a solution. 

The continuity of p(x) and q(x) on the interval (a,b) implies that the associated homogeneous equation, y” + 
py’ + qy = 0, has the general solution 

yn(v) = Crys (x) + Crya(z), (4.8.2) 


where C and C2 are arbitrary constants (or parameters) and {yi(x), yo(x)} is, of course, a known fundamental set 
of solutions that was found by some method or other. We call y;,(a), a general solution (4.8.2) of the associated 
homogeneous differential equation to Eq. (4.8.1), a complementary function of Eq. (4.8.1). The method of varia- 
tion of parameters involves “varying” the parameters C, and C2, replacing them by functions A(«) and B(x) to be 
determined so that the resulting function 


Yp(x) = A(x)yi(x) + B(x)ya(zx) (4.8.3) 


is a particular solution of Eq. (4.8.1) on the interval (a,b). By differentiating Eq. (4.8.3), we obtain 


Yp(x) = A'(x)yr (a) + A(w)yi (x) + B(x) y2(x) + B(x) yo (a). 


The expression (4.8.3) contains two unknown functions A(x) and B(x); however, the requirement that y, satisfies 
the nonhomogeneous equation imposes only one condition on these functions. Therefore, we expect that there are 
many possible choices of A and B that will meet our needs. Hence, we are free to impose one more condition upon 
A and B. With this in mind, let us enforce the second condition by demanding that 


A'(x)yi (x) + B’(x)yo(x) = 0. (4.8.4) 


Thus, 
Yp(x) = A(a)y; (x) + B(x)yd(a), 


so the derivative, y/,, is obtained from Eq. (4.8.3) by differentiating only y; and y2 but not A(x) and B(x). Next 
derivative is 


Vp (2) = A(a)yy (@) + A'(x)y, (x) + B(x) yo (@) + B(w)yo(2). 


Finally, we substitute these expressions for yp, y,,, and yj), into Eq. (4.8.1). After rearranging the terms in the resulting 
equation, we find that 


A(x) [yi (x) + p(a)yi (x) + a(@)y1(@)] + Bla) yz (x) + p(x) yo (x) + a(a)yo(a)] + A'(x)y\ (x) + BY(x)yo(x) = f(a). 
Since y; and yo are solutions of the homogeneous equation, namely, 


yj (x) + p(x)y; (x) + a(a)ys(x) = 0, 7 = 1,2, 


#2 Joseph-Louis Lagrange (1736-1813), born in Turin as Giuseppe Lodovico Lagrangia, was a famous mathematician and astronomer, 
who lived for 21 years (1766-1787) in Berlin (Prussia) and then in France (1787-1813). 
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each of the expressions in brackets in the left-hand side is zero. Therefore, we have 
A'(x)y (x) + B'(x)yo(x) = f (2). (4.8.5) 


Equations (4.8.4) and (4.8.5) form a system of two linear algebraic equations for the unknown derivatives A’(a) and 
B'(x). The solution is obtained by Cramer’s rule or Gaussian elimination. Thus, multiplying Eq. (4.8.4) by yj (a) 
and Eq. (4.8.5) by —y2(x) and adding, we get 


A'(x) (yy — yoy) = —y2(x) f (2). 


The expression in parenthesis in the left-hand side is exactly the Wronskian of a fundamental set of solutions, that 
is, W(y1, y232) = yiys — yoy;. Therefore, A’(a)W(x) = —yo(x) f(x). We then multiply Eq. (4.8.4) by —y{ (2) 
and Eq. (4.8.5) by y:(z) and add to obtain B’(#)W(a«) = yi (x) f(x). The functions y; (7) and y2(a) are linearly 
independent solutions of the homogeneous equation y” + py’ + gy = 0; therefore, its Wronskian is not zero on the 
interval (a,b), where functions p(a) and q(x) are continuous. Now, division by W(x) 4 0 gives 


yyy — yoy,’ yyy — yout 
By integration, we have 
yo(x) f(x) yi (x) f(x) 
A(z) =— | 2X" az t+, Biz) = | SS“ dar + &, 4.8.6 
() W (yi, 2; 2) * () W (yi, y2; £) : en) 


where OC, C2 are constants of integration. Substituting these integrals into Eq. (4.8.3), we obtain the general solution 
of the given inhomogeneous equation: 


x 


y(x) = —yi(2) / aa da + yo(x) / ae da + Cyyi(x) + Coyo(x). 


Since the linear combination y;,(x) = Cyy1(%) + Coy2(x) is a solution of the corresponding homogeneous equation, 
we can disregard it because our goal is to find a particular solution. It follows from Theorem 4.19 (page 224) that 
the difference between a particular solution of the nonhomogeneous equation and a solution of the homogeneous 
equation is a solution of the driven equation. Thus, a particular solution of the nonhomogeneous equation has the 
form 


la) = f G(e,8) f6)aé, (4.8.7) 
xO 
where function G(z, €), called the Green function of the linear operator*? L = D? + p(x)D + q(z), is 


One Aen) 
Ceo) = Ono mone 


The Green function depends only on the solutions y; and ye of the corresponding homogeneous equation L[y] = 0 
and is independent of the forcing term. Therefore, G(x, €) is completely determined by the linear differential operator 
L (not necessarily with constant coefficients). 


(4.8.8) 


Example 4.8.1: We consider the equation 
(D?-3D+2)y=e", D=d/dz. 


The complementary function (the general solution of the associated homogeneous equation) is y;,(a) = Cie” + Co2e?*, 
so we put 
Yp(z) = A(x) e* + B(x) ee. 


where A(x) and B(x) are unknown functions of «. Since y/, = A(x) e* + A(x) e* + 2B(a) e?” + B’(a) e?*, we impose 
the condition 
A' (x) e* + B' (x) e®* =0. 


43 George Green (1793-1841) was a British mathematical physicist whom we remember for Green’s theorem from calculus. 
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Substituting y, into the differential equation, we obtain the second condition for derivatives A’(x) and B’(z): 
Al (x) e® + 2B! (a) e?* = e7*. 
From this system of algebraic equations with respect to A’(xz) and B’(x), we have 
A'(2)=-e**, Bi(z)=e°™. 
Integration yields 


iL 1 
A(x) = a i +Ci, B(«)= =a e °* + Co, 


where C, and C2 are arbitrary constants. Therefore, the general solution becomes 


1 1 1 ? 
y(“) = . e 7 — 3 é *4+Cie” +Coe™, or y(a) = 6 e7* + Cye® + Cye?”. 


The Green function for the operator D? — 3D + 2 is G(a, €) = e?(*-®) — e®-§, 


Example 4.8.2: Solve the equation 
(D? + 1)y = escx = 1/sing, D=d/dz. 
Solution. The general solution of the corresponding homogeneous equation is 
yn(x) = Ci sina + Co cosa, 
that is, y,(a) is the complementary function for the operator D? + 1. Let 
Yp(x) = A(x) sinz + B(x) cosa 


be a solution of the given inhomogeneous equation. Then two algebraic equations (4.8.4) and (4.8.5) for derivatives 
A’ and B’ become: 


A’(x) sinaz + B(x) cosa = 0, A'(x) cosx — B’ (x) sinx = csca. 
This system may be resolved for A’(x) and B’(«), yielding 
A'(x) =cosxcscx =cotax, B'(x) =—sinxcscx = —1. 


Integrating, we obtain 


A(a) = [cota +C, =In|sinz|+Ci, B(x) =-«#+C. 


A particular solution of the nonhomogeneous equation becomes 


Yp(x) = sina - In|sinz| — x cosa. 


If we want to find the solution of the initial value problem for the inhomogeneous equation (4.8.1), it is sometimes 
more convenient to split the problem into two problems. It is known from Theorem 4.20 on page 224 that the general 
solution of a linear inhomogeneous differential equation is a sum of a complementary function, y,, and a particular 
solution of the nonhomogeneous equation y,. Thus, a particular solution can be found by solving the initial value 
problem 


Yp + p(x)y, + 4(2)¥p = f(x), Yp(to) = 0, y,(20) = 0, (4.8.9) 


and the complementary function is the solution of the initial value problem 


yn + P(x)yn + 4(z)yn =9, yn(20) = yo, y,(Z0) = Y0- (4.8.10) 
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Example 4.8.3: Solve the initial value problem 
y! —2y’ +y =e /(e? +1), y(0) =3, y/(0) = 1. 
Solution. We split this initial value problem into the two IVPs similar to (4.8.9) and (4.8.10): 
Up — Yip + Up =e" /(e7 +1)*,  yp(0) =0, y,(0) = 0 
and 


Yn — 24, tyr =0, yn(0) = 3, y,(0) = 1. 


Then the solution to the given initial value problem is the sum: y = yp + yn. We start with the homogeneous 
equation y” — 2y’ + y = 0. The corresponding characteristic equation \* — 2A + 1 = 0 has a double root = 1. 
Therefore, the complementary function is 

Yn(£) = (Cy + Cox)e”. 


Substituting this solution into the initial conditions y(0) = 3 and y’(0) = 1, we get 


The next step is to integrate the nonhomogeneous equation using the variation of parameters method. Let y1(x) = e” 


and y2(x) = xe* be linearly independent solutions, then from Eq. (4.8.8) we find the Green function: 


- = yi(€)y2(x) — y2(€)yi (x) fe eee 
Ce.) = OuOonOn® ~F-98 


According to the formula (4.8.7), a particular solution of the nonhomogeneous equation with homogeneous initial 
conditions becomes 


x ez x (x = E)es 
To evaluate the integral, we integrate by parts (— f vdu = —vu+ f{ udv) using substitutions: 
v=a—€, dvu=—dé u=(1+e)~ eS ae dé. 
3 ’ cs (e& + 1)2 


The result is = ag 
oer x = zx a, 
Yp = re e€ [ ia 


In the last integral, we change the independent variable by setting t = e§/?. Then 


x/2 x/2 x/2 

© dé dt ee de oT aah 
——_ = 2 ao = 2 —-2 5 
9 l+eé& 7 t(1 + t?) 7 t 1 +t 


1 
= In? —In(14 2)|—, =n a —In Z =a—-—In(1+e")+In2. 
i 1+e* 2 


Hence, 


Yp(x) = —e* [In (1 + e®) —In(2)]. 


Now we extend the method of variation of parameters to linear equations of arbitrary order. Let us consider an 
equation in normal form: 


y”) + an_1(x)y?—) +--++ao(x)y = f(x) or L|x,Dly = f, (4.8.11) 


defined on some interval J and assume that the complementary function yp, = Cyy1(x) + Coyo(x) +--+ + Cryn(x) is 
known, where Ci are arbitrary constants and yz, k = 1,2,...,n, are linearly independent solutions of the homoge- 
neous equation L[x,D]y = 0. Following Lagrange, we seek a particular solution of Eq. (4.8.11) in the form 


Yp(x) = Ar(x)yi (x) + Aa(x)yo(x) +--+ + An(2)yn(z) , (4.8.12) 
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where we impose the following n conditions on the unknown functions A(z), k = 1,2,...n: 


Ai (2)y1 + Ag(2)y2 +--+ Aj, (x)yn = 0, 
Aj (x)yy + Aa(z)yg +++ AL (x)¥n = 9, 


(4.8.13) 
Ay (a)yt"? + Ag(e)yg"? +++ An (ay = 0, 
Ay (e)yt"? + Ag(a)yg"? +++ An (af? = Fl) 
for all x € I. This is a system of n algebraic linear equations in the unknown derivatives A‘ (x), ..., A/,(x), whose 
determinant is the Wronskian W[yi (x), y2(x),.-.,Yn(x)]. Therefore this system has a unique solution and after 


integration, we obtain the functions A,() that can be substituted into Eq. (4.8.12) to get a particular solution, 
which in turn can be written in the integral form as 


Yp(x) = i G(x, €) f(g) dé. (4.8.14) 


The function G(a, €) is called the Green function to the operator L[z,D] =D" + an_i(z)D"~1 +--+ + ao(x) and 
it is uniquely defined by the ratio of two determinants: 


yi(€) yo(€) - Yn (€) 
yi (€) yo(e) ss) 
Gla,é) = - [Winnie nl) « (4.8.15) 
ye) ye) ye” (€) 
yi (2) yo(x) Yn(Z) 


The integral expression in the right-hand side of Eq. (4.8.14) defines a right inverse for the operator L. Actually this 
inverse operator is uniquely defined by the following conditions, which we summarize in the following statement. 


Theorem 4.23: Let G(x, €) be defined throughout the rectangular region R of the x€-plane and suppose that 
G(a,€) and its partial derivatives O*G(z,€)/Ox*, k = 0,1,2,...,n, are continuous everywhere in R. Then 
G(x, €) is the Green function to the linear differential operator L[x,D] = an(x)D” + an_i1(x)D"~! +--+ + ao(z), 
D = d/dz, if and only if it is a solution of the differential equation L |x, D] G(x, €) = 0 subject to the following 
conditions 

dG(x, §) 


Oe yee samoren| ena 


is left as Problem 10, page 245. | 


Our next example demonstrates the direct extension of the variation of parameters method for solving driven 
differential equations of the order greater than two. All required steps are presented in the Summary, page 255. 


De ral| SG) a 


Oe a OE eae Beara) 


Example 4.8.4: Let us consider a constant coefficient linear nonhomogeneous equation of the fourth order 
y + Qy" + y = 2 esct. 


The characteristic equation (A? + 1)? = 0 of the associated homogeneous equation has two double roots: \; = j and 
A2 = —j. So, the fundamental set of solutions consists of four functions 


yi =cost, y2=sint, y3=tcost, ya =tsint. 


The main idea of the method of variation of parameters is based on the assumption that a particular solution is of 
the following form: 


y = Ai (t) yr + Aa(t) yo + As(t) ys + Aa(t) ya, 
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To determine four functions, Ai(t), Ao(t), A3(t), and A4(t), we impose four conditions: 


( 
a(t)ys + Ay(t)ys = 0, 
( 


Al (t)y1 + Ab(t) yo + A 
Aj (t) yy + Ag(t) yo + Ag(t) ys + Ault) ys = 0, 
Ai (t) yt + Ag(t) yg + Ag(t) ys + Alyy = 0, 
Ai (t) yt" + Ag(t) yo’ + Ag(t) y3’ + Ab(t) yf’ = 2 eset. 


Since finding a solution of a fourth order algebraic system of equations is quite time consuming, we ask Maple to 


help us with the following commands: 
eqi:= al*cos(t) + a2*sin(t) +a3*t*xcos(t) + a4*t*sin(t); 


eq2:= a2*cos(t) - al*sin(t) +a3*(cos(t)-t*sin(t)) + a4*(sin(t)+t*cos(t)); 
eq3:= al*cos(t) + a2*sin(t) + a3*(2*sin(t)+t*cos(t)) +a4*(t*sin(t)-2*cos(t)); 
eq4:= alxsin(t) -a2*cos(t) +a3*(t*sin(t)-3*cos(t)) - a4*(3*sin(t)+t*cos(t)); 
solve({eql=0, eq2=0, eq3=0, eq4=f}, {al,a2,a3,a4}); 


This results in 
Aj(t)=tecott—1, A$(t)=cott—t, A3(t)=—cott, A(t) =—-1. 


Integration yields 
A; =t In|sin¢| — jf s|sinelat, Ag = In|sint|+2?/2, As; = —In|sint|, Ag = -t. 
Substituting the values of functions A;(t), A2(t), A3(t), and A4(t), we obtain a particular solution to be 
y =sint (In| sin¢| — ¢?/2) — cost f In | sin ¢| dt. 
The Green function for the given fourth order differential operator is 
Glew 6) = 5 sine - 9) — 5 (@- 8) cos(e - 


because the function G(x) is the solution of the following IVP: GY +2G”+G=0 G(0) = G’(0) = G"(0) = 0, 
Goya. 


Problems| In all problems, D stands for the derivative operator, while D°, the identity operator, is omitted. The 


derivatives with respect to t are denoted by dots. 


1. Use the variation of parameters method to find a particular solution of the given differential equations. 


(a) y" +y=cota; (b) y’ +y=3csca; 

(c) y+ 4y = 4sec? 2a; (d) 9y” +y = 3csc(x/3); 

e "4 dy =8 sin? x; f "4 9y = 9 sec 3a; 
¥ Y ¥ y 

(g) yy’ +4y’+5y=e°7* tang; (h) y”-y= 2/(e* — 1); 

(i) y” —9y = a Se: G) y’-y= =. 


2. Given a complementary function, y,, find a particular solution using the Lagrange method. 
(a) xy" +ay’-y=2*, yr(x) =Cix4Cox™}; 
(b) 2? y” —4ay’ + 6y = 22°,  yn(x) = Cia? + Coa; 
(c) a? y” —3ay’ +4y =4a4,  yn(a) = Cia? 4+ Cox? Ing; 
(d) 2? y"” +ay’+y=Ina, yn(x) = Ci cos(Inx) + Co sin(Inx); 
(e) 2? y"” +3ay' —3y=2¢, yn(x) = Cia+Cox™*; 
(f) vy" +3ry +y=927, yr(a2) = Cra t+ Coa Ine 


3. Use the variation of parameters method to find a particular solution to the given inhomogeneous differential equations 
with constant coefficients. Note that the characteristic equation to the associated homogeneous equation has double 
roots. 

—22 —3a 
(a) y” +4y'+4y = 2S} (b) yy” +6y'+9y= f+ 27x? + 180; 
(c) y” —4y' +4y =2e?” Inxz+25 sina; (d) y” —2y’+y=4e" Ina; 
(ec) y” —6y' + 9y =a 7 e**; (f) y+ 6y' + 9y = = 
” 2 y =e 7/(1 2\. h i Die. eb 1 2 
(g) yl +2y+yse"/(1+2°*); (h) (D—1)°y=e*/(1—2)’. 


4.8. 


13. 


14. 


15. 


16. 


17. 


18. 
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. Use the Lagrange method to find a particular solution to the given inhomogeneous differential equations with constant 


coefficients. Note that the characteristic equation to the associated homogeneous equation has complex conjugate roots. 
(a) 4y” —8y'+5y=e" tan? 2; (b) y”+y=2sec?a2; (c) y”+y=6 sec*a; 
(d) 9y” — 6y’ + 10y = 13e”; (e) yy" +y' =cscea; (f) y+ 4y’ =8 tan2z. 


. Use the method of variation of parameters to determine the general solution of the given differential equation of the 


order higher than two. 
(a) y”+4y'=tan2t; (b) y” —y" +y'-y=csct; 
(c) y%—y=sect;  (d) y” —4y” +y/ + by = 12/(te’). 


. Find the solution to 


1, ift>0, 


” 7 _ _ 
y +y=H(t)H(r-1t), where HwO={ 0, ift<o, 


subject to the initial conditions y(0) = y’(0) = 0 and that y and y’ are continuous at t = 7. 


. Compute the Green function G(x,&) for the given differential operator by using (4.8.15). 


(a) D?(D— 2); (b) D(D?—9);  (c) D®—6D?+11D—6; 
(d) 4D°—p?+4D-1; (e) D?(D?4+1); (f) Dt-1. 


. Find a particular solution y” + 4y = 2 sin? # in two ways. 


(a) Use variation of parameters. 


(b) Use the identity 2 sin? 2 = 1 — cos 2x and the method of undetermined coefficients. 


. Prove that the function (4.8.14) satisfies the following initial conditions yp(x%o) = y, (to) =--- = ys" (x0) = 0. 
. Prove Theorem 4.23 on page 243. 

11. 
12. 


Using the variation of parameters method, solve problems 6 and 7 on page 238. 


Solve the initial value problem 


Lily] 2 vy" — Qe+1)y'+(@t+ly=20'e* nz (@>0), y(1)=2, y/(1) =4, 


given that yi(z) =e” is a solution of the homogeneous equation Ly] = 0. 
Solve the initial value problem 


def 


Lily] © x (1+ 30”) y” +2y! —6ry =(14+327)” («@>0), y(1)=2, y'(l) =4, 
given that yi(#) = 2~' is a solution of the homogeneous equation L[y] = 0. 


Solve the initial value problem 


def "W 


Lly] = xy 


2(a@+1)y'+(a#+2)y=30%e" Inx (x >0), y(1) =3e?, y’(1) =6e’, 


given that yi(z) = e” is a solution of the homogeneous equation Ly] = 0. 


Find the continuous solution to the following differential equation with the given piecewise continuous forcing function 
subject to the homogeneous initial conditions (y(0) = y’(0) = 0). 


F _f 4, if0<t< §, 
cn a { 0, otherwise. 


Find the Green function for the given linear differential operators. 


(a) D?+4; (b) D?—D-2; (c) D?+4D+4; 
(d) 4D?—8D+5; (e) D?+3D-—4; (f) «dD? — 2D +42; 
(g) «D?—(1+227)D; (h) (1—27)D?—2xD; (i) D®*+3D?-pD-32. 


Find the general solution of the given differential equations. 
1d du 1d / du 
merce (2 eae ear Fe b eavecs pics ee 
par ¢ *) ag (~ | 
(c) y”+3y'+2y=12cosht; (d) ty” +ty’—y=1. Hint: y. =t. 
Use the Green function to determine a particular solution to the given differential equations. 


(a) y"+4y'+4y= f(x); (b) y” —4y'+5y= f(a); 
(c) y” +w?y = f(x); (d) y” —k’y = f(a). 
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4.9 Bessel Equations 


The differential equation 
gy” + ay + (2? —v*)y=0, (4.9.1) 


where v is a real nonnegative constant, is called the Bessel** equation of order v. It has singular points at « = 0 


and x = ov, but only the former point is regular. We seek its solution in the form of a generalized power series: 


Co Co 
y(a) = 2 y An 2” = y An, ET, 
n=0 n=0 


where a is a real number to be determined later. Without any loss of generality we assume that ao 4 0. Substituting 
y(a) into Eq. (4.9.1), we obtain 


lee) Co loc) co 
S- (n+a)(n+a—1)ay,2"t* + S- (n+ a)an xt? + ys 6,6? air s an at? = 0. 
n=0 n=0 n=0 n=0 


Setting the coefficients of each power of x equal to zero, we get the recurrence equation for the coefficients: 
[(n + a)(n+a—1)+(n+a) —v7] an + an—2 =0 (n > 2) 


or 
[(n +a)? —v?] an +an-2=0 (n=2,3,4,...). (4.9.2) 


We equate the coefficients of x* and x«°t1! to zero, which leads to equations 
(a? —v*)agp=0 and [(n+1)?—v*] a, =0. 


Since ap 4 0, it follows that 


e—v?=0 or a=+tv (4.9.3) 


and a; = 0. The recurrence relation (4.9.2) forces all coefficients with odd indices to be zero, namely, aox41 = 
0, k=0,1,2,.... 
We try a = +y as a solution of Eq. (4.9.3). Then the recurrence equation (4.9.2) gives 


an—2 


an = — ———~, n22 
n(n + 2v) 
and, therefore, 

_ ao = ao 
= “90420) 2+)’ 
* a ag —— ag a ao 
4 “A4+2Qv)~ 4-224 0) 22221 (1 + 0)(2 40)’ 
fe es ee 
6 6642) 23311 + V)(2Q+V)340)’ 

ee ee, ee, REI 
2k ( ) Pe (1 +v)\(2+v)---(k+v)’ y4t5 4) 


It is customary to set the reciprocal to the arbitrary constant ag to be 1/ap = 2” T(v)v, where 
co 
T(v) =| fe" di, Rv >0, (4.9.4) 
0 


is the gamma function of Euler. Since I'(v) possesses the convenient property [(v + 1) = vI'(v), we can reduce the 
denominator in ag, to one term. With such a choice of ag, we have the solution, which is usually denoted by J, (): 


el (—1)* z 2k+v 
Jy (x) = » a (4.9.5) 
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an—2 
ee 
oF n(n — 2v)’ oe 
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and gives another solution of Eq. (4.9.1) to be 


is, (—1)* (2) Vy 
. a (—v+k+1) 


This is the Bessel function of the first kind of 
order —v. 


Figure 4.5: The reciprocal [-function 
called the Bessel function of the first kind of the order v. The recurrence equation (4.9.2) with a = —v is 
reduced to 
If v is not an integer, the functions J,(#) and J_,(x) are linearly independent because they contain distinct 
powers of x. Thus, the general solution of the Bessel equation (4.9.1) is given by 
y(x) = e1 Jr (x) + 2 Jv (#) 
with arbitrary constants c, and c2 


(4.9.6) 
When v = n, an integer, the denominators of those terms in the series for J_,(a) for which -n+k+1 <0 
contain the values of the gamma function at nonpositive integers. Since the reciprocal [-function is zero at these 
points (see Fig. 4.5) all terms of the series (4.9.6) for which & < n—1 vanish and 
k 2k—n 
' (1) () 


< ki [(-n+k+1) 
We shift the dummy index k by setting 7 =k —n, then k = 7 + n and we get 


->7 =1yrts (go 


)! 
= Gj +tn)!T( 


—n+j+n+1) 
love) 2j+n 
yr — n 
ps Ce = (-1)" Jn() 
j=0 (9 
because I'(j + 1) = j! for integers 7. For example 


oo 2k 
ey) u al 

J = eee ee i eae 

o(2) » (2 Bane | Raye Wane’ 

Since the Bessel functions J,,(x) and J_»() are linearly dependent, we need to find another linearly independent 
integral of Bessel’s equation to form its general solution. This independent solution can be obtained by using the 
Bernoulli change of variables (see §4.6.1) 
with v = n leads to 


2 


(x) = u(x) Jp (a). 


or 


Substituting y(x) and its derivatives into Eq. (4.9.1) 
(u" Jn + Qu! Fl + J) + 2(ul Jn + ud.) + (a? — 0 


2Vudn =0 


(u” Jn + 2u' J!) + 2u' In = 0 
because J,(z) is an integral of Eq. (4.9.1). Letting p = u’, this equation may be rewritten as 


d 
‘D JIn + p22? J), + Jn) = 0 or Lae | 
Separating the variables, we obtain 


d Ji(@) 1 
ap —— 9 ak ) + = dz. 
p JIn(L)  & 
44Tn honor of Friedrich Wilhelm Bessel (1784-1846), a German astronomer who in 1840 predicted the existence of a planet beyond 
Uranus. Equation (4.9.1) was first studied by Daniel Bernoulli in 1732 
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After integration, we have 


Cy du Cy 
zJ2(a) Pda z[Jn(a))2 


Inp = —-2lInJ,(x) —Ineg+InC, =In 


Next integration yields 


dx 
ula) =C; ij A. @e + C2, 


where C; and C2 are arbitrary constants. Hence, the general solution of Eq. (4.9.1) becomes 


dx 


ue) = Cp Tay #0) Ie | aoe 


The product 


dz 
Inte) | PERCH 


defines another linearly independent solution. Upon multiplication by a suitable constant, this expression defines 
either Y,,(2), the Weber* function, or N,,(x), the Neumann* function. We call these two functions N,,(x) and 
Yn (x) = 7N,,(a”) the Bessel function of the second kind of order n. 


It is customary to define the Neumann function by means of a single expression 


cosy7m J,(x) — J_,(«) 


N,(z) = (4.9.7) 


sin v7 


which is valid for all values of v. If vy = n, an integer, sinna = 0 and cosna = (—1)”, then, by Eq. (4.9.7), Nn(x) 
takes the indeterminate form 0/0. According to l’H6pital’s rule, we obtain 


where y = I’(1) = 0.5772156... is the Euler*” constant. For example, 


x _1)k 
Yo(z) = tNo(z) = 2Jo(z) (n= +7) -2 », = 


a\2k 4 
oe 


k= 
Recall that the sum of reciprocals of the first n integers is called the n-th harmonic number, H, = S7;_, k7'. 
Obviously, the functions Yo(x) and No(a) are unbounded when x = 0. The functions Y,(x) and J_,(x) approach 


infinity as « + 0, whereas J, (x) remains finite as x approaches zero. 
Hence, the general solution of a Bessel equation of any real order v may be written 


y(a) = CJ (x4) + CoY.(x) or y(a) = Ci JL(a) + CoN, (x) 
with arbitrary constants C; and C3. 


“5Heinrich Martin Weber (1842-1913) was a German mathematician whose main achievements were in algebra, number theory, and 
analysis. He introduced the function Y,(«) in 1873. 

46Carl (also Karl) Gottfried Neumann (1832-1925), was a German mathematician who was born and studied in Kénigsberg (now 
Kaliningrad, Russia) University. 

47Sometimes ¥ is also called the Euler—Mascheroni constant in honor of ordained Italian priest, poet, and teacher Lorenzo Mascheroni 
(1750-1800), who correctly calculated the first 19 decimal places of y. It is not known yet whether the Euler constant is rational or 
irrational. 


4.9. Bessel Equations 249 


(a) 


Figure 4.6: (a) graphs of Bessel functions J,(x) for n = 0,1,2,3,4; and (b) graphs of Weber’s functions Y,,(a) for 
n= 0,1,2,3,4. 


4.9.1 Parametric Bessel Equation 
Another important differential equation is the parametric Bessel equation: 
gy” + acy! + (2? — v?)y = 0. (4.9.8) 
This equation can be transformed into Eq. (4.9.1) by replacing the independent variable t = Ax. Then 
dy __dy dt _, dy _ 


a a a ae 
and 
dz? da \dz dx dt dt? dz dt? , 
Therefore, 
, ; dy 2 2\2 Oi st 2 dy 
cy =cdy=tyats, vy =axaMy=ty=t oe 


Substitution t = Ax into Eq. (4.9.8) yields 
Pyttyt+(? —v?)y =0, 
which is the regular Bessel equation. Thus, the general solution of Eq. (4.9.8) becomes 
y(x) = Cy J, (Av) + C2 J_L(Ax) for noninteger v 


or 
y(x) = Cy JL (Av) + C2 Y_(Ax) for arbitrary v. (4.9.9) 
For \ = j, namely, \? = —1, we have 


ay” +ary' — (a? +v7)y=0. (4.9.10) 


This equation is called the modified Bessel equation of order v. From (4.9.9), it follows that the modified 
Bessel function of the first and second kinds 


co 2Qk+v 
ore. (3 I_,(a) — I,(a) 
L (2) =7-” J.Gz) = _ a) ac eee 
(2) =F" Gx) d MT(k+v+1)’ (2) 2 sin(v7) 
form a fundamental set of solution to Eq. (4.9.10). For example, 
2 4 6 


x x x 
b@)-ltoteptape 


ana, 


There are many amazing relations between Bessel functions of either the first or second kind. Some of them are 
given in the exercises. 
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Wate nopsaeseeeee: 


roa  * * 


Figure 4.7: (a) Graphs of the modified Bessel functions of the first kind J9(ax) (dashed line) and Ip(x); (b) graphs 
of the modified Bessel functions of the second kind Ko(x) (dashed line) and K2(x). 


4.9.2 Bessel Functions of Half-Integer Order 


The Bessel functions whose orders v are odd multiples of 1/2 are expressible in closed form via elementary functions. 
Let us consider the Bessel function of the first kind of order 1/2: 


(— 1" 5g). 


Tietz “RIT(R+1+1/2 


From the well-known relation (Qk +1)! 
+1)! 


it follows that 


JR Q2K+1 A /g2h41/2 
Syale) = ET kl ae RQk+D! Vn Es 
ue So (H1)h ht 2 | 
= BASE », “ka D a 2 sin x. 
By a similar argument, it is possible to verify the following relations: 


2 2, 2 
J_y)2(@) = ere cost, Iy/2(x) = ons sinhz, I_j/2(“) = eae cosh x. 


4.9.3 Related Differential Equations 


Let Z,(a) denote either a solution of Bessel equation (4.9.1) or a solution of the modified Bessel equation (4.9.10). 
Let us introduce the function that depends on three parameters (p, q, and k) 


u(a) = 279Z, (ka?). (4.9.11) 
Then this function u(x) is a solution of either the differential equation 
wu! +2 (1 —2q) ul + (k2p?a?? — pv? +. @?) u=0 (4.9.12) 
or 
vu" +2 (1—2q) ul — (k*p?a? + p?v? — @?)u=0 (4.9.13) 


depending on which Bessel equation the function Z,,(t) satisfies. 
Example 4.9.1: To find the general solution of the differential equation 
y" -f a’g*y = 0, 

we change variables y = u/z, « = Wkt (or x? = kt), where k will be determined shortly. Using the chain rule, we 
get 

dt = 2x = d 2d 

dx k dz k dt’ 

d d d 1 d 1 2 1 

ay _ a (er u) — _i/2 i; 4 so ty = gil? an (=) hag dl yy 5 out sa? u, 
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where dot stands for the derivative with respect to t. Now we find the second derivative: 


2 
fy 8 (20hie ters) 24 (2arartein) 


da? dx k dt 
4 1 1 
= Fait Pi lo Put oo a. 


Substituting this expression for the second derivative into the given equation and multiplying by x°/?, we obtain 


4 4 1 1 
pvitpru-ju+@ciu=0 = Ai + 40+ («ie 2) u=o. 


Now we set k = 2/a, and find that u(t) is a solution of the Bessel equation 


1 
Pit ut (e-Z)u=0 


Therefore, the given differential equation has two linearly independent solutions 


7 ax? 7 ax? 
nto) =I (SZ) and wola) = 2a (ZS), 


Example 4.9.2: Let us consider the Airy equation*® 


1 


y =ay (-o<2< oo). (4.9.14) 
Since this linear differential equation has no singular points, we seek its solution in the Maclaurin form 


y(x) = a9 + aya + aga? +++» = S> ana” 
n>0 


Substituting the series for y(x) and for its second derivative y’(r) = 77° 9 @n42(n+2)(n+1) x” into Airy’s equation 


yields 
aes = oa" t= Gacin” 
n=0 m=1 


For two power series to be equal, it is necessary and sufficient that they have the same coefficients of all powers of 
xz. This gives 
a22-1=0, AQn4o(n+2)(n+1)=an-1, n=1,2,3,... 
or 
an—-1 


at2 = el ee errr =0. 9. 
An+2 (n+ 2)(n +1) n 3 az =0 (4.9.15) 


Equation (4.9.15) is a recurrence relation of the third order. Since ag = 0, every third coefficient is zero, namely, 


a2 0, as 0, ag 0,...,@3n+2 =0, i aa Oe le eee 


Instead of solving the difference equation (4.9.15), we set g = 1/2, p = 3/2, k = 2/3, and v = 1/3 in Eq. (4.9.13) 
to obtain the modified Bessel equation that has two linearly independent solutions, known as Airy functions. 
They are denoted by Ai(a) and Bi(z), respectively. Airy functions commonly appear in physics, especially in optics, 
quantum mechanics, electromagnetics, and radioactive transfer. These functions are usually defined as 


1 /a 2 
=> r= 3 Ky /3(2), Z£> 3 x/? 5 (4.9.16) 
; x 2 
Bila) = 4/2 [hyl)+ Lap], 2= 52°, (4.9.17) 


48Sir George Biddell Airy (1801-1892) was an English astronomer and mathematician who was a director of the Greenwich Observatory 
from 1835 to 1881. 
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for positive x, and 


=F | ysl)- BYinl™ | ue [Ji3(z) + Ji/3(2)] , (4.9.18) 


os ls 


-S\— Ji j3(z) + Yijs(z 2| = fz Las) — Jy /3(z)] (4.9.19) 


for the negative argument, where z = = me 2. Airy’s functions are implemented in Mathematica as AiryAi[z] and 
AiryBi[z]. Their derivatives are denoted as AiryAiPrime|z] and AiryBiPrime[z]. In Maple, they have similar nomen- 
clatures: AiryAi(«), AiryBi(x) and their derivatives AiryAi(1,7), AiryBi(1,z). Maxima and Sage share the notation 
airy_ai and airy_bi. MATLAB on prompt airy(x) or airy(0,x) returns the Airy function of the first kind Ai(z) 
and on prompt airy(2,x) returns the Airy function of the second kind, Bi(a). Its computer algebra package Mu- 
Pad has special commands: airyAi and airyBi, respectively. SymPy uses airyai and airybi notations for these 
functions. 


(a) (b) 
Figure 4.8: (a) Graph of Airy function Ai(z). (b) Graph of Airy function Bi(x). 


[Problems] 


1. Express I3(a) and I4(x) in terms of Io(a#) and I1(«). 


2. Show the following closed form formulas for Bessel functions. 


2 sin x 2 cos x . 
(a) J3/2(a) = i. ( oo cost) (b) J_3/2(“) = — ae ( = + sin); 
2 3 : 3 2 3 3), 
(c)  Js/2(%) = = (3-3) sina > cosa}; (d) J_5/2(“) = a (4-1) cos + 3 sina}; 


(e)  Jzj2(4) = 2 (3 — x) sin x2 — (= - 1) cos, 


3. Find Ji(0.1) to 5 significant figures. 


4. Using generating functions 


prove the recurrences 


2 2 
— Jn (2) = Inai(z) + In—r(a), — In(a) = In-1(z) —Inga(z), n=0,41,.... 


5. Prove the following identities. 


(a) Jie) = 5 a(x) - Jola)) (b) < [ei(a)] = 2 Jol); 
we) = 1 _ Jo (a), Jo(v) _ 2 — Jo(). 
©) T@~ 2” Fa)’ © T@) ~ = AG) 
(e) 4 [2? A(z) +2 Jo(x)] = 27 Jo(z)+Jo(x);  (f) Ji (x) = Jo(x) - - Ji (x); 
(g) + [2x Ji(x) — x? Jo(x)] =a A(a): (h) Jo(a) = -Si (2). 
6. Show that, if m is any positive integer, 


rs [x” Jv(x)] = 2” Jr—-m(2), (=) " [x-’ J.(2)] = (-1)" 2” Jug (a). 


x dx 


4.9. 


10. 


11. 


12. 


13. 


14. 
15. 


16. 


17. 


18. 
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. Verify that the Bessel equation of the order one-half 


xy” tay’ 4 G i) y=0, x>0 


can be reduced to the linear equation with constant coefficients u’ + u = 0 by substitution y = 2~!/? u(a) . 


. Verify that when x > 0, the general solution of 


zy” +(1—2a) ay’ + B72? y =0 


is y(x) = «° Z,.(Bx), where Z, denotes the general solution of Bessel’s equation of the order a and 3 ¥ 0 is a positive 
constant. 


. Show that the equation y” + b’x°~y = 0 has the general solution 


1/a 1/a = 1/a 
pea) ea eC Pag ea) 
a a a a a 


where c1, C2 are arbitrary constants. 


Express the general solution to the Riccati equation y’+4a?+y* = 0 through Bessel functions. Hint: Make substitution 
ul = uy. 


Show that a solution of the Bessel equation (4.9.1) can be represented in the form J,(2) = «~'/? w(x), where w(x) 
satisfies the differential equation 
1 
aw" 4 (« v? 4 i) w= 0. 


Determine the general solution of the following differential equations. 


(a) ay” +y'+ay =0; (b) ay” + 2ay' + xy = 0; 

(c) ay” +y'+4ay =0; (a) 2?y” + 3ay'+ (1+ 2)y=0; 
(e) xy” + ay! 4 (Qa? 1)y = 0; (f) xy" + ay’ + 2xy = 0; 

(g) 2?y"+ay'+(8a7-§)y=0; (h) 2?y” 4+ 4ay'+(2+2)y = 0; 
(i) ay” +y'+Fy=0; (j) zy” + 2y'+ Fy =0; 

(k) xy" + 2y' + sary = 0; (1) a?y” + ay’ + (4x7 — 16)y = 0. 


Find a particular solution to the initial value problems: 

(a) 2 y" ty +(2?—-1y=0, y@)=1 

(b) 427 y"+(e@+1)y=0, y(4)=1. 

(c) cy” +3y’+ay=0, y(0.5) =1; Hint: the function 2~'Ji(2) satisfies this equation. 
(d) 4a?y" + 4ay’+(a—1jy=0, y(4)=1. 

(e) vty” +23 y'+y=0, y(0.5) = 3; Hint: use substitution t = 27". 

(f) ry” —3y'+ay=0, y(2)=1. 
Show that 4J//(x) = Jv_2(x) — 2JL(x) + I4a(x), v > 2. 


Verify that the differential equation 
cy +(1—-2v)y+ay=0, x«>0 


has a particular solution y = x” J) (x). 

Show that if Ny (2) = limy+n Nv(x), n = 0,1,2,..., where Ny is defined by Eq. (4.9.7), then 
1fa ‘ _ 10 n 

Nq(a) = + er [J.(2) — (-1) 1-(a))} = tim 22 [5,(x) — (-1)"J-v@) 


Ov von 7 OV 


v=n 


co 

Bessel’s equation of order 0 has a solution of the form y2(a) = Jo(x) Ina + > aya”. Prove that the coefficients in the 
k=1 

above series expansion are expressed as 


yo (x) = ee (1+ 5 feces i) ey + Jo(x) nz. 


1 
Find the general solution of y” + ss y + by = 0, x > 0, where a and b > 0 are constants. 
x 
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19. Show that if 
_4)n ,4n _4)n ,4n+2 
berz = >> oe beiz = 5— de 


dn-+2 2? 
2 +2 ((2n + 1)!) 


where j is the unit vector in the positive vertical direction, so j? = —1. The functions ber z and beiz are named 
after William Thomson, lst Baron Kelvin*?; they are implemented in Mathematica and Maple as KelvinBer|v, z] and 
KelvinBei[v, z], respectively. 


Show that the function Io(z /j) is a solution of the differential equation 


20. Find the general solution of the differential equation 


by making the change of variable y = u,/2. 


21. Find the general integral of the differential equation 


by making the change of variable y = xu. 


22. Find the general integral of the differential equation 


” x 9 _ 
v'+(c 7) u=0 


by making the change of independent variable 4e” = t?. 


23. Find the general solution of the differential equation 


Ay” + Oxy =0. 


24. Show that the Wronskian of Airy functions is Ai(x)Bi' (a) — Ai’ (x)Bi(x) = at 
7 


25. Show that the Airy functions satisfy the following initial conditions: Ai(0) = 37?/3/T (2), Ai(0) = a ma Ae (=) and 
Bit) = 3-4/1 (2), Br) = 3*/1 G). 


26. Use the transformations u = x“y and t = b2* to show that a solution of 
vu" + (1—2a) au’ + (b’e?x* + a? — k?c?)u =0 
is given by u(x) = Jy (ba*). 


27. By differentiating under the integral sign, show that the integral 


y(x) = cos (a sin @) dé 
0 


satisfies Bessel’s equation of order zero. 
28. Show that y = /Z Je (4 sv") satisfies 
vl?y” +y=0 for x>0. 


49Lord Kelvin (1824-1907) was a Belfast-born British mathematical physicist and engineer. 
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de 


10. 


11. 


12. 


13. 


14. 


Since the general solution of a linear differential equation of n-th order contains n arbitrary constants, there are two 
common ways to specify a particular solution. If the unknown function and its derivatives are specified at a fixed point, 
we have the initial conditions. In contrast, the boundary conditions are imposed on the unknown function on at least 
two different points. The differential equation with the initial conditions is called the initial value problem or the 
Cauchy problem. The differential equation with the boundary conditions is called the boundary value problem. 


The general linear differential equation of the n-th order (D = d/dz) 


Lz, Dly & an(a) y + ana (a) y"P +++ + ao(a) y(a) = f(a). (4.1.5) 


is said to be homogeneous if f(x) is identically zero. We call this equation nonhomogeneous (or inhomogeneous) 
if f(x) is not identically zero. 


Let functions ao(x), a1(x), ..., @n(x) and f(a) be defined and continuous on the closed interval a < x < b with an (x) £0 
for x € [a,b]. Let xo be such that a < zo < b and let yo, y1, .-., yn—1 be any constants. Then there exists a unique 
solution of Eq. (4.1.5) that satisfies the initial conditions y (ao) = yx (k =0,1,...,n—1). 


d 
. A second order differential equation is called exact if it can be written in the form aa F(z,y,y') = f(a). In particular, 
x 


: . . d 
a linear exact equation is ers [P(x) y’ + Q(x) y] = f(z). 
x 
An operator transforms a function into another function. A linear operator LZ is an operator that possesses the 
following two properties: 


L[y. + y2] = Llyi] + Lye], Ll[cy] = cLly], for any constant c. 


The adjoint to the linear differential operator L[y](x) = a2(x) y”(x) + ai(x)y’(x) + ao(x)y(x) is the operator L’ = 
D?a2(x) —Dai(x)+ao0(x), where D stands for the derivative. An operator is called self-adjoint (or Hermitian) if L = L’. 


Principle of Superposition: If each of the functions y1, y2,...,yn are solutions to the same linear homogeneous 
differential equation, 


an(x)y + an—1(v)y""? +++» + a1(x)y’ +a0(2)y=0 or L{z,D]y =0, (4.1.8) 


then for every choice of the constants c1,c2,...,Cn the linear combination c1y1 + coy2 +--:+CnYn is also a solution. 


A set of m functions f1, f2,..., fm each defined and continuous on the interval |a,b| (a < 6) is said to be linearly 
dependent on |a, b| if there exist constants a1,a2,...,@m, not all of which are zero, such that aifi(x) + a2fo(x) + 
--+ + Qmfm(x) = 0 for every x in the interval |a,b|. Otherwise, the functions fi, f2,..., fm are said to be linearly 
independent on this interval. 


Let fi, fo,...,fm be m functions that together with their first m — 1 derivatives are continuous on the interval 
|a,b|. The Wronskian or the Wronskian determinant of f1, fo,..., fm, evaluated at x € |a,b|, is denoted by 
Wf, fe,---,fm](x) or W (fi, fe,---,fm}£) or simply by W(x) and is defined to be the determinant (4.2.1), page 199. 


Let fi, fo,..., fm be m functions that together with their first m— 1 derivatives are continuous on the interval |a, b|. If 
their Wronskian W[f1, f2,---, fm|(xo) is not equal to zero at some point Xo € |a,b|, then these functions fi, fo,..., fm 
are linearly independent on |a,b|. Alternatively, if fi, fo,..., fm are linearly dependent and they have m — 1 first 
derivatives on the interval |a, b|, then their Wronskian W (fi, f2,..., fmjx) = 0 for every x in |a, d]. 

Let y1, y2,---, Yn be the solutions of the n-th order differential homogeneous equation (4.1.8) with continuous coefficients 
ao(x), a1(x), ..., Q@n(x), defined on an interval |a,b|. Then functions yi, y2,...,Yn are linearly independent on this 
interval |a,b| if and only if their Wronskian is never zero in |a,b|. Alternatively, these functions y1, y2,...,Yn are 


linearly dependent on this interval |a, b| if and only if W[y1, y2,---,Yn](x) is zero for all x € |a, b. 


For arbitrary set of linearly independent smooth functions y1, yo,..., Yn, the linear differential operator of order n that 
annihilates these functions is - 
Lix,D)f = [yr Y2,-++5 nsf] 
Wim, ye, eine Yn! 


The Wronskian of two solutions to the differential equation of the second order (4.2.2) satisfies the Abel identity (4.2.3), 
page 201. 


Any set of solutions y1, y2, ..-,Yn is called a fundamental set of solutions of (4.1.8) if they are linearly independent on 
some interval. If yi, y2, -..,Yn is a fundamental set of solutions of Eq. (4.3.2), then the formula 


y(x) = Ciyi (x) + Coya(x) +--+ + Cryn(2), 


where C),C2,...,Cn are arbitrary constants, gives the general solution of L[x,D]y = 0. 
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Summary for Chapter 4 


There exists a fundamental set of solutions for the homogeneous linear n-th order differential equation (4.3.2) on an 
interval where the coefficients are all continuous. 


Let yi, y2,---;Yn be n functions that together with their first n derivatives are continuous on an open interval (a, 6). 
Suppose their Wronskian W (yi, y2,..-,Ynjx) # 0 for a < x < b. Then there exists the unique linear homogeneous 
differential equation of the n-th order for which the collection of functions y1, y2,..., Yn is a fundamental set of solutions. 


To any linear homogeneous differential equation with constant coefficients, 
An y™ + ni yr) ferret aoy(x) — 0, (4.4.2) 


corresponds the algebraic equation an” + GQn—1A7 1 eee + ao = 0, called the characteristic equation. 
If all roots of the characteristic equation are different, the set of functions 


AL AQz 


* yo(x) =e , ttt, Yn(x) =e 


Anz 


yi(z) =e 
constitutes the fundamental set of solutions. 


If the characteristic polynomial L(A) = anA” + Gn—1A"~ 1 +--+ + a1A + ao of the differential operator L[D] = anD” + 
dn—1D"~ 1 +--+ + a0, D stands for the derivative operator, has a complex null \ = a +j, namely, L(a + j8) = 0, then 


yt =e” cos Bax and yo = e°* cos Bx 


are two linearly independent solutions of the constant coefficient equation L[D]y = 0. 


If the characteristic polynomial L(A) = anX” +an—1A"~ | +--+ +a0 = an(A—A1)™ -- +» (A—As)™* for constant coefficient 
differential operator L[D] = anD” + an—1D"~* +---+ ao has a null \ = X; of multiplicity mx, that is, L(A) contains a 
factor (A — Ax)’""*, then the differential equation L[D]y = 0 has mx linearly independent solutions: 


Ape 


m,p-l 
Ce Ber. - kale 


r 
ek 


If a solution yi (x) to the differential equation with variable coefficients an (x)y™ + any" +---+aoy = 0 is known, 
then the Bernoulli substitution y = vy1 reduces the given equation to an equation of order n — 1. 

The expression D(a) = 2p’(a)+p?(x)—4gq(z2) is called the discriminant of the differential equation y” +p(x)y’+q(x)y = 
0, which can be reduced to the Riccati equation w? — 2w’ = D(x), w = y"/y’. 


Two linearly independent solutions, yi(x) and y2(x), to the second order differential equation y” + p(x)y’ + q(x)y = 0 
can be expressed via w, the solution to the Riccati equation w? — 2w’ = 2p’ + p? — 4q: 
1 —2 2 
yi =expy—5 [ (plz) +w)dre, yal) =m (2) f yi“ (x) exp) — f p(t) dt ¢ de. 
xo 
The linear differential equation 
zany ae re A cia bess tary +aoy = 0, (4.6.15) 


called the Euler equation, can be reduced to a constant coefficient case either by substitution x = Int, or seeking a 
particular solution in the form y = x", for some power r. 


The general solution of the nonhomogeneous linear differential equation (4.7.1) on some open interval (a, b) is the sum 


y(2) = yr(x) + yp(2), (4.7.4) 


where yp(x) is the general solution of the associated homogeneous differential equation (4.7.2), and yp(x), which is a 
particular solution to a nonhomogeneous differential equation. The general solution y;,(x) of the homogeneous equation 
(4.7.4) is frequently referred to as a complementary function. 


The method of undetermined coefficients is a special method of practical interest that allows one to find a particular 
solution of a nonhomogeneous differential equation. It is based on guessing the form of the particular solution, but with 
coefficients left unspecified. This method can be applied if 


(a) Equation (4.7.11) is a linear differential equation with constant coefficients; 


(b) the nonhomogeneous term f(z) can be broken down into the sum of either a polynomial, an exponential, a sine 
or cosine, or some product of these functions. In other words, f(x) is a solution of a homogeneous differential 
equation with constant coefficients, ~[D] f = 0. 


There are several steps in the application of this method: 
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(a) For a given linear nonhomogeneous differential equation with constant coefficients 
any' (x) + an—ry"? (a) +++ + aoy(a) = f(2), 
find the roots of the characteristic equation L(A) = 0, where 
E(X) = Gg A" Ggad™ baa. 


(b) Break down the nonhomogeneous term f(a) into the sum f(x) = fi(a) + fo(x) +---+ fm(«) so that each term 
fi(x), j =1,2,...,m, is one of the functions listed in the first column of Table 229. That is, each term fj (2) has 
the form 

fi(x) = Pe(x)e** cosBx or f;(x) = Py (x) e** sin Ba, 
where P(x) = coz® + a*~!+---+ cx is a polynomial of the k-th order, and o = a +j6 is a complex (8 4 0) or 
real (6 = 0) number, called the control number. 


(c) Let the control number o; of the forcing term f;(x), 7 = 1,2,...,m, be the root of the characteristic equation 
L(A) = and” + GQn—1A"~' +--+ + a9 = 0 of the multiplicity r > 0. We set r = 0 if o; is not a root of the 
characteristic equation. It is said that there is a resonance case if r > 0, and a nonresonance case if r = 0, that 
is, the control number is not a null of the characteristic polynomial. Seek a particular solution yp;(x) in the form 


Ypj(@) = x" e** [Rix (x) cos Bx + Rox (x) sin Ba], 
where Rix(x) and R2x(x) are polynomials of the k-th order with coefficients to be determined. 
(d) Substitute the assumed expression for yp;(x) into the left-hand side of the equation L[D] yp; = f; and equate 
coefficients of like terms, where D = d/da. 
(e) Solve the resulting system of linear algebraic equations for the undetermined coefficients of polynomials Rix (x) 
and R(x) to determine yp; (x). 


(f) Repeat steps (b) — (e) for each 7 = 1,2,...,m, and sum the functions yp(%) = ypi(@) + Yp2(a) +--+ + Ypm(x) to 
obtain a particular solution of the driven equation L[D]y = f. 
The method of variation of parameters (or Lagrange’s method) for solving the inhomogeneous differential equation (4.1.5) 


requires knowledge of the fundamental set of solutions y1, y2, ..., Yn for the corresponding homogeneous equation (4.1.8). 
With this in hand, follow the following steps. 


A. Form a particular solution as a linear combination of known linearly independent solutions 
Yp (x) = Ar(a)yr (a) + A2(a)yo(x) +--+ + An(x)yn (a) (A) 
with some yet unknown functions Ai(a), A2(x),..., An(a). 


B. Impose n auxiliary conditions on the derivatives of the unknown functions 


Aiys(x) + Aoye(x) +++ + Anyn(e) = 0, 
Aiyi (x) + Aoyo(x) +--+ Anyn(e) = 0, 
wae = 0 
Ait a) + Aays” (@) +++ Anu (2) = 0, 
Aig”? (a) + Aus” (ae) +--+ Any (az) = f(a) /an(z). 
C. Solve the obtained algebraic system of equations for unknown derivatives Aj, A), ..., A’. 
D. Integrate the resulting expressions to obtain Ai(x), Ao(x), ..., An(x) and substitute them into Eq. (A) to determine 


the general solution. 
28. The Green function for the linear differential operator is uniquely defined either by the formula (4.8.15), page 243, or 
as the solution of the initial value problem formulated in Theorem 4.23 on page 243. 
29. The differential equation 
ay” +ry+ (2? —v*)y=0, (4.9.1) 


where v is a real nonnegative constant, is called the Bessel equation. 


30. The Bessel equation (4.9.1) has two linearly independent solutions 
ove) (-1)* a 


I(x) = 2 (4.9.5) 
7 T(v+k+1) 
called the Bessel function of the first kind of the order v, and 
Nox cosum Jy (x) — dake (4.9.7) 


sin vt 
called the Neumann function. Its constant multiple Y,(x) = 7 N,(z) is called the Weber function. 
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Review Questions 


Review Questions for Chapter 4 


In all problems, D is the derivative operator, that is, Dy = y’; its powers are defined recursively: D” = D(D"~*), n = 1,2,..., 


with D° being the identical operator, which we drop. 
Section 4.1 


Ii, 


In each of the following initial value problems, determine, without solving the problem, the longest interval in which 
the solution is certain to exist. 


(a) cosy tay +%y=1, ¥(0)=1, ¥'(0)=0; (b) (@—1)?y’tey!—2?y=sine, y(2)=1, y'(2) <0; 
(c) (2? —a)y"+ay=sinz, y(—1)=0, y/(-1) =1. 

(d) sinzy” +ay’+a?y=2%, y(0.1) =1, y/(0.1) =2. 

(e) vy" +a7y'+a%y=sin22, y1)=1, y/(1)=2. (f) (at+1)(a—-2)y"+ary'+5y=cosz, y(0) =1, y/(0) =—-1. 
(g) (@?-4)y”—2(e+2)y'+(e@-2)y=0, y(0)=-1, y’(0) =1. 


. By changing the independent variable t = y(x), reduce the given differential equation to the equation without the first 


derivative. 
(a) y"—2y’ +e y=0; (b)  2?y” + ay’ + (2? —v*)y =0; 
(c) ay” +2y’+ay=0; (d) 2’?y”— ay’ +9ry =0. 


. Changing the independent variable, reduce the given variable coefficient differential equation to a constant coefficient 


equation. 
(a) y”—y' +e7*y =0; (b) (1—27)y" — ay’ + 4y = 0; 
(c) (l+2?)y’+ay'+9y=0; (d) (2?—1)y”+2y— 16y =0; 
(e) 2ay” +y! — 2y =0; (f)  aty” + 2a°y! — 4y = 1/2; 
(g) ay” —y' —42°y =0; (h)  a?y" + (e+ 2°/(1—27)) y +4(1— 27)y = 0; 
(i) (1+27)y"+ay'+4y=0; (j) xy” + (2? - 1)y’ — 6r°y =0. 


qe+r 


. For what values of parameters p, g, and r does the differential equation wu” + == u’ + Torry U = 0 have an integrating 


ax+b 

factor p(%) = (ax + p)1-Pb/ a” ePr/a9 
By the use of a Riccati equation (4.1.23), find an integral for each of the following differential equations. 

(a) y”—y! (2+4e?*) —5e** y=0; (b) zy” In2x—y’ — 9ry(In 2x)? = 0; 

(c) 2° y" — ay’ + 2y = 0; (d) zy" — 2y’ — 4a°y = 0; 

(e) ay’ + (22? +a—-2)y’+a3y=0; (f)  y” — 2y'— 16e**y = 0. 
With the substitution y = y(a)v(x), transfer each of the following differential equations into an equation that does not 
contain the first derivative. 


(a) (a? +1)y" + 4ay’ + (4a + 6)y = 0; (b)  x?y" + (Qa? — 2x)y’ = (2a — 2 — 5a”)y; 
(c)  y" — by + (9 — 2/a)y = é (d) (x? +9)y" — day’ + 6y = 0; 
(e) y” —2y’ si caer #) = 4 sins (f)  2?y”- 22 y= (2-~e - 4) 
(g) «?y” — 4a3y’ = (6 — 4a* + 22”)y; (bh) vy” +2y’+ ay =0. 
Solve the following linear exact equations. 
(a) (w@+y"+ 2y' +sinz = 0; (b) w(x + Vy" + (4x ; ae 4 Qy =e"; 
(c) ay" + 6x? y' + Gry = 1227; (d) te ae cP = 
(e) y+ ta)y' ty =0; (f) y’ ‘+ Teer! + 2059 y = der; 
(g) yy” +cotry’ 7 wesc! x =0; (h)  sinzy” +sinzy=cosz; 
(i) ay" + 12, Me ora =1  () (+e) y"(2e* +2)y' +y = f(a); 


(k)  aty” + (42° 4 2x)y’ + 2y=0; (1) (sinx)y” + (cosa+seca)y’ + ysecxtanx = cosa. 


Solve the following nonlinear exact equations. 


@) 24 In(L + 2*y') 4 wa =0; (b) ayy” + (wy’)? + deyy! + y? = 0; 
(c) +-4SmxX=2, (d)  zyy” + a(y’)? + yy’ = 2a. 


Find an ite fae that reduces the given differential equation to an exact equation and use it to determine its 
general solution. 

(a) y" — Fy =e™ (bY oy + (14+ 1/a)y! + y/a = 30. 

(c) y” — (sec wy! = = L. (d)  aty” 4+ 2a y' = —k/x* (k > 0). 

(e) (#4 zy "4 (a? + 4a + 3)y! + doy = 5x? + 8a + 3. 
(f) (3a +a7)y” + Co 4a — a?)y’ = 44 4y + dary. 


(g) (a+ )y"” +3y’ by/e=1/vi +a. 
(bh) (y/)?(1 +a? +y?) + 4% (1+ y? — 327) + yy"(1+ 2? +y?) = 22”. 
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10. 
11. 
12. 
13. 
14. 


15. 


16. 


17. 


Reduce the coefficient of y in y” + xy = 0 to unity. 


In the answer to the preceding problem, reduce the coefficient of the first derivative to zero. 


Reduce the coefficient of y in y” +2” y = 0 to unity. 
Reduce the coefficient of the first derivative in the equation of the preceding problem to zero. 


Suppose that yi(#) and y2(x) are solutions of an inhomogeneous equation L{y] = f, where L is a linear differential 
operator and f(a) is not identically zero. Show that a linear combination y = C; yi (a) + C2 y2(x) of these solutions 
will also be a solution if and only if Ci + C2 = 1. 


Solve the differential equations 


d du Aa? r? — 1 d 2 du 7 eee 
(a) 2 (St) + SEH o w) Z(FS +a°r°u=0; 


using the change of variable u = v(r)/./r and u = v(r)/r, respectively. 


2 2 
Consider the partial differential equation (called the Laplace equation): is + a = 0. Show that the substitution 
z y 
u(x,y) = r/o f(ax— By), where a and £ are positive constants, reduces the Laplace equation to the ordinary differential 
2 
equation a ~ >» +qf=0, with p=1/(a? + 6?) and q=p/a?. 


Formulate a differential equation governing the motion of an object with mass m = 2 kg that stretches a vertical spring 
4cm when attached and experiences a resistive force whose magnitude is three times the square of the speed. 


Section 4.2 of Chapter 4 (Review) 


1. 


12. 


Determine whether the given pair of functions is linearly independent or linearly dependent. 
(a) 203 & —5a: (b) ©?” & e~**; (c) t&t' (t 40); 
(d) (a#+1)(#— 2) & (Qa -1)(a +2); (e)  e8* & +9), (Ff) t&tln|[t| (40). 

Prove that any three polynomials of first degree must be linearly dependent. 

Are the following functions linearly independent or dependent on the indicated interval? 

(a) arctan(x) and arctan(2z) on (—4,4). (b) 2” and z* on (0,00). 


c) Vl—<2? and x on (-1,1). d) 1 and In on (—oo — 1). 
( ) ? z+1 

(e) Ina and log, on [1, 0). (f) «+1 and |x +1] on [—1,00). 
Write Abel’s identity (4.2.3) for the following equations. 

(a) y" — 4y' + 3y = 0; (b)  2y” + 4y'—y =0; 

(c) ay" —2xy'+a°y=0, (d) (a? —l)y" + 4cy' +27 y =0; 


(e) (2x? — 3a —2)y”’+5y’+ay=0; (Ff) (#? —4)y”+2y' —5y =0. 
Find the Wronskian of the two solutions of the given differential equation without solving the equation. 
(a) y’+y'/e+y=0; (b) (2? 4+1)y"-—ay'+(e#+2)y=0. 


If y; and y2 are two linearly independent solutions of the differential equation 


a(x) y" + ar(x)y! + ao(x) y = 0 
with polynomial coefficients ao(#), a1(x), and a2(x) having no common factor other than a constant, prove that the 
Wronskian of these functions yi and y2 is zero at only those points where a2(x) = 0. 


The Wronskian of two functions is W(«) = #? +2x+1. Are the functions linearly independent or linearly dependent? 
Why? 


Suppose that the Wronskian W[f,g] of two functions is known; find the Wronskian W[u,v] of u = af + bg and 
v=af + Bv, where a, b, a, and 6 are some constants. 


If the Wronskian of two functions f and g is x sin, find the Wronskian of the functions u = 2f + g and v = 2f — 3g. 


Prove that the functions e 
n=0,+1,+2,.... 


ale cee, and e?* cosx are linearly independent on any interval not containing oT + nn, 


ax 


. Prove that if a and @ are distinct real numbers, and n and m are positive integers, then the functions e*”, re°”, ..., 


m—1 be 


a”—te®, and e8*, xe®*,..., are linearly independent on any interval. 


Are the functions fi(x) = |x|? and f(x) = x? linearly independent or dependent on the following intervals? 
(a) O<a<l1, (b) -l<ax<d0, (ce) -l<aeK<l. 
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13. 


14. 


15. 


16. 


17. 


18. 
19. 


20. 
21. 


22. 


23. 


24. 
25. 


26. 


Review Questions 


Prove the generalization of the previous exercise. Let f be an odd continuous differentiable function on an interval 
|-—a,al, a > 0, (that is, f(—x) = —f(a)) and that f(0) = f’(0) = 0. Show that the Wronskian W[f,|f|]() = 0 for 
x €|—a,al, but that f and |f| are linearly independent on this interval. 


What is the value of constant W [y1, y2](#o) in the formula (4.2.3) when two solutions yi, y2 satisfy the initial conditions 
yi(to) =a, yi(to)=b, y2(eo) =a, yo(xo) = 8? 

Find the Wronskian of the solutions yi, y2 to the given differential equation that satisfy the given initial conditions. 
(a) vy" +ay'+(L—2)y=0, yr(1)=0, yi(1) = 1, y(1) = 1, (1) = 1. 

(b) (1—2*)y" — 2ay' + 6y=0,  y1(0) = 3, 1 (0) = 3, y2(0) = 1, y3(0) = -1. 

(c) a?y" —3ay'+(1+a)y=0, m(1)=1, vi(-1) = 1, yo(-1) =0, (-1) = 1. 

(d) y” — (cosx)y’ + 4(cot x)y=0, y1(0) = 1, 910) = 1, y2(0) =0, y2(0) = 1. 

(e) y+ 2ay’+a°y=0, yn(0) =1, (0) = 1, y2(0) = 0, (0) = 1. 

(f) VI + a3 y" —307y’+y=0, y(0) =1, 91 (0) =1, y2(0) = —1, ¥3(0) =1. 

(g) (L-2*)y" —30y'+8y=0, (0) =—1, 1(0) =, y2(0) = 0, yo(0) = 1. 


Let wi, u2,..., Un be linearly independent solutions to the n-th order differential equation yO +dn-1 (x)y-P feet 
ao(x)y = 0. Show that the coefficient functions ax(x), k = 0,1,...,n —1, are uniquely determined by w1, u2, ..., Un. 


Let f be a continuous differentiable function on |a,b|, a < b, not equal to zero. Show that f(x) and xf(«) are linearly 
independent on |a, b|. 

If the Wronskian W[f,g] = t® e”* and if f(t) = t?, find g(t). 

Verify that yi(x) = 1 and ye(z) = x'/* are solutions to the differential equation yy” + 2(y’)? = 0. Then show that 
C1 + Cox'/? is not, in general, a solution of this equation, where C; and C2 are arbitrary constants. Does this result 
contradict Theorem 4.4 on page 191? 

do Mn 


Find the Wronskian of the monomials ayx!, azn cesar 


Let fi(a), fe(x), ..., fn(a) be functions of x that at every point of the interval (a,b) have finite derivatives up to the 
order n — 1. If their Wronskian W[/f1, f2,..., fn](@) vanishes identically but W[fi, fe,---,fn—1](x) # 0, prove that the 
set of functions {f1, f2,..., fn} is linearly dependent. 


Show that any two functions from the set (j? = —1) 
filx) =e", fo(a) =e", fa(x) =sina, fala) = cosa 


are linearly independent, but any three of them are linearly dependent. 


If {u(x), v(x)} is known to be a fundamental set of solutions for a second order linear differential equation y” + p(x) y’ + 
q(x) y = 0, what conditions must a,b,c,a satisfy so that {au + bv,cu + dv} is also to be a fundamental set for the 
same equation? 

Let u(x) and v(x) be differentiable on the interval |a, b|, and let W[u, v] be their Wronskian. Show that 44 = uv" —vu". 


For given three linearly independent functions y1(x), y2(x), and y3(«), derive a linear differential equation 


y+ p(2)y" + q(x) y' +r(a)y =0 
for which these three functions are its solutions. In other words, express three functions p(x), q(x), and r(a) in terms 
of {y1, y2, yz} assuming that their Wronskian is not zero. 


Let f and g be two continuously differentiable functions on some interval |a,b|, a < b, that have only finitely many 
zeroes in |a,b| and no common zeroes. Prove that if W[f, g](x) =0 on |a, b|, then f and g are linearly dependent. Hint: 
Apply the result of the previous problem to the finite number of subintervals of |a, b| on which f and g never vanish. 


Section 4.3 of Chapter 4 (Review) 


1. 


Prove that every solution to the differential equation #(t) + Q(t)a(t) = 0 has at most one zero in the open interval (a, b) 
if Q(t) < 0 in Ja, dj. 


Prove that zeros of two linearly independent solutions to the equation #(t) + Q(t)x(t) = 0 separate each other, that is, 
between two sequential zeros of one solution there is exactly one zero of another solution. 


If y: and ye are linearly independent solutions of (a? +1)y” + zy’ + y =0, and if W[y1, y2](1) = V2, find the value of 
Wyn, y2](2). 


Review Questions for Chapter 4 
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4. Suppose that two solutions, yi and y2, of the homogeneous differential equation (4.3.1) have a common point of inflection; 


prove that they cannot form a fundamental set of solutions. 


5. Let yi and y2 be two linearly independent solutions of the second order differential equation (4.3.1). What conditions 
must be imposed on the constants a, b, c, and d for their linear combinations y3 = ayi + by2 and ya = cyi + dy2 to 


constitute a fundamental set of solutions? 


Section 4.4 of Chapter 4 (Review) 


1. Write out the characteristic equation for the given differential equation. 


(a) 2y"—y'+y=0; (b)  y"+2y'-3y=0; (ce) 4y%+y=0; 
(d) y"+5y'+y=0; (e)  3y'” + 7y =0; (f) y@ — 3y” + 5y =0; 
(g) Ty” +4y'-2y=0; (hb) y”—3y"+4y=0; (i) ay $y" — dy" +y' =0. 


give the form of the general solution. 
(a) 8\?+14\—-15=0; (b) 347—5A4+2=0; (c) 27-A-3=0; 
(d) 9\7-3A\—-2=0; (e) A?-—444+3=0; (f) A2+2\—-8=0; 
(g) A®4+3A4+10=6\?; (h) 2427-7? -5A=2; (i) BAP 411° 4553. 


3. Write the general solution to the given differential equations. 


(a) y+ 3y" —y' —3y=0. (b) 4y’” — 13y’ + 6y = 0. 
(c) 2y'” + y" — By’ — 4y =0. (d) - + 2y" — 15y’ = 0. 
(e) 4y —17y" + 4y’ =0. (f)  2y — 3y’” — 4y” + 3y’ + 2y = 0. 


(g) Me + 5y” — 8y/—12y=0. — (h) ie Qy!” — 13y" + 28y' — 24y = 0. 
(i) 4y — By” — y"” +2y’=0. (j)  4y + 20y'” + 35y” + 25y’ + 6y = 0. 


4. Solve the initial value problems. 


10. 


11. 


(a) J+ 8y—9y =0, y(0) =0, y(0) = 
(c) 24 — lly —6y = 0, y(0) = 0, ¥(0) 
(e) 49 —17y + 4y = 0, y(0) = 1, yO) = 


: (b) 37 —8y-3y=0, y(0) = 10, 4(0) = 0; 
13; (d) 3-179 —6y =0, y(0) = 19, 4(0) =0; 
4; (f) 5y— 24y —5y =0, y(0) = 1, y(0) = 


10; 


. Find the solutions to the given [VPs for equations of degree bigger than 2. 

(a) y” — 2y" — 3y' =0, y(0)=0, y'(0) =0, y"'(0) = 12 

(b) y” +y" — 16y' — 16y = 0, y(0)=0, y'(0) =8, y""(0) =0 

(c)_y — 13y"" + 36y = 0, y(0) =0, y'(0) =0, y'"(0) = —10, y"(0) =0 
(d) y - By" + by!" + By! = by, y(0) =0, y'(0) =-1, y"(0) = -3, y!"(0) = -7 
(e) 10y"” + y" — 7y' + 2y = 0, y(0)=0, y/(0) =-6, y"(0) =3 

(f) 4y — 15y” + 5y' + By = 0, y(0) =0, y/(0) = -16, "(0) = -4, y'"(0) = 4 
(g) 3y!” — Ty" — Ty’ + 3y = 0, y(0)=0, y'(0) = -12, y'"(0) = —56 


. Ifa > 0, prove that all solutions to the equation y’’ + ay’ = 0 approach a constant value as t > +00. 
Find the fundamental set of solutions of the following equations specified by Theorem 4.12. 
(a) g=9y; (b) 39-10y¥+3y=0; (c) 2W-Ty-4y=0; (d) 6Y-—y-—y=0. 
Find the solution of the initial value problem 
2y" +3y'-y=0, y(0)=3, y'(0)=-1. 
Plot the solution for 0 < t < 1 and determine its minimum value. 
Find the solution of the initial value problem 
3y” —10y'+3y=0, y(0)=2, y'(0) =—2. 


Then determine the point where the solution is zero. 


. The characteristic equation for a certain differential equation is given. State the order of the differential equation and 


Solve the initial value problem 3y” + 17y'’-—6y=0, y(0) =3, y’/(0) =—8. Then find the smallest positive value of 


6 for which the solution has no minimum point. 
Construct the general form of the solution that is bounded as t + oo. 
(a) y"—4y=0; (b) y”=0; (ce) 3y"+8y'—3u=0; (d)  2y”—Ty’— 4y=0. 
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12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


Review Questions 


Solve the initial value problem 2y’ — y’ — 6y = 0, y(0) = a, y’(0) = 2. Then find the value of a so that the solution 
approaches zero as t > oo. 
Solve the initial value problem 6y” — y’ — y = 0, y(0) = 3, y’/(0) = 6. Then find the value of 6 so that the solution 
approaches zero as t > oo. 
For the following differential equations, determine the values of y, if any, for which all solutions tend to zero as t > co. 
Also find the values of +, if any, for which all (nonzero) solutions become unbounded as t + ov. 

(a) y+ 2(1—~y)y'—7(2-y)y=0; (b) 9” -yy'-(1+7)y =0. 
Solve the initial value problem y” — 4y’ + 3y = 0, y(0) = 1, y’(0) = 6 and determine the coordinates rm and ym of 
the minimum point or the solution as a function of {. 


Solve the initial value problem = y” + y’ —2y =0, y(0) =a, y’(0) = 1. Then find the value of a so that the solution 
approaches zero as t + oo. 


Solve the initial value problem 3y” + 4y’ —4y =0, y(0) =1, y’(0) = 8. Then find the value of 8 so that the solution 
approaches zero as t > oo. 
For the following differential equations, determine the values of y, if any, for which all solutions tend to zero as t > co. 
Also find the values of +, if any, for which all (nonzero) solutions become unbounded as t —- oo. 

(a) y”—2y'+ (7? -Vy=0; (by oy" + +9)y'- (7+ 2)y =0. 
Solve the initial value problem  y” — 6y’ + 8y = 0, y(0) = 1, y’(0) = @ and determine the coordinates rm and ym of 
the minimum point or the solution as a function of (. 


Consider the initial value problem 
y"+5y+4y=0, yO)=1, y(0)=6>0. 


a) Solve the initial value problem. 


(c 


(d) What is the behavior of zm and ym as 8 +> +00? 


(a) 
(b) Determine the coordinates x and ym of the maximum point of the solution as function of . 
) Find the smallest value of 6 for which ym > 2. 

) 


Section 4.5 of Chapter 4 (Review) 


1. 


The characteristic equation for a certain differential equation is given. Give the form of the general solution. 


(a) °4+9=0; (b) A? +21 +4+2=0; (c) A? 4+444+5=0; 
(d) 4\?-4\+5=0; (e) 16\?+8\4+145=0; (f) 9A? -6A+10=0; 
(g) A? —-2\+5=0; (h) A? +4445 =0; (i) 4\? -4\+17=0; 
(j) 9A? +6A+145=0; (k) A?—6A+13=0; (I) A? -6A+10=0. 


. Write the general solution of the given differential equation (a is a parameter). 


(a) 4y” — 4y’ + 3y = 0; (b) 64y” — 48y’ + 13y = 0; 

(c) oy” +6y"+13y=0; (d) oy” +3y"+y'+3y =0; 
(e) y”+10y'+26y=0; (fF) y” — 2ay’ + (a? + 9)y = 0; 
(g) 4y”—24y'+37y=0; (h) 9y” — 18y’ + 10y =0. 


Solve the initial value problems (dot stands for the derivative with respect to t). 


(a) 99 — 24y + 16y =0, y(0) =0, y'(0) =1; (b) 4¥-—129+9y=0, y(0) = 2, y’(0) =3; 
(c) y+18y+8ly=0, y(0)=0, y/(0)=1; (d) 9¥— 42y+ 49y = 0, y(0) =0, y/(0) =1. 


Find the solutions to the given initial value problems for equations of the third order. 


(a) y"” — 3y' — 2y =0, y(0)=0, y(0)=0, y"(0)=9. 
(b) y" +y" =0, y(0)=0, y(0)=0, y"(0)=1. 
(c) 18y"" i + 20,’ — 4y = 0, y(0)=4, y(0)=-1, y"(0)=-3 
(a) ee + By” — Ly’ +3y =0, y(0)=1, y/(0)=1, (0) =13 
Find the general solution of the given differential equations of order bigger than 2. 

(a) oy! +y" +4y' + 4y =0. (b)  4y’”" + 12y” + 13y’ + 10y = 0. 

(c)  16y’" + 5y”+17y'+13y=0. (d) 12y — 8y'"” —6ly” — 64y’ — 15y = 0. 

(e:) yy" +y'-y=0. (f)y” — 2y""— 3y/ + 10y = 0. 


(g)  y!” —2y"” +y' —2Qy =0. (h)  8y + 44y’” + 50y” + 3y’ — 20y = 0. 
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6. If a is a real positive number, prove that all solutions to the equation y” + a2y = 0 remain bounded as x — +00. 
7. Rewrite the function y(t) = sint + cost in the form y(t) = A cos (St — 6). 
The following problem requires access to a software package. 


8. Find the solution to the initial value problem 4y” — 12y’ + 73y = 0, y(0) = 2, y’(0) = 3, and determine the first time 
at which |y(t)| = 1.0. 


9. Consider an electric circuit containing a resistor, an inductor, 
and a capacitor in series with a constant electromotive force 
(see figure at the right). Applying Kirchhoff’s law (see §6.1.1), 
we obtain 

iat q 


= _ dq 
poe C es Bae 


where q(t) is electric charge, and I(t) is the current in the 
circuit. Let L = 2 henries, R = 248 ohms, C = 10~* farads, 
and EF = 127 volts. Determine the current I(t) in the circuit 
assuming that no charge are present and no current is flowing 


at time t = 0 when F is applied. 


Section 4.6 of Chapter 4 (Review) 


1. The factored form of the characteristic equation for certain differential equations is given. State the order of the 
differential equation and write down the form of its general solution. 


(a) (A-3)’; (b) (A- a0 +1)"; (c) (A+9)?; 
(qd) (7421-42): (e) (\?+4); (fy) (2-5); 
(g) (2-9)? +2445)?; (bh) (A- yO? +4)?; (i) (A—4)(? +4); 
(Gj) (Q? 4140? -— 4448); (kK) QA- 1) (40° + 1)?; (1) (4A? + 12A + 10); 
(m) (\7— 1)? +44+5)?; (mn) (A+1)°(- 8); (0) (A? —4)’. 


2. Write the general solution of the given differential equation. 


(a) 9y” — 12y’ + 4y = 0; (b)  2y"” —2V2y'+y=0; (c) Sy" + 6y" +y=0; 

(d)  4y" — 12y' + 9y = 0; (e) yy” —2/5y'+5y=0; (f) 9” +3y” — 4y =0; 
(gs) y\? — By” + 16y = 0; (h) y+ 18y” + 8ly=0; (i) yO ty" + y" =0; 
Gj) y® = 4y'" + by" = 4y'—y; (ky) y +2y"” + y' =0; QQ) y® + 6y’” + 9y” =0. 


3. Find the solution of the initial value problem. 


(a) y” —6y'+9y=0, y(0) =0, y/(0) =1. (c) 4y”+12y’+9y=0, y(0) =2, y’(0) =0. 
(b) 16y”+24y'+9y=0, y(0)=0, y/(0) =4. (d) 9y” —6y’+y=0, y(0) =0, y’(0) =3. 


4. Find the solutions to the given initial value problems for equations of the third order. 


a) y’” + Gy” + 12y’ + 8y = 0, y(0)=0, y'(0)=0, y"(0) =2. 
b) y+ y" =0, y(0)=0, y’(0)=0, y"(0)=1. 

c) 18y’" — 33y” + 20y' — 4y = 0, y(0)=4, y/(0)=—-1, y"(0)=-3 
d) 4y’” + 8y” — lly’ + 3y = 0, y(0)=1, y'(0)=1, y"(0) = 13. 
e) 4y/" — 27y” + 27y = 0, y(0)=—3, y/(0)=—-7, y"(0)=6 
f) 4y'” + 12y” — 15y’ + 4y = 0, y(0)=1, y'(0)=0, y"(0) = 20. 


5. Find the solutions to the given initial value problems for equations of the fourth order. 
(a) 9 — dy!” + By” — By’ +4y=0, — y(0) = 1, y’(0) = 1, (0) =0, (0) =3. 
(b) a + 3y'"—4y’=0, y(0) = 0, y'(0) = 2, y"(0) = —3, y!"(0) = 13. 
6. Determine the values of the constants ao, ai, and a2 such that y(x) = ao + aixz + a2x” is a solution to 
(9 + 4a”)y” — 8y = 0 


and use the reduction of order technique to find a second linearly independent solution. 
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10. 


11. 


12. 


13. 
14. 


15. 


16. 


17. 


18. 


19. 


20. 


Review Questions 


Find the fundamental set of solutions for the following differential equations. 


(a) y"+y//e-ZB+1/2%)y=0; (db) 2%y"—2y=0; 
(c) ay” —(x+2)y’ + 2y = 0; (d) ay” +(e@4+2)y’+y=0. 

One solution y: of the differential equation is given; find another linearly independent solution. 
(a) ty” +(1— 2a)ty’ + a?y =0, y(t) = #*. (b) y” = (2cotx—tanz)y’, yi = 1. 


200 aw” 


(ce) 2?y"” —2Qary'+ (a? +2)y=0, yr=asing. (d) y’—e?y' =0, m=. 


(e)  4a%y" + dary’ + (407 -—1)y=0, y = SF. (f) zy” —y' =0, yr =1. 


(g) (¢-1)?¥+6y=4(t-1y, yw =(t-1)?. (hb) cy” t+y'=0, mw =In{2I. 
(i) 2? y”—ay’+y=0, yw=az nial. (j) 2° y” + 3ay’ — 38y =0, y. =a. 


Suppose that one solution, yi(zx), to the differential equation y” +p(x)y’+¢q(x)y = 0 is known. Then the inhomogeneous 
equation y” + p(x)y’ + q(x)y = f(x) can be expressed in the following form: [D + (p(x) + yi/y1)][D — yi/m)] y = f(a). 
The required function y(a«) can be found by successive solutions of the equations 


D+ (p(z)+yi/m)lv=f(e), D-/m)]y = r(2). 
Using the above approach, find the general integral of each of the following differential equations. 
(a) sy" t+y'- fy =4e+1207; yy = 2”. (b) 3ay”—y'’ =a; yi =1. 
(c)  y" — Any’ + (407 — 2)y = 20; yr =e*. (4) y" tay’ =2; mw =1. 
(e) ay” — (Qa +1)y’ + 2y = 427; y = e?*. (f)  a?y” — 3ay' + 4y = 424; ys = 2. 
(g) wy” +(1—2x)y'+(@—-ly=e") we”. (bh) (1-2? )y" + 2ay’ =0; ys = 1. 


Use the factorization method to decompose the operator associated with the given differential equation as a product of 
first order operators and then find its general solution. 


(a) aty”—(Qe4+1)y=—4e7e"¥*; (b) 4a? yy" + y = 409”. 


Find the discriminant of the differential equation (1 — 2a — 2”)y” +2(1 + a)y’ — 2y = 0 and solve it. 


For what values of parameters p, q, and r does the differential equation wu” + ant uw + aS u = 0 have a solution 


u = 1/(ax +b)? Here a and b are two nonzero real constants. 
Show that the solutions y: and y2 defined by Eq. (4.6.10) are linearly independent. 


By the use of Riccati’s equation (4.1.23) (consult Problem 5 on page 258), find the general solution for each of the 
following differential equations. 

(a) y"—2y! — 16x4y =0; (b) yo" — ary + By =0; 

(c) ae? + 1)y’—a7e7y=0; (d) y”—y' —9e*y =0,; 

e ue+uot +ax°y =0; x nx—y —azyln’rz=0. 

3 9 y’ 3y 0) f yl y yl 3 0 
(g) ay” +(1—2xcosx)y’+(xsinaz—cosx)y=0. (h) y” +(2e"—1)y’ —3e?? y =0. 
(i) y” — Qesinx + cot x)y’ + (a sin? « — sina)y = 0. 


xy” _ ( 
xy” _ ( 


Suppose that yi(x), yo(x), -.-, Yyn—1(x) are linearly independent solutions of the n-th order variable coefficients differ- 
ential equation y + pn—i(a)y) +---+ pi(x)y’ + po(x)y = 0. Find the fundamental set of solutions for the given 
equation. 


The Dickson polynomial £3(x) = x° — 2z is a solution of the differential equation («7 — 4) y” + 3ry'—15y =0. Find 
another linearly independent solution of this equation. 


The Pell polynomial P(x) = 2z is a solution of the differential equation (x? + 1) y” + 3xy’ — 3y = 0. Find another 
linearly independent solution of this equation. 


The Hermite polynomial H2(x) = 4a? — 2 is a solution of the differential equation y” — 2x y’ + 4y = 0. Find another 
linearly independent solution of this equation. 


Show that if y is a solution of the linear equation 


a(x) y” + [a(x)b(x) — a’(x)] y' + a*(x) c(x) y = 0, 


then u = y’/(a(z) y) is a solution of the Riccati equation u’ + a(x) u? + B(x) u + c(x) = 0. 


Consider the initial value problem 


y’t+yy't+y=0, y(0)=1, y'(0)=1, 


where 7 is a constant. Determine ycriz, the damping constant value that makes the given differential equation critically 
damping (that is, the corresponding characteristic polynomial has a double root). Use a computational software to plot 
the solutions of the given IVP for y = 1,2, and 3 over a common time interval sufficiently large to display the main 
features of each solution. 
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21. Solve the Euler equations. 
(a) 2?y” —zy’+5y =0; (b) a7y"—-2y’+17y¥=0; (cc) 2?y”-2y=0; 
(d) x?y”’ —2ry’+y=0; (e) x?y” — dry’ + 9y = 0; (f)  a?y”+1ley’ + 25y = 0; 
(g) ay” —3ry'+29y=0; (h) a2?y"’+7ry'+10y=0; (i) a?y” +4ry’ +2y=0. 


22. Finda nen solution of the given initial value problem for the Euler equations. 


(a) oy" —20y' +2y=0,y)=Ly(=—1 (b) 4x%y” +1224 +3y =0, y() =1, y'(1) = 5/2 

(c) ay" +3xy'—3y=0, y(2) = : y’(2) = 0; (d) ay” —9ry' + 25y =0, y(1) = 1, y/(1) = 3; 

(e) 1y" bey’ +9y=0, y(1)=2, y'(1)=5; = (Ff) oy” — Bary’ + 10y =0, y(1) = 1, y'(1) = 9; 
1)=5; (hb) o*y”-12y=0, y(1) =1, y/(1) = 11; 


(1 1 
(g) wey" 3ry/ + 13y =0, y(1 Mi cs ; 
(i) ay" +ay'—-9y=0, y(1) =1, y’(1) = 9; Gj) ay" — Try + 16y = 0, y(1) = 3, (1) = 4. 
23. Using a computer solver, plot solutions to the initial value problems in the interval 1 < a < 5. 
(a) vy” —30y/ +3y=0, y(1)=3,y'(1)=1,  (b) 92*y" + 3ry'—8y=0, y(L) =3, y(L) = —1/3; 
(c) ey" +2y—9y=O yD =Ly(=— (ad) ey" +ey'—y=0, y(l) = 1, y'(1) = 2; 
(e)  27y"”+7ry'+8y=0, y(1) =2, 9/11) =-6;  (f)  2?y” —3ry'+5y=0, (1) = 1, y'(1 


Section 4.7 of Chapter 4 (Review) 


I 
w~ 


1. One solution y; of the associated homogeneous equation is given; find the general solution of the nonhomogeneous 


equation. 
(a) wy” +(L—a)y’-y=e*,y=e*; (b) (1-2? )y" — 2ay' + 2y = 2, 1 = 2. 
2. Use the method of undetermined coefficients to find a particular solution of the following differential equations: 
(a) (D? —4)y = 16x e?*; (b) (D?—1)y = 4re?; 
(c) D(D+1)y=2 sing; (d) (D?—4D+5)y = (x — 1); 
(e) (D? +4)y = sin 32; (f) (D2 + 3D)y = 28 cosh 4a; 
(g) (D?—l)y=2e7+6e"; (h) (D?+2D+10)y = 25a? +3; 
(i) (D? —D—2)y = 3e*; (j) (3D? +10D+3)y = 92 + 10 cosz. 
3. Use the method of undetermined coefficients to find the general solution of the following differential equations. 
(a) (D?+2D+2)y = 2a cosz; (b) (D—1)*y =e"; 
(c) (D—1)(D?+4)y=e* sin2xz; (d) D(D? -2D+ 10)y = 6ze?; 
(e) (D? —1)y = 22 cosa; (f) (D?+6D+10)y = 2cosxe%”. 


4. Using the method of undetermined coefficients find the general solution of the following equations. 
(a) y”+6y'+9y=e**;  (b) yy” —y’-2y=1+a+e’*; 
(c) yy’ ty =a? 4+2e°%; (d) y’ +y=2sine +2; 
(e) yy" =a2+2sinhaz; (f) yy” —6y"4+11y’ — by =2e”*. 


5. Solve the following initial value problems using the method of undetermined coefficients. 


(a) y’-—9y=1+4+cosh3z, y(0)=1, y/(0) =0. 

(b) y”’+4y=sin32, y(0)=1, y’(0) =0. 

(c) 6y” —5y”+y=27, y(0)=0, y'(0) =-1. 

(d) 4y” —4y'+y=27e*/?, (0) =1, y'(0) =0. 

(e) 4y’” — 12y’ + 9y = xe2"/?, (0) = 0, y’(0) = —1. 

(f) y” +3 y' —y= 122? +622 —2*, y(0)=4, y/(0) =—8. 
(g) y” — 6y' +13y=4e*", (0) = 2, y’(0) =4. 

(h) y”—4y=e°* — 22, (0) =1, y/(0) =0. 

(i) y” +9y =6 cos3z, y(0) =1, y’(0) =0. 


6. Write out the assumed form of a particular solution but do not carry out the calculations of the undetermined coefficients. 


(a) yO +y@ = 12-62? e7*; (b) vo —y=e" cosa; 
(c) y”’4+2y'’+y=3e* sinz+2re *; (d) — By + 6y” = cos 2xx? e**; 
(e) y — 4y’" + 5y” = a? sin 2a + xe® cos2x; (f) a + 6y” + 10y’ = cosas + 2°; 
(ge) yO +y 4 ay!” 4+ 2y” +y! +y = 27; (h) y + 10y” + 9y = cos a; 
(i) oy!” — by” + 1ly’ — 6y = 144% e™*; (j) yl Ty” + 16y’ — 12y = 9e**. 
7. Use the form yp(t) = A coswt + B sinwt to find a particular solution for the given differential equations. 
(a) #+4y = 12 sin 2¢; (b) (D° — 6D*) y = 37 cost — 12; 
(c) y+4y = cost; (d) #+7y+6y = 100 cos 2¢; 


(e) yt+7y+10y = —43 sin3t—19 cos3t; (f) Yy-2y— 15y = 377 cos 2t. 
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10. 


11. 


12. 


13. 


14. 


15. 


16. 


Review Questions 


The principle of decomposition ensures that the solution of the inhomogeneous equation L[D]y = f +g, where LID] is a 
linear operator (4.7.3), can be built up once the solutions of L[D]y = f and L[D]y = g are known. Can this process be 
inverted? Find conditions on the coefficients of a linear operator L[{D] and functions f and g when the inverse statement 
of the decomposition principle is valid. That is, from the solution to L[D]y = f + g we can find solutions of equations 
L[D\y = f and L[D]y = g. 


In the following exercises solve the initial value problems where the characteristic equation is of degree 3 and higher. 
At least one of its roots is an integer and can be found by inspection. 


(a) (D?+D?—4D—4)y = 8% +8+6e", ee y'(0) = —4, y”(0) = 12; 
(b) (D?—D)y= 2x, y(0)=4, y/(0)=1, y"(0) = 

(c) (D?—D?+D—ly=4sinz, y(0) =3, y/(0) = —2, y”(0) = -3; 

(d) (D?+D? —4D—4)y=3e-* —4r—8, y(0)=1, y/(0) =0, y"(0) =2; 
(e) (D*—1)y= 7a”, y(0) =2, y'(0) =—1, (0) =-14, y'"(0) = -1, 


(f) (*-1)y=4e7*, (0) =0, y’(0) =—-1, y"(0) =4, 9” (0) = -3; 
(g) y® — 8y' =16xe*, y(0) =1, y'(0) = §, y"(0) =2, y'"(0) =4 
(bh) y”t+y=27—-a2+1, y(0)=2, y'(0) =—2, y”(0) =3; 
(i) yf” — Ty" +6y=62", y(0) = 22, y’(0) =1, y"(0) =3. 


Write the given differential equations in operator form L[D]y = f(a), where L[D] is a linear constant coefficient differential 
operator. Then factor L[D] into the product of first order differential operators and solve the equation by sequential 
integration. 


(a) y" —y- 2y=1/(e* +1); (b)  y" —y' — 24y = 13 sin 3a; 

(c) y” —4y'+4y = 25 cosa +2e7"; (d) y”—-y= } 

(ce) y” — 4y' + 5y = 25(« + 1)’; (f) yy + gy aes 

(g) y’—y' —6y=6r+3e?; (h)  2y” + rs — 2y = 42; 

(i) y" +y'- y= Fe; G) "3 Ube 

(k)  y” — 4y' + 13y = 8e?” cosa; QQ) oy’ + “el =1+22e" 
(m) y”’—y' =2sinz; (n) yl Sy! 432i a ae 

Find a linear differential operator that annihilates the given functions. 

(a) «x? + cos 2a; (b)  e” sin3a; (c) e"(1+ 22); (d) a? e?*; 

(e) sin? a; (f) «ve* cos2e; (g) «w+ cos3z; (h) «cos 3a; 

(i) (xe*)’; (j) «? +sin 2a; (k) (cosh 3x)?; (1) sina + cos 22; 
(m) 2? + cos? a; (n) «+e cos2x; (0) sing+e* cosa; (p) «+e sin3a; 
(q) we? sin3x; (rr) cos* 22; (s) sin 2x + cos 3a. 

Find linearly independent functions that are annihilated by the given differential operator. 

(a) (D-1)?+4; (b) D? +4; (c) [p* i A\[(D — 1)? +4); 

(d) [D—5]°; (e) D*[D+2)*; (Gy) Pil 

(g) [D-1) +1); (h) p’-4); (i) p?—1]p? +1); 

(j) D*—-1; (k) D*+1; (1) 2D?+3D-—2; 
(m) D(D+4); (n) D?+4D+4; (0) (D+4)’; 

(p) (dD? +4); (q) 2D?—3D-2; (r) 3D?+2D-1. 


For the equation y” +3y’ = 9+ 272, find a particular solution that has at some point (to be determined) on the abscissa 
an inflection point with a horizontal tangent line. 


/ 


Find a particular solution of the n-th order linear differential equation an (x)y\”) +an—1(x)y~) +.» -+ai(x)y 
f(x) when 


tao(x)y = 


(a) f(x) = ma +b, ao £0, a1, m, and 6 are constants. 


(b) f(x) = mz, ao = 0, and a, is a nonzero constant. 


For the equation y’” + 2y’ = 48x, find the solution whose graph has, at the origin, a point of inflection with the 
horizontal tangent line. 


Consider an alternative method to determine a particular solution for the differential equation (D — A1)(D — A2)y = f, 
where D is the derivative operator and \i # A2. Let yi and y2 be solutions to the first-order differential equations 


a ie 
Ay — Aa’ 


(D—A1)y1 = (D — Az) y2 = 


f 
d2— Ai 
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Then y = yi + y2 is a solution to the given equation (D — A1)(D — A2)y = f. Using this approach, find a particular 
solution to each of the following equations. 

(a) y" +y' — by = 30e*/(1 +e"); (bb) y”—10y’+2ly=12e"; (c) y” —9y=10 cosa; 

(d) y” — 2y' + 5y = 125 2; (e) y” —4y =8/(1 +7"); (f)  y” — 2y’ + 2y = 10 cosha; 

(g)  y”—y'—2y=6e"*/(e* +1); (hb) y” -y=2(L +e"); (i) Qy" + 7y'—4y =e*/(1 +e”). 

17. A uniform circular cylinder of height h, radius r, and mass m floats in a liquid of density po, with its axis being vertical. 
Suppose that cylinder is covered by the liquid to a height ho < h, which means that its density p is less than po. 
Archimedes’ Principle tells us that the downward gravitational force of the cylinder must be counterbalanced by the 
buoyancy force of the liquid: 

mg = po (mr? ho) g. 
Solving for ho and using m = p (xr7h) for the mass of the solid cylinder, we have hopo = hp. If the cylinder is pushed 
downward x units with the periodic force, then the equation of motion becomes 


mz = mg — po (mr? (ho + z)) g+A sin(wt), 


where A is a constant magnitude. Find the general solution for two cases when the natural frequency wo = ./pog/(ph) 
is different from w and when it is equal to it. 


Section 4.8 of Chapter 4 (Review) 


1. Use the variation of parameters method to find a particular solution of the given differential equations. 


(a) y” +9y = tan 32; (b) y” + 4y = 8sec? 22; 

(c) y" +4y = 4 tan?(2t); (d) y”+9y =9 tan? (3t); 

(e) y" + Ay = 4csc2x sg 4a(ax = 1); (f) y” + 25y = 25 sec(5x) + 130 e°; 
(g) y"-2y'’+y= oo: (h) yy” — 6y’ + 10y = e®” sec? 2; 

(i) yo” +25y = Oe; (ji) y” —4y' + 8y = A455. 


2. Given a complementary function, y,, find a particular solution using the Lagrange method. 
(a) xy!" 4+ 42y" — Qay’ + 2y = 82°; yn(z) = Chat+Coa7!+C3 2”. 
(b) a3y” — 327y" + 6ay’ — 6y = 62°; yn(2) = Cra + Cox? +C32°. 
(c) (1—a?)y” — 2ay’ = 2a — 1; Yn =O, +ConS®, -l<a2<l. 


1-2? 


(d) vy” — (1+ 2x?)y’ =82° e* Yn = C1 + C2 e* 
(e) (sin 4a)y” — 4(cos? 2x)y’ = 16 tana; yr(x) = Cy + C2 cos 2z. 


3. Use the Lagrange method to find a particular solution to the given inhomogeneous differential equations with constant 
coefficients. Note that the characteristic equation to the associated homogeneous equation has a double root. 


(a) y” —2y' +y= Se; (b) y”+6y’ +9y = — + 27x? + 182; 
(c) y” —6y' + 9y = 42-2 e®* Ina; (d) y” — 4y' + 4y = 2c?” Ina + 25 sing; 
(e) (D+2)?y =e 7/2; (f) (2D+1)?y =e7*/?/a; 
(g) (D+3)?y =e**/2?; (h) (3D—2)?y =e7?/9/a?. 


4. Use the method of variation of parameters to determine the general solution of the given differential equations of order 
higher than 2. 


(a) yy!” +4y' =8 cot 2t; (b)  y’” — 2y" — y! 4+ 2y = 6e-*: 
(c) y®—2y"”+y=4 cost; (d) y”—y" +y!—y=5et cost; 
(e) yn" mF y' — oat; (f) y" + y’ =tanz. 


Section 4.9 of Chapter 4 (Review) 
1. Express the following functions as Taylor expansions. 
(a) Jo(Vx), (b) Va (V2). 
Show that the function y(x) = Vx Jo(./Z) satisfies the differential equation 4x? y” + (x + 1)y =0. 
Using the substitution t = \/z, convert the differential equation 4a? y” + 4a y’ + (x — 1)y = 0 into a Bessel equation. 
Show that 8J/" (2) = Jn—3(x) — 3Jn—1(@) + 3Jn41(2) — In43(2). 


Prove the recurrence relations 


oxy oy 


Ji(x) = = I(x) — Jv4i(2), 25,() = Jv—1(#) — Jv4i (2). 


x 
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10. 


11. 


12. 
13. 


14. 


15. 


. Establish the following identities. 


NleR 


(a) [ tmmat=21(@) (b) [oe lat = -2 a) + 


2 
a 


ke “y(t)=0, a>0. 


. Using the change of variables y = ux '/?, transform the Bessel equation (4.9.1) into 


w+ [1-5 (8-3) ]u=o 


2, 


. Consider the modified Bessel equation x?y” + ay’ — (a? +1/4)y = 0 for x > 0. 


Review Questions 


k 
. Verify that the function y(x) = Jo (2 4/ — a) is a particular solution to the differential equation my(t) + 
m 


(a) Define a new dependent variable u(«) by substitution y(«) = a«~'/?u(x). Show that u(«) satisfies the differential 


equation u” —u=0. 


cosh & sinha 


a and a 


(b) Show that the given equation has a fundamental set of solutions 


Express the general solution to the Riccati equation y’ = 4x” — 9y? through Bessel functions. Hint: Consult Exam- 


ple 2.6.10 on page 103. 


By appropriate change of independent variable, find the general solution of 


y (\7e7* v) y= 0. 


Show that the Wronskian of J,(x) and N, (a) is 2/(7a). 
(a) Prove that the Wronskian of J, and J_y satisfies the differential equation 


= [cW (Jv, J-v] = 0, and hence deduce that W(J,, J-v) = C/ax, where C is a constant. 
£ 


(b) Use the series expansions for J), J—,, and their derivatives to conclude that whenever v is not an integer C = 


=2 


Tar’ where C is the constant in part (a). 


Show that [esce) da = 2? h(x) + eJo(x) — ‘| Jo(a) da. 


By multiplying the power series for e*’/? and e~*/@", show that ert 1/t)/2 > Jn(x)t”. The function Palais eh 


n=—oo 


is known as the generating function for the J;,(z). 


Chapter 5 


Oliver Heaviside (1850-1925) Pierre-Simon Laplace (1749-1827) 


Laplace Transforms 


Between 1880 and 1887, Oliver Heaviside, a self-taught English electrical engineer, introduced his version of Laplace 
transforms to the world. This gave birth to the modern technique in mathematics and its applications, called 
the operational method. Although the famous French mathematician and astronomer Pierre Simon marquis de 
Laplace introduced the corresponding integral in 1782, the systematic use of this procedure in physics, engineering, 
and technical problems was stimulated by Heaviside’s work. Unfortunately, Oliver’s genius was acknowledged much 
later and he gained most of his recognition posthumously. 

In mathematics, a transform is usually referred to as a procedure that changes one kind of operation into another. 
The purpose of such a change is that in the transformed state the object may be easier to work with, or we may get 
a problem where the solution is known. A familiar example is the logarithmic function, which allows us to replace 
multiplication by addition and exponentiation by multiplication. Another well-known example is the one-to-one 
correspondence between matrices and linear operators in a finite dimensional vector space. 

Differential expressions and differential equations occur rather often in applications. Heaviside’s ideas are based 
on transforming differential equations into algebraic equations. This allows us to define a function of the differential 
operator, D = d/dt, which acts on functions of a positive independent variable. That is, it transforms the operation 
of differentiation into the algebraic operation of multiplication. A transformation that assigns to an operation an 
algebraic operation of multiplication is called a spectral representation for the given operation. In other words, 
the Laplace transform is an example of a spectral representation for the differential operator D acting in a space of 
smooth functions on a positive half-line. 

Since the problems under consideration contain an independent variable that varies from zero to infinity, we 
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denote it by t in order to emphasize the connection with time. Heaviside’s idea was to eliminate the time-variable 
along with the corresponding differentiation from the problems to reduce their dimensions. It is no surprise that the 
operational method originated from electrical problems modeled by differential equations with discontinuous and/or 
periodic functions. For example, the impressed voltage on a circuit could be piecewise continuous and periodic. The 
Laplace transform is a powerful technique for solving initial value problems for constant coefficient ordinary and 
partial differential equations that have discontinuous forcing functions. 

While the general solution gives us a set of all solutions to an ordinary differential equation, practical problems 
often require determination out of the infinity of solution curves only a specific solution that satisfies some auxiliary 
conditions like the initial conditions or boundary conditions. To solve the initial value problem, we have had to find 
the general solution first and then determine coefficients to fit the initial data. In contrast to techniques described in 
the previous chapter, the Laplace transform solves the initial value problems directly, without determining the general 
solution. Application of the Laplace transformation to constant coefficient linear differential equations includes the 
following steps: 


1. Application of the Laplace transformation to the given problem. 
2. Solving the transformed problem (algebraic equation). 


3. Calculating the inverse Laplace transform to restore the solution. 


5.1 The Laplace Transform 


A pair of two integral equations 


b d 
f(a) = / K(e,9) oy) dy, 9) = if G(x, y) f(a) da 


may be considered as an integral transformation and its inverse, respectively. In these equations, K(a#,y) and 
G(a,y) are known functions, and we call them kernels of the integral transformation and its inverse, respectively. 
The Laplace transformation is a particular case of the general integral operator, which has proved its usefulness in 
numerous applications. The Laplace transform provides an example of a linear operator (see §4.1.1). 


Definition 5.1: Let f be an arbitrary (complex-valued or real-valued) function, defined on the semi-infinite 
interval [0, co); then the integral 


prays (cara fo es (pat (6.1.1) 


0 
is said to be the Laplace transform of f if the integral (5.1.1) converges for some value A = Xo of a parameter A. 
Therefore, the Laplace transform of a function (if it exists) depends on a parameter A, which could be either a 
real number or a complex number. 


Saying that a function f(t) has a Laplace transform f"(A) means that the limit 


N 
L()) = lim e rot 
PPO) | f(t) eat 


N-0o 


exists for some A = Ag. The integral on the right-hand side of Eq. (5.1.1) is an integral over an unbounded interval. 
Such integrals are called improper integrals, and they are defined as a limit of integrals over finite intervals. If such 
a limit does not exist, the improper integral is said to diverge. 

From the definition of the integral, it follows that if the Laplace transform exists for a particular function, then 
it does not depend on the values of a function at a discrete number (finite or infinite) of points. Namely, we can 
change the values of a function at a finite number of points and its Laplace transform will still be the same. 

The parameter \ in the definition of the Laplace transform is not necessarily a real number, and could be a 
complex number. Thus, \ = a+ j3, where a is the real part of A, denoted by a = RA, and @ is an imaginary part 
of a complex number A, 6 = SX. The set of all complex numbers is denoted by C, while the set of all real numbers 
is denoted by R (see §4.5). 
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Theorem 5.1: If a function f is absolutely integrable over any finite interval from [0,00) and the integral 
(5.1.1) converges for some complex number X = yp, then it converges in the half-plane RA > Ry, ie., in 
{AEC:RA> Ry}. 


Since the integral (5.1.1) converges for \ = yp, the integral 


A(t, ) = | firje * dr 


has a finite limit h(oo, A) when t +> oo. Let A be any complex number with A > Ry. We have 


fay = f * ee F(t) at 


) 
= | gt gH’ Fa dk =} a VOM an, ji) 
0 0 


where dh(t, 4) = e-“" f (t)dt. We apply the integration by parts formula 


b b 
/ u(t) du(t) = u(b) v(b) — u(a) (a) — / u(t) du(t) 
with 
u(t) =e" O-M#, du(t) = —(A— p)e"P-4dt,  du(t) =e" f (t)dt, v(t) = A(t, w) 


to obtain 


FO) =Q=n) [OM AE, uy at 
0 
because h(0, 4) = 0 and 
lim e~?-)# A(t, pw) = h(co, w) lim e~O-#)* = 0, 
trroo trroo 


The function h(t, 4) is bounded, meaning that there exists a positive number M such that |h(t,w)| < M (so M is 
the maximum of the absolute values of the function h(t, 1)). Therefore, 


FI < [aul | eRO-W) I(t, p)| dt 


RA A= pI 
ap om uf e BRA Ne eee ae LLY 
ale ROA 1) 


Consequently the integral (5.1.1) converges when RA > Ry. ral 

This theorem may be interpreted geometrically in the following manner. If the integral (5.1.1) converges for some 
complex number A = 4, then it converges everywhere in the region on the right-hand side of a straight line drawn 
through A = yw and parallel to the imaginary axis (see Fig. 5.1). Thus, only the real part of is decisive for the 
convergence of the one-sided Laplace integral (5.1.1). The imaginary part of \ does not matter at all. 

This implies that there exists some real value o-<, called the abscissa of convergence of the function f, such 
that the integral (5.1.1) is convergent in the half-plane RA > o, and divergent in the half-plane RA < o,. We cannot 
predict whether or not there are points of convergence on the line RA = a; itself. 

We emphasize again that the imaginary part of the parameter 4 in the definition of the Laplace transform does 
not affect the convergence of the integral (5.1.1) for real-valued functions. Therefore, for the question of existence of 
Laplace transforms, we may assume in the future that this parameter \ is a real number that is greater than o¢, 
the abscissa of convergence (see Fig. 5.1 on page 272). Thus, for example, if we want to determine the convergence 
of the integral (5.1.1) for a particular function f, we may suppose that the parameter \ is a real number. 


Theorem 5.2: The Laplace transform is a linear operator, that is, 


(LCF) (A) =C(LF)(A) and (L(f +9)) (A) = (LF) A) + (£9) OA), 


where C is a constant and f and g are arbitrary functions for which their Laplace transforms exist. 
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This result follows immediately from the definition of an improper integral. Thus, from the definition, 


we write 


ve —Xt : —At 
eon) =f eMEf(i)dt= im Ofe*at 


N 

=C lim | f(t)e"** dt = CF*()). 
N-0o 0 

Let us suppose that f and g are two functions whose Laplace transforms exist for \ = wr and A = pg, respectively. 


Let Ao be a complex number with its real part greater than the maximum of Ruy and Rug, that is, RA9 > Ruy and 
RAg > Rpg. Then for 4 = Ao we have 


N 
(C(f+9))(Qo) = lim i; (f +g) e7?* dt 


N-0o 


N N 
= im | f(t)e-* dt + Jim | g(t) e~** dt 
=f seoecrats [eat = F400) +900) 
0 0 
Hence, (L(f + g)) (A) = (ZF) (A) + (£9) (A) for all A such that RAS RAo. a 


The next few examples demonstrate the application of the Laplace transformation to power functions and expo- 
nential functions. 
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Figure 5.1: Region of convergence with the abscissa 


Figure 5.2: Example 5.1.4. The graph of the gamma 
of convergence od, = LL. 


function. 


Example 5.1.1: Let f(t) =e”, where a is a real constant. Find its Laplace transform. 
Solution. We evaluate the integral over a semi-finite interval: 


N N 1 t=N 
| oe | eo Ona)t gy — |- goa 
0 0 Xr —-a 


—~ 1 aan, _! 


A-a \-a 


We assume that 4 — a > 0 for real \ and R(A — a) > 0 for complex \. Then the first term with the exponential 
multiple approaches zero as N — oo, leaving 1/(\ — a). Hence, we have 


ee) N 
1 
Le] = ete dt = lim ete™™® dt = —_. (5.1.2) 
0 


N-0o 0 A-a 


Example 5.1.2: Find the Laplace transform of the function f(t) = t, t > 0. 
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Solution. Since the Laplace transformation (Lf) (A) = Jy° e~*‘tdt is defined for RA > 0, its abscissa of 
convergence is og = 0. There are two options to determine the value of the integral. One of them calls for 
integration by parts, which yields 


t 
et 


(£f)() =- + 


ae :f* is 1 
+= e*dt=—. 
ie oe de »? 


We remind that the expression 


means the difference of limits; that is, 


- t t 
= lim (5 e*) — lim (5 e*) ; 
t<9 tt t9° r t30 \X 


The upper limit in the right-hand side is zero because 


lim et = lim e749) = Jim eT PV* Jim e ICSE, 
t—+00 t—+00 t—+00 t—+00 
The first limit, limy44.. e7**)*, approaches zero since RA > 0 and t > 0. The second multiplier, limy_,4.. eI, 
has no limits, but is bounded because |e~*S*‘| = | cos(SAt) — jsin(GAt)| = 1. 
There is another option to find the value of the integral. The integrand is equal to 
d 
—At = 
é t= —=—e 
dA 


Since \ and ¢ are independent variables, we can interchange the order of integration and differentiation to obtain 


Co [oe) d [oe) 
| e“tdt= | (-5e™) dt = 5 / e ** dt. 
; » \ dA a Jy 
It is not a problem to evaluate f>~ e~*! dt = +. So we have 


pe d/1\ 1 
a eae aad Cees 
| ea (5) Y 


Example 5.1.3: Using the result of Example 5.1.2, calculate the Laplace transform of f(t) = t", t > 0, where n is 
an integer. 

Solution. The abscissa of convergence o, for this function is equal to zero. Therefore, according to Theorem 1.2, 
a parameter A may be chosen as any positive number. To start, we consider n = 2. Again integrating by parts, we 


have 
(Lf) (A) =} er de oe) ab if e Ot dé 
0 A t=0 A 0 
since : 
ew“ dt= =. d (eo) . 
Therefore, 


(Lt?) (A) = 


| e Ot dt. 
0 


Using the result of Example 5.1.2, we obtain (Lt?) (A) = +. In the general case, we have 


le 


n! 


(£t") (A) = yar’ 


This formula can be obtained more easily by differentiation of the exponential function: 


pm err = -< ee ma _ (-=) et, 
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Thus, we have 


oe) d n (oe) 
tr = —Atyn t={-— —AXt t 
(Lt) (A) | ee" a ( =) : ed 


Example 5.1.4: Let p be any positive number (not necessarily an integer). Then the Laplace transform of the 
function f(t) = t?, t > 0, is 


(LP)(A) = | e** iP dt = | en A? dé 
0 0 


ae ae T(p +1) 
= p-l Tp eae 
= x | eT? dr= yer? 


where - 
T(v) = | et’ ldr (5.1.3) 
0 
is Euler’s gamma function. This improper integral converges for vy > 0, and using integration by parts, we obtain 
Twvt+1)=vT(v). (5.1.4) 


Indeed, for v > 0, we have 


Tv+l1) = | eT’ dr= -{ 7 dé * 
0 0 


Co 
= —71" ea. +vy | er’ dr =vI(v). 
0 


The most remarkable property of the [-function is obtained when we set v = n, an integer. A comparison with the 
result of the previous example yields [(n +1) = n!, _ n = 0,1,2,.... The computation of £[t"/?], where n is an 
integer, is based on the known special value [ (3) = ,/7 and is left as Problem 7 on page 280. 


Definition 5.2: A function f is said to be piecewise continuous (or intermittent) on a finite interval [a, b] 
if the interval can be divided into finitely many subintervals so that f(t) is continuous on each subinterval and 
approaches a finite limit at the end points of each subinterval from the interior. That is, there is a finite number 
of points {a;}, 7 =1,2,...,.N, where a function f has a jump discontinuity when both 


lim f(a; +h) = f(a; + 0) and lim f(a; —h) = f(a; — 0) 

h>o h>o0 
exist but are different. A function is called piecewise continuous on an infinite interval if it is intermittent on 
every finite subinterval. 


Recall that we have f(t) = f(t+0) = f(¢— 0) for a continuous function f. If at some point ¢ = to this is not 
the case, then the function is discontinuous at t = to. In other words, the finite discontinuity occurs if the left-hand 
side and the right-hand side limits are finite and not equal. 


Definition 5.3: The Heaviside function H(t) is the unit step function, equal to zero for t negative and unity 
for ¢t positive, with H(0) = 1/2, ie., 
1, t>0; 


Wy, t=0; (5.1.5) 
0, t<0. 
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Although Mathematica® has a built-in function HeavisideTheta (which is 1 for t > 0 and 0 for t < 0), it is 
convenient to define the Heaviside function directly: 
HVS[z_] := Piecewise[{{0, x < Oo}, {1, x > O}, {1/2, x==0}}] or 
HVS[x_] := Piecewise[{{ 1, x > 0}, {1/2, x == 0}, {0, True}}] 
Maple” uses the Heaviside(t) symbol to represent the Heaviside function. Both Maple and Mathematica leave 
the value at t = 0 undefined; however, Mathematica has a similar function UnitStep (which is 1 for ¢ > 0 and 0 
otherwise). It is also possible to modify the built-in symbol UnitStep in order to define it as 1/2 at t = 0: 
Unprotect [UnitStep] ; UnitStep[0] = 1/2; Protect [UnitStep] ; 
In Maple, we can either define the Heaviside function directly: 
H:=x->piecewise(x<0,0,x>0,1,x=0,1/2) 
or enforce the built-in function: 
Heaviside(0):=1/2 or type 


NumericEventHandler (invalid_operation=‘Heaviside/EventHandler‘ (value_at_zero=0.5)): 


Remark. Actually, the Laplace transformation is not sensitive to the values of the function at any finite number 
of points. Recall that a definite integral of a (positive) function is the area under its curve, which is further defined 
as the limit of small rectangles inscribed between the curve and the horizontal axis. Since the width of a point is 
zero, its product by the value of the function at that point is also zero, and one point cannot contribute to the area. 
Thus, a definite integral does not depend on the values of an integrated function (called the integrand) at a discrete 
number of points. So you can change the value of the function at any point to be any number, i.e., 4 or 1,000,000 
and its integral value will remain the same. 


Example 5.1.5: Since the Heaviside func- 
tion is 1 for t > 0, its Laplace transform 
is 


= 1 oo 1 
- oe d=—-e™ =-, RA > 0. 
0 r t=0 r —_——————— 
The abscissa of convergence of this integral 
is equal to zero. 


In the calculation, we never used the partic- 
ular value of the Heaviside function at the 
point t = 0. Therefore, the Laplace trans- 
form of H(t) does not depend on the value 
of the function at this point. 


O a 


Figure 5.3: Shifted Heaviside function 
H(t—a). 


The Laplace transform of the shifted (sometimes referred to as retarded) Heaviside function H(t — a) is 


eT or 


A 


(Hea y= a e-™ H(t —a)dt = [ eM dt = 


In many cases, it is very difficult to find an exact expression for the Laplace transformation of a given function. 
However, if the Maclaurin series, f(t) = Pa, fnt”, for the function f(t) is known, then using the formula L[t”| = 
n!\~"—1 found in Example 5.1.3, we sometimes are able to determine its Laplace transform as infinite series: 


n! (n) 
cla =£|o ae] = he =e (5.1.6) 


n>0 n>0 n>0 
provided the series converges. 
Example 5.1.6: The function tant is not piecewise continuous on any interval containing a point t = 7/2 + 


kn, k = 0,+1,+2,..., since it has infinite limits at these points. That is, for example, tan(7/2 —0) = +oo and 
tan(7/2 +0) = —oo. 
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On the other hand, the function (see its graph in Fig. 5.4) 


F(t) = S2(-1)* H(t — ka) 


k=0 
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is piecewise continuous. This function is a bounded function; therefore, its Laplace transform exists. Thus, 


= 1 oe | 
L _ k = k j—aky _ 
FPA) = So(-1* LIN (t — ka)] = X S(-1)*e = Tae 
k=0 k=0 
since we have a So(-p' z* for z= e"™ 
k=0 
yA 
1 — ————— Sana 1 
> 
oO 1 2 3 4 ‘ 
Figure 5.4: Example 5.1.6, the piecewise continuous Figure 5.5: Example 5.1.7, the graph of the piecewise 
function. continuous function. 


Example 5.1.7: Find the Laplace transform of the piecewise continuous function 


t?, 0<t<l, 
f@) =< 244, 1<t<2, 
1-t, 2<t<o. 


Solution. This function is not continuous since it has two points of discontinuity, namely, t = 1 and t = 2. At these 
points we have f(1—0) =1, f(1+0) =3 and f(2—0) =4, f(2+0) = —1. To plot the function, we use the following 


Maple commands: 


f := piecewise(0<t < 1, t°2, 1<t < 2, 2+t, 2<t, 1-t); 
plot(f, t = 0..4, discont=true) ; 


To obtain the Laplace transform for a piecewise continuous function, just apply the integral transformation on 
intervals over which the function is continuous and sum the results. Therefore, the Laplace transform of the function 


f(t) is 


ena feeras f ernerat [ 0-oemae 


t 


Let us attack the integrals using integration by parts. Since the antiderivative of e~** is —A~! e~*", we have 


— 2t _ 2 
fF(A) = Se rage ae | 


t=0 
2 —dAt t —At 1 —At = 1 
+ |-[e*- es + |-—+ 
| e€ c € 5) e€ i X 
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Figure 5.6: Mean value of a discontinuous function. 


The last example shows that the Laplace transform of a discontinuous function is a holomorphic function (that 
possesses infinitely many derivatives) in half plane RA > o¢. 

It soon becomes tiresome to test each function we encounter to determine whether the corresponding integral 
(5.1.1) exists in some range of values for 4. We therefore seek a fairly large class of functions for which the Laplace 
transform can easily be shown to exist. For this purpose, we introduce the following definition. 


Definition 5.4: A function f(t), t € [0, 00) is said to be a function-original if it has a finite number of points 
of discontinuity (finite jumps) on every finite subinterval of [0,co) and it grows not faster than an exponential, 
that is, 

lf(t)| << Me™ (t>T) (5.1.7) 


for some constants c, M, and T, which may be very large. Moreover, we assume that at points of discontinuity 
the values of a function-original are equal to the corresponding mean values; thus, 


f() = 5 [f+ 0) + f(t — 0)] = lim oie) ee) (5.1.8) 


The Laplace transform of such a function is called the image. 


A function-original can diverge to infinity as ¢ tends to infinity. The restriction (5.1.7) tells us that a function- 
original can blow up at infinity as an exponent, but not any faster. 

If a function is continuous at t = to, then f(to) = f(to +0) = f(to — 0), and the relation (5.1.8) is valid at that 
point. We impose condition (5.1.8) to guarantee the uniqueness of the inverse Laplace transform. In §5.4 we will see 
that the inverse Laplace transform uniquely restores a function from its image (Laplace transform) only if condition 
(5.1.8) holds at any point. So, there is a one-to-one correspondence between function-originals and their images. 

Note that the Laplace transform may exist for a function that is not a function-original. A classic example is 
provided by the function f(t) = t~!/?, which has an infinite jump at t = 0. Nevertheless, its Laplace transform 
exists and equals ,/7/A. Therefore, function-originals constitute a subclass of functions for which the Laplace 
transformation exists. 


Definition 5.5: We say that a function f is of exponential order if for some constants c, M, and T the inequality 
(5.1.7) holds. We abbreviate this as f = O(e“) or f € O(e™). A function f is said to be of exponential order 


a, or eo(a@) for abbreviation, if f = O (e%) for any real number c > a, but not when c < a. 
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Any polynomial is of exponential order because the exponential function e (€ > 0) grows faster than any 
polynomial. If f € eo(a), it may or may not be true that f = O (e®*), as the following example shows. 


Example 5.1.8: Both functions te?’ and t~'e”! are of exponential order 2; the latter is O (e7/), but the former is 
not. According to the definition, a function f(t) is of exponential order f(t) = O(e) if and only if there exists a 
real number c such that the fraction f(t)/e® is bounded for all t > T. Since the fraction te” /e”’ = t is not bounded, 
the function te?’ does not belong to the class O (ee However, this function is of exponential order 2 + € for any 
positive ¢, namely, te?’ = O(e(?+*)*), because te?*/eCt*)! = te-** is a bounded function for large values of t, yet 
exponential functions decrease faster than any polynomial. 


Example 5.1.9: The function f(t) = er, t € (0,00), is not a function-original because it grows faster than e“ for 
any c; however, the function 
f(t) = eV", t€ [0,00) 


does not grow faster than e“ for any c. Therefore, it is a function-original. When t approaches infinity, the expression 
t4 + 1 — ¢? approaches zero. This follows from the Taylor representation of the function 
s"(0) 9 


i “is 


s(e)=V1+e=s8(0)+s'(O)e+ 


for small enough e. If we let ¢ = t~+, then we obtain 
Jeqiat i+ aevigeat (i die a eee 
t4 2t4 2t? , 
Hence, 


1 1 
VEL RP =a = ass 


when ¢ approaches infinity. The function e” is not of exponential order since, for any constant 4, 
Pp Y. 


2 
lim e! e~*4 = oo. 
too 


Definition 5.6: The integral (5.1.1) is said to be absolutely convergent, if the integral 


[ e A)E | FG) | dé (5.1.9) 
0 


converges. The greatest lower bound o, of such numbers tA for which the integral (5.1.9) converges is called 
the abscissa of absolute convergence. 


The following assessments follow from Definitions 5.1 and 5.5. 


Theorem 5.3: If |f(t)| < C for t > T, then the Laplace transform (5.1.1) converges absolutely for any \ with 
tA > 0. In particular, the Laplace transform exists for any positive (real) A. 


Theorem 5.4: The integral (5.1.1) converges for any function-original. Moreover, if a function f is of expo- 
nential order a, then the integral (5.1.1) absolutely converges for #A > a. Furthermore, if f and g are piecewise 
continuous functions whose Laplace transforms exist and satisfy (Lf) = (Lg), then f = g at their points of 
continuity. Thus, if F(\) has a continuous inverse f, then f is unique. 


Example 5.1.10: We have already shown in Example 5.1.1, page 272, that (Le°*) (A) = (A- a)~*. With this in 
hand, we can find the Laplace transform of the trigonometric functions sin at and cos at using Euler’s equations 


cos 6 = Rel? and sin 0 = Sel?, (5.1.10) 
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where #z = Rez = x denotes the real part of a complex number z = x + jy and Sz = Imz = y is its imaginary 
part. Then assuming that \ is a real number, we find the Laplace transform of sinat as the imaginary part of the 
following integral: 


il Qa 


* = —xXt jat = —At pjat _ = 
(Csinat) = | e Se a=sf ee ta ae 


Similarly, extracting the real part, we obtain 
1 A+ ja r 


_ a —At viat = es ce eS ee 
(Coosat) = f HRT g Ja) ta) a 


Example 5.1.11: Since the hyperbolic functions are linear combinations of exponential functions, 
sinh at = 


[fe —e~"|,  coshat = ; [ew + e7™] , 


their Laplace transformations follow from Eq. (5.1.2), page 272. For instance, 


1 Af? 
(L cosh at) = | ee dt + = 7 ee di 
2 Jo 2 Jo 


1 1 1 

“9 Feraberd N= a?" 

We summarize our results on the Laplace transforms as well as some relevant integrals in Table 280. In the 

previous examples, we found the Laplace transforms of some well known functions. It should be noted that all 

functions are considered only for positive values of the argument. For negative values of t, these functions are 

assumed to be identically zero; to ensure this property, we multiply all functions by the Heaviside step function, 

Eq. (5.1.5). For instance, in Example 5.1.3, the Laplace transform of the function f(t) = t? was found. However, 

we actually found the Laplace transform of the function g(t) = H(t)t?, but not f(t). Hence, we can consider the 
Laplace transform for g(t) as the integral over the infinite interval, namely, 


g (A) = i g(t) e-™* dt. (5.1.11) 


Such an integral, called a two-sided or bilateral Laplace transform, does not exist for the function f(t) = ¢? 
for any real value of X. 


1. Find the Laplace transform of the following functions. 


(a) 2-—5t; (b) tcos 2t; (c) e” sint; (d)  e* cosh 2t; 
e e** sinh 4¢; f e* cos 2t; t* — 2t; h 2t sinh 2t; 
& 
i cost + sin 2¢; j sint + e?!; k t? cosh 2t; 1 t sin t; 
J 
(m) te”; (n) te’; (0) ?# sin3t; (p) (t? —1)sin2t; 
(q)_ t? cos2t; (r) te™, 


2. Determine whether the given integral converges or diverges. 
(a) fo°(W+4) "di (b) fee dt; (c) fo? t cos 2tde; 


(d) for sat dt; (e) foo at; (f) foe peat. 
3. Determine whether the given function is a function-original: 
(a) sin2t; (b) e* cos2t; (c) 1/(1+¢#?); 
(d) sint?; (e) +t ', t>0; (f) +3, t>0; 
(g) aint. (h) exp {VP +4— ae (i) (1-—cos t)/t?; 
t, O0<t<l; 1, O0<t<1; 
Gj) f)=4 (-t)?+1, 1<t<3; (k) f(t)= 4 2-t, 1<t<2; 
{6 22% { 2 2<t. 


4. Prove that if f(t) is a function-original, then vim fr (A) =0. 
— 00 
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3 5 
(c) 


Figure 5.8: Problems 4(c) and 4(d). 


Figure 5.9: Problems 4(e) and 4(f). 


5. The functions are depicted by the graphs in Fig. 5.7, 5.8, and 5.9, in which the rightmost segment extends to infinity. 
Construct an analytic formula for the function and obtain its Laplace transform. 


6. Prove that if f(t) and g(t) are each of exponential order as t + oo, then f(t)- g(t) and f(t) + g(t) are also of exponential 
order as t — oo. 


7. Following Example 5.1.4 and using the known value ['(1/2) = \/7, find the Laplace transform of the following functions. 


oe OF OF) wa 


8. Which of the following functions are piecewise continuous on [0, co) and which are not? 


(a) In(1+¢#?); (b)  |t], the greatest integer less than ¢; 


-1, _ Jj 0,  # is an integer, 

ey (4) £) -{ 1, otherwise; 

O24 OMe 7 aa 
: sint 

(g) sin +; (h) —. 


5.1. The Laplace Transform 


Table 280: A table of elementary Laplace transforms. Note: each function in the left column is zero for negative 
t; that is, they must be multiplied by the Heaviside function H(t). 


Laplace transform. 


Ht Function-original Laplace transform Oc 
a 
1. H(t) 5 RA > 0 
1 
2. H(t —a) X es RA > 0 
1 
3. t oi RA > 0 
! 
4. i n=1,2, yaa Rr > 0 
Tp +1) 
5. tP PH RA > 0 
6. ett : RA > Ra 
A-a 
n pat = n! 
ie re, WH 1525... Onayr RA > Ra 
8. sin at y < a2 RA > O 
9. cos at y A a2 RA > 0 
at os B 
10. e* sin Bt Dnata Be RA > Ra 
7 rA-a 
11. e* cos Bt nase RA > Ra 
12. sinh St x E zB RA > KB 
a 
13. cosh Gt aD RA > KG 
: 28 
14. tsin Bt 02+ pe RA > 0 
2 onl B? 
16. ev — eft ane RA > Ra, KB 
(A — a)(A — 8) 
a a - A 
17. ev [cos Bt + § sin Bt Ona te RA > Ra 
sin cos 1 
18. mee — boos OF BE RA > 0 
r 
tsin Bt 
19. sin Bt Or RA > 0 
B 
at Ble 
20. e™ sinh Bt Onaf RA > Ra + B) 
21. e™ cosh Bt sam RA > Ra + B) 


Here o, is the abscissa of convergence for the 
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5.2 Properties of the Laplace Transform 


The success of transformation techniques in solving initial value problems and other applications hinges on their 
operational properties. Rules that govern how operations in the time domain translate to operations in their 
transformation images are called operational laws or rules. In this section we present the 10 basic rules that are 
useful in applications of the Laplace transformation to differential equations. The justifications of these laws involve 
technical detail and require scrutiny that is beyond the scope of our text—we simply point to the books [8, 12, 47]. 
Let us start with the following definition. 


Definition 5.7: The convolution f * g of two functions f and g, defined on the positive half-line [0, co), is the 
integral 


(f*g9)( n= f He—ral r)dr = (9 f) (2). 


Example 5.2.1: The convolution of two unit constants (which are actually two Heaviside functions) is 


t 
(A x H)( n= f He-n) H(r)ar= | dr =t, t>0. 
0 


Now we list the properties of the Laplace transform. All considered functions are assumed to be function-originals. 


1° The convolution rule 


The Laplace transform of the convolution of two functions is equal to the product of its images: 
Lf * g)(A) = f"()9" (A). (5.2.1) 
A short manipulation gives 
L(f*g)A) = i erat [ f(t-7)g 
= fae rar f &™ Fe nae—1) = 9 F-0), 


2° The derivative rule 


L Eine () O) =A" 270) = 3 yr-k FRY) (40). (5.2.2) 
Integration by parts gives us the equality (5.2.2). In ati 
LIf’IO) =A FPO) — FO); (5.2.3) 
Lf" (A) = f°) — AF) — f'(0). (5.2.4) 
3° The similarity rule 
Lif(at)|() = = ft (=)  icrke, eso (5.2.5) 


4° The shift rule 


If we know g“(A), the Laplace transform of g(t), then the retarded function f(t) = g(t — a)H(t — a) has the 
Laplace transform g”(A)e~™, namely, 


LlH(t—a)g(t—a)| (A) =e M9"), a>. (5.2.6) 


5° The attenuation rule 


Lle f(t)] Q) = f*(A+a). (5.2.7) 
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6° The integration rule 


Li * FIO) = L : (t = qn} f(r) dr = my mer n= 1, 2; erie es (5.2.8) 

If n = 1, then : 
SfOy=£ f ferar. (5.2.9) 

0 


These equations (5.2.8) and (5.2.9) are consequences of the convolution rule and the equality 


corny = fo weMar= BEY, (5.2.10) 


where [(v) is Euler’s gamma function (5.1.3), page 274. If v is an integer (that is, vy = n), then [(n+ 1) = 
n! n=0,1,2,.... This relation has been proved with integration by parts in §5.1, page 274. Since the gamma 
function (see Fig. 5.2 on page 272) has the same value at v = 1 and at v = 2, that is, (1) =T'(2) = 1, we get 


Ol=1!=1. 
7° Rule for multiplicity by ¢” or the derivative of a Laplace transform 
d” a 
ae fPOH=L(-1)" fH] (wn =0,1,...). (6.210) 


8° Rule for division by t 
| =| f'(c) do. (52.19) 
r 


9° The Laplace transform of periodic functions 
If f(t) = f(t+w), then r 
f(A) = —= | e © f(t) dt. (5.2.13) 
10° The Laplace transform of anti-periodic functions 
If f(t) = —f(t+w), then 


FLA) : | * eM F(t) de, (5.2.14) 


~ 1+ e7 wr 0 
Theorem 5.5: If f(t) is a continuous function such that f’(¢) is a function-original, then 


AO) = ma VO), AO\= i * p()e™ at. 


AH00 
Remark 1. We can unite (5.2.5) and (5.2.7) into one formula: 


£ F en bla F (=) i= Fay: (5.2.15) 


a 


Remark 2. We do not formulate exact requirements on functions to guarantee the validity of each rule. A curious 
reader should consult {12, 47]. 


Example 5.2.2: (Convolution rule) The convolution of the function fi) = tH(t) with itself is 


faf=tH(yseH = fr (e—nar=t | rdr— | Tv dr 


fo) ee Se eeess 
So i & og ee 
According to the convolution rule (5.2.1), we get its Laplace transform: 
a 1 J 1 
L[t*t]=L 5 = Lit] - LE] Sag ae 
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Example 5.2.3: (Convolution rule) The convolution product of the shifted Heaviside function H(t— a) and t? 


1S 
— a) 


H(t—a)ee = | H(t—r—a)rdr=H(t~a) [Par = H(ea) (t 


The same result is obtained with the convolution theorem: 
1 2 2 
L[H(t—a)*t?] =L[H(t—a)]- L[t?] = = es as ee 
The Laplace transformation is an excellent tool to determine convolution integrals. While we did not study the 
inverse Laplace transform, which is the topic of §5.4, we can find their inverses based on Table 280. Application of 
the shift rule (5.2.6) yields 


(t- a)? 
_ 


Ye x oe = H(t — a) ~—_— = H(t -a) 


Example 5.2.4: (Derivative rule) Find the Laplace transform of cos at. 
Solution. We know that —a?cosat = d?(cosat)/dt?. Let f& = L{cosat] (XA) denote the required Laplace 
transform of cosat. Using the initial values of sine and cosine functions, the derivative rule (5.2.2) yields 


2 
L [-a* cosat] (A) =-a?f’ =L B eae a = 27h). 


dt? 
Solving for f", we get f& = L [cosat] (A) = A/(a? + A?). 


Example 5.2.5: (Similarity rule) Find the Laplace transform of cos at. 
Solution. We know that L[cost](\) = \(A? + 1)~!. Then, using the similarity rule (5.2.5), we obtain 


r 
1. | r 
L[cos at}(X) = a a = a 
145 
a2 


Now we find the same transform using the periodic property cos(t) = — cos(t+7) for 0 < t < a. Indeed, Eq. (5.2.14) 
gives 


1 cid 
L{cost](A) = =a! e-" cos(t) dt 
+ 0 
1 -s aN 
= —___ ——_ (]l FN : 
Teena ead 


Example 5.2.6: Figure 5.10 shows the graphs of the functions f;(t) = H(t—a) sint and f(t) = H(t—a)sin(t—a). 
These two functions f; and fg have different Laplace transforms. Indeed, using the trigonometric identity 
sint = sin(t — a+ a) = sin(t — a) cosa + cos(t — a) sina, we obtain 


—Xa 


fEQ) = £{ H(t — a) [sin(t — a) cosa + cos(t — a) sina]} = xs a 


[Asin a + cosa], 


whereas f#(A) = & 


Example 5.2.7: (Shift rule) Evaluate C[H(t—a)](A) (see graph of H(t—a) on page 275). Then find the Laplace 
transform of the product H(t — b) cosa(t — b). 
Solution. The shift rule (5.2.6) gives 


LIH(t — a)|(A) =e HE (A) = xo. 


We again use the shift rule to obtain 


\ 


LIH(t = b) cos a(t = b)] = er pre 
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2 2 
1 1 
0} — 0 

=| _4 

og 5 a 5 


Figure 5.10: Example 5.2.6. The graph of the function H(t — a)sint and H(t — a) sin(t — a), with a = 1, plotted in 
MATLAB®. 


Example 5.2.8: (Attenuation rule) Find the Laplace transform of cosh 2t. 
Solution. Using the attenuation rule Eq. (5.2.7), we have 


1 1 1 
£|cosh2t] = cls + se] = £ [e?4] + £le-**] 
~ i132 ,11 __A4 
a Ro ed 


Example 5.2.9: (Attenuation rule) Find the Laplace transform of t° sin 3t. 
Solution. We know that sind = Re® is the imaginary part of e/°, then the attenuation rule yields 


L [t° sin 3t] — 2 ie Se] = SL [t° | 
3! (A + 3j)* 3! 


31S —————— 
( 


9 __ oy, 4.354 
y= 3A Fan? OF Oe SOT) 


= & = 

~ ~ = 35 
3! 

= Beat SP +4 BI) + 8-9? BI)? +4-AGH + BI)" 
! 

- ory [40° -3 — 403°] = 


314-3 


02 +94 [3 = 9d]. 


The same result can be obtained with the aid of the multiplication rule (5.2.11): 
a a 3 72d(A? — 9) 
Le? sin3t] = -—~L£ [sin 3¢] = -—- ——— = —_"——. 
[é* sin 3t] = — 5 £sin3t] = —Daxa yg (QO? +9)4 


Example 5.2.10: (Integration rule) Find the Laplace transform of the integral 
| ‘Gas antes La 
—T) sin27 dr = = — = sin 2t. 
f T C2554 


Solution. Using the integration rule (5.2.8), we obtain 


: ; _ Lfsin2](A) 2 1 
Ay (t—7T) sin2rdr| ae ae a 


This integral is actually the convolution of two functions: ¢ * sin2t. So, its Laplace transform is the product of 


Laplace transformations of the multipliers: 4 = L{t] and ems z = £|sin 2). 


Example 5.2.11: (Multiplication by the t” rule) Find the Laplace transform of t cos 2t. 
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Solution. Using Eq. (5.2.11), we find its Laplace transform to be 


Lt cos2t] = | t eos te at =~ | cos 2t 1 (e-™) dt 
0 (0) dr 
i ia d d 
= —-— ie d= = If) = 
aay, eS gy A eS ay oe a 
_ MW+4—-2A-24] 2-4 
(2 + 4)? (2 4.4)?" 


Alternatively, using Euler’s formula (5.1.10), we have cos 2t = Re)’ = Ree)". Hence 


L[t cos2t] =RL [te*] and L{t]= z 


From the attenuation rule, it follows that 


1 (A + 2j)? 
Eltcos2] = Roto "Oras ap 
ee ar ae ee eee 
= Grey RO+ 29" = Gee RO? + Ni- 4) = Ge 


Example 5.2.12: (Periodic function) Find the Laplace transform of sin 2t. 
Solution. To find £[sin 2¢](\), we apply the rule (5.2.13) because the sine is a periodic function with period 27. 
The function sin 2t has the period 7, that is, sin 2t = sin2(t+ 7). Hence, 


1 Tv 
L£[sin 2¢]() => [-e-™ | e* sin 2¢ dt. 


The value of the integral can be determined using Euler’s identity sin@ = Sei", namely, sin @ is the imaginary part 
of &3?: 


Tv Tv Tv 
| e™ sindtdt = | a eae dba S | ede 
0 0 0 


en t+ 25t tan _ = 7 1 
SSS pe FP me ft 
#=0 vA — 2j vA — 2j 


ea 

os aT 
Multiplying the numerator and denominator by + 2j, the complex conjugate of \ — 2j, and using the identity 
(A + 25)(A — 2j) = A? + 4, we obtain 


1 —An+2jr A+ 2j : A+ 2j 
ae ee a 
\ — 2j vA — 2j +4 A? +4 
since e?7J = e~?"J = 1. Taking the imaginary part, we get 
1 2 2 
L{sin (4) = ——5 ~— [l-e™]= 
[sin ¢]() fam ee | c | +44 


Example 5.2.13: (Periodic function) Find the Laplace transforms of the periodic functions 
fa(t) = H(t) -— H(t—a)+ H(t —2a)—- H(t—-3a)+--:-. 


and 


g(t) = sin 2tf,,/2(t) = sin 2t a (—1)* H(t — 1/2). 
k=0 
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(a) . (b) 


Figure 5.11: Example 5.2.13: (a) the piece-wise continuous function f,(¢) (also called the square-wave function), and 
(b) the function g(t) = sin 2tf,/2(t), plotted with Mathematica. 


Solution. The function f,(¢) has a period equal to w = 2a; therefore, we can apply Eq. (5.2.13) to find the 
Laplace transform of a periodic function. In fact, we have 


1 1 1 
L —axr —2ar —3ar 
A ee — aya a 
fi) ae + x° hd + 
1 * —At 1 a. —At 
i eTwar i, 1 ib ( ) _ e72ar 0 C 
1—e7# 1—e"“% 1 


A(1 — e724) (1 —e-*) (1+ e-®) (1+ e742)" 
In turn, the function g(t) has a period 7 and is zero for t € [Z, | Thus, 


1 n/2 7 ) {it ean /2 
Lig(t)| (A) = {2 | e at sin(2t) dt = ae ra A {—e-/t = e-an/2 : 


In the general case for the periodic function 


fo(t) = sin wt, 0<t<mr/u, 
eS i oO, tlw <t < 2r/w, 


with the period 27/w, we can obtain its Laplace transform as follows: 


1 m/w 
L : —AXt 
A) = — t dt 
PO) = —Sexe [ sinwte 
wW 1 —7Tr/w 
~ 24 wy? 1 eo a/u lite ls 


You can handle this function with a computer algebra system. For instance, in Mathematica it can be done as 
follows: 
SawTooth[t_] := 2 t - 2 Floor[t] -1; 
TriangularWave[t_] := Abs[2 SawTooth[(2 t -1)/4] -1; 
SquareWave[t_] := Sign[ TriangularWave[t] ]; 
Plot[{ SawTooth[t], TriangularWave[t], SquareWave[t]}, {t, 0, 10}] 
The function g(t) can be plotted in Mathematica by executing the following code: 
g[x_] := If[FractionalPart[x/Pi] < 0.5, Sin[2 x], 0] 
Plot[g[t], {t, 0, 8}, PlotRange -> {0, 1.2}, AspectRatio -> 1/5, 
Ticks -> {Pi/2 Range[0, 8], {1}}, PlotStyle -> Thickness [0.006] ] 


Example 5.2.14: Figure 5.12(a) shows a pulsed periodic function f(t) with a pulse repetition period +r = 4 +4 7. 
Find its Laplace transform. 
Solution. Thus, on the interval [a,a +7], this function is defined as 


fa(t) = sin k(t — a), axt<at+n, 
oN 10, a+m<t<at+r. 
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Figure 5.12: (a) Example 5.2.14: graph of the pulsed periodic function; (b) Example 5.2.15: graph of the damped 
oscillations f(t) =e~* sinwt repeated every 7 seconds; here a = 1.0, w = 10, and 7 = 8.0, plotted with MATLAB. 


To find the Laplace transform of this function, first we find the transformation of the function f(t) when a = 0 
and then we apply the shift rule (5.2.6). The function fo(t) is a periodic function with the period r. Therefore, 


Eq. (5.2.13) yields 


1 * 1 7 
L —At . —Xt 
fo) = l-eo™ i fo(t)e"™ dt = = [ sinkte”™ dt 


1 (—1)*+1 
1-e? BR © 


if & is an integer. The shift rule gives us the general formula, that is, 


tose 1 Kigebe) 
fo) = Bye Toe ® | 


Example 5.2.15: Figure 5.12(b) shows a series of damped oscillations f(t) = e~™ sinwt repeated every T sec. 
Assuming that a is large enough, show that its Laplace transform is 


L _ Ww 1 
Ol Fes =a 1—e-™" 
Solution. According to Eq. (5.2.13) on page 283, we have 


1 r w 1 
L —at o; —At 
ha) 1—e-7 | ee peat =| 1—e-7 


since the function f(t) has the period r. 


Example 5.2.16: Figure 5.13 shows an anti-periodic function f(t) = —f(t+ a) defined as 


z E, if0<t<a; 
ft= —-Zt+E, ifa<t< 2a; 
f(t) = f(t + 2a), for all positive t. 
The Laplace transform of this function can be obtained according to Eq. (5.2.14) on page 283, that is, 
il 1 E f* 
& —At —At 
jj) = —— t) dt = ——__ -= tdt 
PO = gee f met=-paa sf 
E 1-e® E ad 
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Figure 5.13: Example 5.2.16. Anti-periodic function. 


Example 5.2.17: To find the Laplace transform of sin st we expand it into the Taylor series: 


sin 3t _ 1 ai yr pent! _ a hn p27 gant 
Tl a! Qn+1)! 42 (2n +1)! 


Interchanging the order of integration and summation, we obtain 


sin 3t ad sin 3é | ae [ec yn (32)? en 
= t= En RES bs 
c[=* l= f a=) oe 


n=0 


an)! 
The fourth formula in Table 280 provides £ | = a Cs y Thus, 


\2n+1" 
c = =) ) ae (ny) Se. = (-1)” gees 
a ee Qn+pl & +1 (2n + 1) 
since (2n + 1)! = (2n+1)- (2n)!. 

1. Find the Laplace transform of the following functions. 

(a) 4t—2; (b) 2°; (c)  cos(2t — 1); 

(d) 4t? + sin 2t; (e) ett, (f)  sin?(2t — 1); 

(g) cos*(5t—1); (h) (2t+3)3; (i) 2-4e*; 

(Gj) (2t-1)°; (k) | cos 2t¢|; (1) sin 2¢ cos 4t. 
2. Find the Laplace transform of the derivatives 

d 2t), d ;,2 -+ A d ty. 
(a) zz Ite in (b) = It e |;  (c) Gz [eos 2t — te’; 
d ;,2 3:7. d : ; df 4 

(d) oP [t-e'];  (e) F [t sin2¢];  (f) a [e * cos3¢]. 
3. Use the attenuation rule and/or the similarity rule to find the Laplace transformations of the following functions. 

(a) 2; (b) 48, (c) ten, 

(d) e* sinh¢; (e) t(e~* — e**)?; (f) 6 Mt; 

(g) 7 cosht; (h)  e* sinht; (i) e7* sin 2¢; 

(j) et; (k) e%(5—2t+cost); (Il) e*%(1+t)?; 
(m) e *sin4t; (n) (e7?4+ 3c”) sint; (0) cos(2t + 1); 
(p) sin(83t—1); (q)  sinh(3t+1); (r)  e** cosh(3¢ +2). 


d 
4. Use the rules (5.2.2) and (5.2.11) to find the Laplace transform of the function t Tr (e’ sin 2t). 


5. Use the shift rule to determine the Laplace transform of the following functions. 
(a) f(t—7) where f(t)=cost H(t); (b) f(t—1) where f(t) =t H(t). 
6. Use the rule (5.2.11) to find the Laplace transform of the following functions. 


(a) te™: (b)  ¢? sin 2t; (c)  ¢? cos(3t); 
(d)  ¢ sin(2t +7); (e) te™?,; (f) #? sin(2¢— 1); 
(g) te, (h) #? sing; (i) ¢ cos(2t); 

(j) ¢ sinh(2t); (k) #? cosh(3t); (1) te7*; 


(m) —te* cosh(2t); (n) t2—¢*+t8; (0) te” sinh(3¢). 
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7. Find the Laplace transform of the following functions: 


1 2 : 1 1 . 
(a) z (e — 1}; (b) = (cos(2t) — 1); (ce) ; (2(1 — cos t)); 
1. 1 1 
(d) ; sin 2t; (e) ; (cosh2t—1); (f) ; (cos t — cosh t). 
8. Find the Laplace transform of the following periodic functions: 
sin t, if0<t<q, _ re 
(a) f(t)= { awe — and f(t) = f(t +27) for all positive t. 
(b) f(t)= { id : . i . oe and f(t) = f(t +27) for all positive t. 
i, O<eei, 
0 1<t<2 
c t) = } , and f(t+4) = f(t) for allt >0. 
(©) =F eee fle+ 4) = Fb) 
0, 3<t<4; 


9. Find the Laplace transform of the periodic function: f(t) = (—1)'*"!, where a is a real positive number. 


10. Use mathematical induction to justify £[t”] = 5c. 


11. Find the Laplace transform of the triangular wave function f(t) = )7,59[(t — 2an) H(t — 2an) + (2a + 2an —t) H(2a+ 
2an —t)]. 


12. Find the Laplace transform of the following integrals. 


t t 
(a) te | = (e** cost) dt; (b) / te * cost dt; 
o, dt 0 


a2 t d t 
(c) al e7 cos t dt; (d) ah ec” cos 3t dt. 
0 0 
13. Prove that 

¢ fe 
(a) £fty'] =» -a Se, (b) [ty] = —2ay -  E 

Litey” _ dy ay ave QE (X 

(c) [i y”] = ae? py + 2y A) 


14. Show that £[H(t— a)g(t)| (A) =e" Lig(t+.a)] (A), a> 0, and L[f(t+a)] (A) 
=e {fr(r) - foe ft)dt}, a>0 

i : ” L 2 dy” 

15. Prove the identity: L[ty”] = y(+0) — 2Ay~ — A ra 

16. Use any method to find the Laplace transform for each of the following functions. 


2(1 — cosht) l-e 7 sin kt sin kt 1 — cos kt 
a—— oy——_—— &) ce (d)——; — (e) —{ —. 
17. Find the convolution of two functions f(t) and g(t), where 
0, 0<t<l, 
(a) f(t)=sinat, g(t)=coshat; (b) f(tHh= 4 e, 1<t<2, g(t) =¢; 
0, t > 2, 
(c) txcosht; (d) txte; 
(e) e% * sinh bt; (f) coshat * cosh bt; 
(g) sinat * sinat; (h) coshat * coshat; 
(i)  cosat * cos at; (j)  sinhat « sinh at; 
(k) txt? xt; (1) ef xe xe; 
(m)  sinat * sin at * sin at; (n) © sinhat « sinh at * sinhat. 
18. Use the rule (5.2.11) to find the Laplace transform of the following functions. 
(a) te *: (b) te sin3t; (c) te cosh3t; 


(d) # sinh(2t); (e) # -t'; (f)  t?e7** sin(3t). 
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5.3 Discontinuous and Impulse Functions 


In engineering applications, situations frequently occur in which there is an abrupt change in a system’s behavior 
at specified values of time t. One common example is when a voltage is switched on or off in an electrical system. 
Many other physical phenomena can often be described by discontinuous functions. The value of t = 0 is usually 
taken as a convenient time to switch on or off the given electromotive force (emf, for short). The Heaviside function 
(5.1.5), page 274, or unit step function is an excellent tool to model the switching process mathematically. In many 
circuits, waveforms are applied at specified intervals. Such a function may be described using the shifted Heaviside 
function. A common situation in a circuit is for a voltage to be applied at a particular time (say at t = a) and 
removed later at tf = b. Such a piecewise behavior is described by the rectangular window function: 


1, if t<b 
Wh=Hd—o nije 
0, outside the interval [a, }]. 
This voltage across the terminals of the source of emf has strength 1 and duration (b — a). For example, the LRC- 
circuit sketched in Fig. 6.1(a) on page 343 has one loop involving a resistance R, a capacitance C’, an inductance L, 
and a time-dependent electromotive force E(t). The charge q(t) is modeled (see [14]) by the differential equation 
dq. dq 


1 
L fk + 


dt2 yoo 


An initial charge go and initial current Ip are specified as the initial conditions: q(0) = qo, q(0) = Io. 
If the current is initially unforced but is plugged into an alternating electromotive force E(t) = Ep sinw(t — T) 
at time T, then the piecewise continuous function E(t) is defined by 


: 0, tT, 
E(t) = Eo sinw(t— 7) H(¢—T) = { wey, ae 
Our primary tool to describe discontinuous functions is the Heaviside function (see Definition 5.3 on page 274). 
There are other known unit step functions that have different values at the point of discontinuity. For instance, a 
unit function can be defined as 


HW) =5(1+=5). t £0, 


where the value at ¢ = 0 is undefined, which means that it can be chosen arbitrarily. However, as will be clear later 
from §5.4, an application of the Laplace transformation calls for the Heaviside function. Many functions, including 
continuous and discontinuous, can be approximated as a sum of linear combinations of unit functions. For example, 
for small ¢ > 0, a continuous function f(t) can be well approximated by a piecewise step-function f. = f (€|t/e]), 
where |A] is the floor of a real number A, namely, the largest integer that is less than or equal to A. It is not a 
surprise that the inverse statement holds: the Heaviside function is the limit of continuous functions containing a 
parameter when the parameter approaches a limit value. For example, 


S—>0o 


A(t) = lim {5 + arctan(st)} ; (5.3.1) 


Related to the unit function is the function signum x, defined by 


1, z>0, 
signzx = <¢ 0, z=0, 
—l, x<0. 


Maple and Mazima have the built-in symbol signum(x), while Mathematica uses the symbol Sign[x], MATLAB, 
SymPy, and R utilize the nomenclature sign(x), and Sage uses sgn(x). The signum function can be expressed 
through the Heaviside function as signt = 2H(t) — 1, so 


H(t) = 5 ison qatl. (5.3.2) 


The two representations, (5.3.1) and (5.3.2), of the Heaviside function are valid for all real values of the independent 
argument, including t = 0. Recall that H(0) = 4. In the following, we will consider only such discontinuous functions 


292 Chapter 5. Laplace Transforms 


Figure 5.14: Approximations (5.3.1) to the Heaviside function for s = 2, s = 10 (dashed line), and s = 40 (solid 
line), plotted in MATLAB. 


that satisfy the mean value property (5.1.8), page 277. That is, all functions under consideration have the value at 
a point of discontinuity to be equal to the average of limit values from the left and from the right. 

As we saw in the first section, the Laplace transform of a piecewise continuous function is a smooth function 
(moreover, a holomorphic function in some half-plane). The Laplace transform is particularly beneficial for dealing 
with discontinuous functions. 

Keep in mind that our ultimate goal is to solve differential equations with possible piecewise continuous forcing 
functions. Utilization of the Laplace transformation in differential equations involves two steps: the direct Laplace 
transform and the inverse Laplace transform. When applying the Laplace transform to an intermittent function, it 
does not matter what the values of the function at the points of discontinuity are—integration is not sensitive to 
the values of the function at a discrete number of points. On the other hand, the inverse Laplace transform defines 
the function that possesses a mean value property. For instance, the unit function 


1, #20, 


t 
es oe 
ae Fest fi t <0; 


has the same Laplace transform as the Heaviside function: Llu] = £L[H] = 1/A. As we will see later in §5.4, the 
inverse Laplace transform of 1/\ is £~! [+] = H(t). Hence, £~1 [L[u]] = £-'L[H] = H(t), and the unit function 
cannot be restored from its Laplace transform. 

If we need to cut out a piece of a function on the interval (a, b) and set it identically to zero outside this interval, 
we just multiply the function by the difference H(t — a) — H(t — b), called the rectangular window. For example, 
the following discontinuous functions (see Fig. 5.15 on next page) 


1, 0<t<2, _ J 2, 1<t<4, 
f(t) = { 0, elsewhere; and a { 0, elsewhere; 


can be written as 
f(t) = H(t) — H(t — 2) and g(t) = 2H(t — 1) — 2H(t — 4), 


respectively. According to our agreement, we do not pay attention to the values of the functions at the points 


of discontinuity and define piecewise functions only on open intervals. It is assumed that the functions under 
consideration satisfy the mean value property (5.1.8). In our example, we have f(0) = f(2) = $ and g(1) = g(4) =1. 


In general, if a function is defined on disjointed intervals via distinct formulas, we can represent this function 
as a sum of these expressions multiplied by the difference of the corresponding Heaviside functions. For instance, 
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Figure 5.15: Graphs of the functions (a) f(t) = H(t)— H(t-—2), and (b) the function g(t) = 2H(t—1)-—2H(t—4). 


consider the function 


t, 0<t<il, 

1, 1<t<3, 
ft)= 

4-t, 3<t<4, 

0, t>4. 


This function is a combination of four functions defined on disjoint intervals. Therefore, we represent f(t) as the 
sum: 


f(t) =t[H(t) — H(t-1)) +1[H(t-1) — H(t —3)] + (4-8) [H(t — 3) - H(t 4) 
=tH(t)—(t—1)H(t-1) + (4-t-1) H(t-3)+ (4-8) B(t-4) 
=t H(t) —(t—1) H(t—1) — (t—3) H(t —3) — (t—4) H(t—4). 


1 1 1 1 
pL — —3X —4X 
2? o 0c 2 


Example 5.3.1: (Example 5.1.7 revisited) We reconsider the discontinuous function from Example 5.1.7 on 
page 276. To find its Laplace transform, it is convenient to rewrite its formula using the Heaviside function: 


f®) = (A@® —Ae-— 1) + 2+ -1) -— F¢-2))+0-d)[AG¢—2). 


In order to apply the shift rule, we need to do some extra work by adding and subtracting a number to the function 
that is multiplied by the shifted Heaviside function. Recall that the shift rule (5.2.6) requires the function to be 
shifted by the same value as the Heaviside function. For instance, instead of t? H(t — 1) we use 


PH(t—1)=(t-—14+1)? H(t-1) = [(t- 1)? + 2(¢- 1) +1] H(t 1). 


This allows us to rewrite the given function as 


f(t) = H(t) + [2- (t- 1) - (t- 1)”] H(t- 1) - [2 - 2) +5] A(t — 2). 


Now the function is ready for application of the shift rule (5.2.6). Using formula (4) in Table 280 for the Laplace 
transform of the power function, we get 


2 2 1 2 2 5 
pe ae 


Example 5.3.2: Express the square wave function shown in Fig. 5.11 (Example 5.2.13 on page 286) and the so- 
called Meander function shown in Fig. 5.16 on page 294 in terms of the Heaviside function, and obtain their Laplace 
transforms. 

Solution. It can be seen that the Meander function f(t) is defined by the equation 


f(t) = H(t) — 2H(t — a) + 2H(t — 2a) — 2H(t — 3a) + 2H(t—4a)+---. 


294 Chapter 5. Laplace Transforms 


Figure 5.16: Example 5.3.2. The Meander function. 


From formula 2 of Table 280, it follows that the Laplace transform of each term in the last representation of the 
Meander function is 


LH (t — na)|(0) = : en =0,1,2,.... 


Therefore, 
1 2 2 2 
L —rXa —2Xra —3Xra 
A) = --= = = 
fr) an + xe xe + 
1 
= : [1 Je7™ (1 e€ aX +e 2aXr e€ 3ar a ) 
1 2e— 1 fl-e™ teat, aX 
— _— — td = n — 
nN 1+e7o A [1 +e7% nN 2 
Here, we used the geometric series 
1 sk 
i =)  #altzt¢7+3re (5.3.3) 
ae a 
with z = —e~**. On the other hand, the Laplace transform of the square wave function, 


g(t) = H(t) — H(t-1) + H(t-3) -H(t-4)+---, 


1 1 1 1 
L =, SAY a td 8h 
g (A) = ce +5e xe + 
1 
_ 5 [l-e +e De Bh 44] 
t= se 1. 4 
7 a ) ~ A} 1+e-4 


Definition 5.8: The full-wave rectifier of a function f(t), 0 <t < T, is a periodic function with period T 
that is equal to f(t) on the interval [0, T]. 


The half-wave rectifier of a function f(t), 0 <t< T, is a periodic function with period 2T that coincides 
with f(¢) on the interval [0,7] and is identically zero on the interval [T, 2T]. 


Example 5.3.3: Find the Laplace transform of the saw-tooth function 


f®= = (H(t) — H(t —a)] + = (t —a) [H(t — a) — H(t — 2a)| 


Es = (t — 2a) [H(t — 2a) — H(t — 3a)| + = 


a 


(t — 3a) [H(t — 3a) — H(t —4a)|+---, 


5.3. Discontinuous and Impulse Functions 295 


Figure 5.17: Example 5.3.3. The saw-tooth function. 


0 a 2a 3a 4a Sa 6a 


Figure 5.18: Example 5.3.4. The half-wave rectification of the function f(t) = = t, on the interval [0, 2a]. 


a 


which is a full-wave rectifier of the function f(t) = 4 t, on the interval [0, a]. 


Solution. This is a periodic function with the period w = a. Applying Eq. (5.2.13), we obtain 


1 E 2 
L fe ee coe —tr 
f°) = 7a =f te di 
1 Ej 1 1 4, @ _ay El Ee 
=> _ _ & Seo 2 Ce Ceo Fr irV>hw 
l—e~” qa |? 2 ad2 X1—e-%* 


Example 5.3.4: Find the Laplace transform of the half-wave rectifier of the function f(t) = Et on the interval 
(0, 2a]. 


Solution. The half-wave rectification of the function f(t) is the following function: 


(t — 2a) [H(t — 2a) — H(t — 3a)] 


E 
+ — (t— 4a) [H(t — 4a) — H(t—5a)|+---. 
From the shift rule (5.2.6), it follows that 


LH (t — na)(t — na)] = e~" Lt] = ee" uw 


Li[H(t—na-—a)(t—na)] = L[H(t-—na-a) 
L\H(t—na—a)(t — na—a)| + aL[H(t — na —a)| 


1 a 
—(n+1)ar —(n+1)ar 
= € ( Sv = X é€ : 
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Figure 5.19: Example 5.3.5. 


Hence, the Laplace transform of F(t) is 


FF()) a as [l—e aA 1 2ar e 3ar Le 4axr e€ 5aXd | 

aX 

E 
Fi le ar 3aA 1 © Sav 4, | 

Ht 2aXr dar 6aX 

= oz lite +e +e +++] 

E 
es Be lie i Ea | 

a 

E 


—2ar 


Setting z =e , we summarize the series using Eq. (5.3.3) to obtain 


Tt e772 4 oe 4OA eg ge 


Thus, 


BE 1- —axr E —axr 
F*) = —— + 


Example 5.3.5: Certain light dimmers produce the following type of function as an output voltage: a sine function 
that is cut off as shown in Fig. 5.19 by the solid line; the jumps are at a, 7 +a, 27 +a, etc. Find the Laplace 
transform of this function. 

Solution. The output is a periodic function f(t) with the period w = 27. According to Eq. (5.2.13) on page 283, 
its Laplace transform is 


20 
7 A= = | eo" Fe, 


where 
sint, O0<t<a, 
0, O0<t<q, 
fH=4 . » 
sin t, maot<am+a, 
0, T+ta<t<2r. 


Substituting this function under the integral sign yields 


1 a T+a 
me = [— 6-2" ff sinte dt +f sinte a ‘ 
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Using the relation sint = Sei’, the imaginary part of ej, and multiplying both sides by 1 — e~?™, we obtain 


a T+a 
[1—e°?™] fe) = 3 / teat +9 | elt oe" dt 
0) Tw 


43 {at elatie e A(t +a) + Ard eit ar 


Ml 
1 
T et 3 {-(A+j)(cosa+jsina)e~** +A +5} 
—A\T 
+3 | S{(A+j)(cosa +jsina)e—** — r- j} 
because e7) = —1 and YJ = wh: - Sot. Thus, 
1 
[1 _— | aes = ad [1 _ ewe (A siiacene a)] 
eat 
+Saa 7 Ti [e~** (Asin a + cosa) — 1]. 
Therefore, 
L 1-— ert ; , 
fr”) = (1 — e297) (2 +1) [a sin a + cosa) e~ (atm) _ —-a 


Example 5.3.6: Find the Laplace transform of the ladder function 


f(t) =7H(t)+7H(t-—a)+7H(t—2a)+--- 


Solution. We find the Laplace transform of the function by applying the shift rule: CH(t—a) = \~' e~**. Thus, 


FLO) = c lite Aa 4 24a 4 6 Be ed 
= 5 [treet (ot)? + (08)? + (e™4)" + ---| 
T 1 
~ \I-e a 


0 a 2a 3a 4a sles > 
0 a ate t 


Figure 5.20: Example 5.3.6. The ladder function. Figure 5.21: Approximation of the 6-function. 


Mechanical systems are also often driven by an external force of large magnitude that acts for only very short 
periods of time. For example, the strike of a hammer exerts a relatively large force over a relatively short time. 
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s=6 


s=2 


-3 -2 -1 0 1 2 3 


Figure 5.22: Approximations 62(t,s) to the 6-function for different values s = 1, s = 3, and s = 6, plotted with 
MATLAB. 


Mathematical simulations of such processes involve differential equations with discontinuous or impulsive forcing 
functions. 
Paul Dirac®® introduced in 1926 his celebrated 6-function via the relation 


iG = / S(t — 2) ul) dé, (5.3.4) 


where 6(x) = 0 if a £0. Such a “function” is zero everywhere except at the origin, where it becomes infinite in such 


a way as to ensure 
/ d(x) da = 1. 


The Dirac delta-function d(x) is not a genuine function in the ordinary sense, but it is a generalized function or 
distribution. Generalized functions were rigorously defined in 1936 by the Russian mathematician Sergei L’vovich 
Sobolev (1908-1989). Later in 1950 and 1951, the French mathematician Laurent Schwartz?! published two volumes 
of “Theore des Distributions,” in which he presented the theory of distributions. 

Recall that Dirac introduced his 6-function in order to justify laws in quantum mechanics. We cannot see 
elementary particles like electrons, but we can observe the point where the electron strikes the screen. To describe 
this phenomenon mathematically, Dirac suggested using integration of two functions, one of which corresponds to 
a particle, and the other one, called the “probe” function, corresponds to the environment (as a screen). Hence, 
the 6-function operates on “probe” functions according to Eq. (5.3.4). The delta-function can be interpreted as the 
limit of a physical quantity that has a very large magnitude for a very short time, keeping their product finite (i.e., 
the strength of the pulse remains constant). For example, 

1 


6(t— a) = lim 6.(¢~ a) = lim = [H(t—a) — H(t-a—©)] 


As €++ 0, the function 6.(t — a) approaches the unit impulse function or the Dirac delta-function. The right-hand 
side limit is the derivative of the Heaviside function H(t — a) with respect to t, namely, 


6(t — a) = lim 6,(t— a) = H'(t—a), (5.3.5) 
e>0 
50Paul Dirac (1902-1984), an English physicist, was awarded the Nobel Prize (jointly with Erwin Schrédinger) in 1933 for his work in 


quantum mechanics. 
°1Laurent Schwartz (1915-2002) received the most prestigious award in mathematics—The Fields Medal—for this work. 
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3.5 T T T 10 
s=30 
37 s=10 7 


2.5], 3] 


0.57 7 


-0.5 F 7 


Figure 5.23: Approximations 64(t, s) to the 6-function for s = 10 and s = 30. 


and hence 


[fue-ajat=— f pe He-ajar= f f(Q5¢-aat= Fla) (5.3.6) 


for any continuously differentiable function f(t). Because of Eq. (5.3.4), the product of the delta-function 6(t — a) 
and a smooth function g(t) has the same value as g(a) 6(t — a); that is, 


g(t) O(t — a) = g(a) 0(t — a). (5.3.7) 


It should be emphasized that the derivative of the Heaviside function is understood not in the ordinary sense, 
but as the derivative of a generalized function. With the exception of the point t = 0, the Heaviside function H(t) 
permits differentiation anywhere, and its derivative vanishes in any region that does not contain the point t = 0. 
Although we cannot differentiate the Heaviside function because of its discontinuity at t = 0, we can approximate it 
by, for instance, formula (5.3.1) to obtain the derivative 


d fl 1 8 
H! : . 
6(t) (¢) jim, dt {5 a. 7 arctan(st)} ut foe) 1 (8?t? + 1) ; 
Therefore, the 6-function can be defined as the limit of another sequence of functions, namely, 


Ss 


(t) = Jim da(t,s), with d2(¢,8) = [ap yay: 


There are many other known approximations of the delta-function, among which we mention two: 43(t,s) = 
— 5242 in(st 
eet and d4(t, 8) = 2. 


Using the shift rule (5.2.6) on page 282, we obtain 
d(t — a) * f(t) = f(t—a). (5.3.8) 
The Laplace transform of the Dirac 6-function is 


La) = 62(a) = | * §(t) edt =e = 1, (5.3.9) 


The Dirac delta-function has numerous applications in theoretical physics, mathematics, and engineering prob- 
lems. In electric circuits, the delta-function can serve as a model for voltage spikes. In a spike event, the electromotive 
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function applied to a circuit increases by thousands of volts and then decreases to a normal level, all within nanosec- 
onds (10~° seconds). It would be nearly impossible to make measurements necessary to graph a spike. The best we 
can do is to approximate it by mdé(t — a), where t = a is the spike time. The multiple m represents the magnitude 
of the impulse coursed by the spike. 


The delta-function is also used in mechanical problems. For example, if we strike an object like a string with a 
hammer (this is commonly used in a piano) or beat a drum with a stick, a rather large force acts for a short interval 
of time. In [14], we discuss vibrations of a weighted spring system when the mass has been struck a sharp blow. 
This force can be approximated by the delta-function multiplied by some appropriate constant to make it equal to 
the total impulse of energy. Recall from mechanics that a force F acting on an object during a time interval (0, t] is 
said to impact an impulse, which is defined by the integral i Fdt. If F is a constant force, the impulse becomes 
the product Ft. When applied to mechanical systems, the impulse equals the change in momentum. 


Remark. It should be pointed out that the approximations of the delta-function are defined not pointwise, but 
in the generalized sense: for any continuous and integrable function f(t), the d-function is defined as the limit 


where 6(t, s) is one of the previous functions approximating the unit impulse function. 


1. Rewrite each function in terms of the Heaviside function and find its Laplace transform. 


l O<t<2, 1, 0<t<2, 
(a) sey ={ -1, t>2. (b) fy ={ 0, t22. 

t, O<t<83, 0, 0<t<2, 
(c) f(t ={ 0, t>3. (d) =f P, tS 2: 

_ fj sint, O<t<2n, _ f 0, 0<t< 37/4, 

(e) f(t ={ 0, t>2n. (f) i=) sin(2t), t > 30/4. 

l, O<t<2, 2 0<t<4, 
(g) f@=4 9, 2<t<4, (h) f(t)= 4 t/2, 4<t<6, 

-1, 4<t. 3 6<e 
(i) f(t -{ cos3t, O<t< 7/2, -{ cos3t, O<t<xn, 


0, m/2<t. G) f@) 0, m<t. 
2. For « > 0, find the Laplace transform of the piecewise-step function f-(t) = f (e|t/e]), where f(t) = t. 


3. Find the Laplace transform of the following functions. 


(a) H(t)—H(t—1)+H(t—2)—H(t—3); (b) 6(t)-4(t- 1); 
(c) sint[H(t)— H(t—7)]; (d) e'H(t-1). 


4. Use the Heaviside function to redefine each of the following piecewise continuous functions. Then using the shift rule, 
find its Laplace transform. 


t, 0<t<l, 0, O<t<lil, 
(a) f(t)=4 2-t, 1<t<2, (b) f= ft, 1<t<4, 
0, 2<t. 1, 4<t. 


5.3. Discontinuous and Impulse Functions 


0, 0<t<l, t, 0<t<l, 
© JO) 07 Gee © 05,54. oo; 
0, eas 0, 3<%, 
iL v<b<2 2, ieee 1, 
, ees. ‘ betes. 
(ce) AO = -2, 3<t<4, = t-2, 2<t<8, 
5 id, 0, ise, 
1, 02e<2 0, 0<tK<l, 
t, 2<t<4, t-1, 1<t<2, 
(ge) f@)=46-t, 4<t<6, (h) f@)=4 1, 2<t<3, 
pom. Gi e 4—t, 3<t<4, 
i #8 U; ae 


5. Find the Laplace transformation of the given periodic function. 
(a) Full-wave rectifier of the function f(t) = sint on the interval [0, 27]. 
(b) Full-wave rectifier of the function f(t) = sin 2t on the interval [0, z]. 
(c) Full-wave rectifier of the function f(t) 
(d) Full-wave rectifier of the function f(t) 


cost on the interval [0, 7/2]. 


= ¢? on the interval [0, 1]. 


(e) Half-wave rectifier of the function f(t) = sin¢ on the interval [0, 27]. 
(f) Half-wave rectifier of the function f(t) = sin 2t on the interval [0, 7]. 
(g) Half-wave rectifier of the function f(t) = cost on the interval [0, 7/2]. 
(h) Half-wave rectifier of the function f(t) = ¢? on the interval (0, 1]. 

: ss : = _ ft, O0<t<l, 
(i) Periodic function f(t) = f(t+2), where f(t) = { 2-4, L<tic2 
(j) Periodic function f(t) = f(t +3), where 

t, 0<t<l, 


Is Li > 
fo%. Bees 


(k) Periodic function f(t) = f(¢+ 4), where 


i eee td, 
fH=¢ &, 1<t<3, 
9, 3<t<a4. 
(1) Periodic function f(t) = f(t +5), where 
Dix 0<t<2, 
f= 4 9, 2<t<3, 
(t — 3)? — 2, 3<t<5. 


6. Find the Laplace transform of sign(sin ¢). 


7. Find the Laplace transforms of the following functions. 
(a) 6(t—1)-—d(t-3); (b) (€-—3)d(-3); (c) Vt? +3td(t-—1); (d) cosatd(t—1). 


8. Rewrite each function from Problem 5 in §5.1 (page 280) in terms of shifted Heaviside functions. 


9. Show that the Laplace transform of the full-wave rectification of sin wt on the interval [0, 7/w] is xripz coth za. 


10. Find the Laplace transform of the given functions. 
(a) (? oe 25)H(t—5); (b) (t-—7m)H(t—7) — (t—- 27)A(t — 27); 
(c) 9 H(t—3); (a) cos(nt)[H(t) — H(t — 2)} 
(e) (1+t)H(t— 2); (f)  sin(27t)[H(t — 1) — H(t — 3/2))}. 
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11. Can £7 Fed be found by using the differential rule (5.2.2) £7! Fed =pL! a] =De*=~—e~*? Explain why 


X+1 
or why not. 


A+1 
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5.4 The Inverse Laplace Transform 


We employ the symbol £~*[F(A)] to denote the inverse Laplace transformation of the function F(A), while £ 
corresponds to the direct Laplace transform defined by Eq. (5.1.1), page 270. Thus, we have the Laplace pair 


F(a) = (£f(t)) () = | “eM f(t)dt, f(t) = LFA). 


0 


It has already been demonstrated that the Laplace transform f"(A) = (Lf)(A) of a given function f(t) can be 
calculated either by direct integration or by using some straightforward rules. As we will see later, many practical 
problems, when solved using the Laplace transformation, provide us with F(A), the Laplace transform of some 
unknown function of interest. That is, we usually know F(A); however, we need to determine f(t) such that 
f(t) =£71[F(A)]. The inverse Laplace transform was a challengeable problem, so it is not surprising that it took a 
while to discover exact formulas. In 1916, Thomas Bromwich>? answered the question of how to find this function, 
f(t), the inverse Laplace transform of a given function F(A). He expressed the inverse Laplace transform as the 
contour integral 


1 c+joo i ctjw 
[f+ 0) + f(é—0)] fF A)e* dA = lim — / f (Aye dA, (5.4.1) 


7 Inj c—joo Ware 2nj 


NIlR 


—jw 


where c is any number greater than the abscissa of convergence for f“(A) and the integral takes its Cauchy prin- 
cipal®? value. Note that the inverse formula (5.4.1) is valid only for t > 0, providing zero for negative t, that is, 
L-'F(A)|(t) = 0 for t < 0. 

Remark. From this formula (5.4.1), it follows that the inverse Laplace transform restores a function-original 
from its image in such a way that the value of a function-original at any point is equal to the mean of its right-hand 
side limit value and its left-hand side limit value. Algebra of arithmetic operations with function-originals requires 
reevaluations of the outcome function at the points of discontinuity. For example, this justifies the following relation 
for the Heaviside functions: H?(t) = H(t). | 

In this section, we will not use Eq. (5.4.1), as it is very complicated. Instead, we consider three practical methods 
to find the inverse Laplace transform: Partial Fraction Decomposition, the Convolution Theorem, and the Residue 
Method. The reader is free to use any of them or all of them. We will restrict ourselves to finding the inverse Laplace 
transform of rational functions or their products on exponentials; that is, 


Q() 


where P(A) and Q(A) are polynomials (or sometimes entire functions) without common factors. Such functions occur 
in most applications of the Laplace transform to differential equations with constant coefficients. In this section, it 
is always assumed that the degree of the denominator is larger than the degree of the numerator. 

The case of the product of a rational function and an exponential can be easily reduced to the case without the 
exponential multiplier by the shift rule (5.2.6). In fact, suppose we know 


Fa(A) = 


the original of a rational function F(A) = P(A)/Q(A). Then according to the shift rule (5.2.6), page 282, we have 


H(t—a)f(t— a) =L[F()e™). 


5.4.1 Partial Fraction Decomposition 


The fraction of two polynomials in A 


52Thomas John I’Anson Bromwich (1875-1929) was an English mathematician, and a Fellow of the Royal Society. 
53We do not discuss the definitions of contour integration and Cauchy principal value because they require a solid knowledge of the 
theory of a complex variable. We refer the reader to other books, for example, [47]. 
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can be expanded into elementary partial fractions, that is, P/Q can be represented as a linear combination of simple 
rational functions of the form 1/(\— a), 1/(A—)?, and so forth. To do this, it is first necessary to find all nulls of 
the denominator Q(A) or, equivalently, to find all roots of the equation 


Q(A) = 0. (5.4.2) 
Then Q(A) can be factored as 
Q(A) = co(A = A1)™ (A = Aa) A AK), 


where Aj, Ag, ..-, Ax are the distinct roots of Eq. (5.4.2) and m1, ma, ..., mz are their respective multiplicities. 
A root of Eq. (5.4.2) is called simple if its multiplicity equals one. A root which appears twice is often called a 
double root. Recall that a polynomial of degree n has n roots, counting multiplicities, som, + m2+---+mz, =n. 
Thus, if Eq. (5.4.2) has a simple real root \ = Ao, then the polynomial Q(A) has a factor \ — Ao. In F = P/Q, this 
factor corresponds to the partial fraction decomposition of the form 


A 
A= 0’ 


where A is a constant yet to be found. The inverse Laplace transform of this fraction is (see Table 280, formula 6) 


A 
Lot |_| = Ae H(t 
| -4e m0, 
where H(t) is the Heaviside function (5.1.5), page 274. 

The attenuation rule (5.2.7) gives us a clue about how to get rid of Ao in the denominator. Thus, formula 1 from 


Table 280 yields 


ae FE = AH (t) (with a constant A). 


Therefore, £ [e*°t H(t)] = (A — Ao) 71. 
If a polynomial Q(A) has a repeated factor (A — Ao)’, that is, if Eq. (5.4.2) has a root Ag with multiplicity m 
(m > 1), then the partial fraction decomposition of F = P/Q contains a sum of m fractions 


te eg, So 
(A = Xo)” (A = Ao)! A No 


The inverse Laplace transform of each term is (see Table 280, formula 7) 


= Ant |<] = Am oH), 
(A — Xo) 


™(m—1)! 

Suppose a polynomial Q(A) has an unrepeated complex factor (A—Ag)(A—Ao), where Ap = a+jG, and X» = a—j8 
is the complex conjugate of Ag. When the coefficients of Q(A) are real, complex roots occur in conjugate pairs. The 
pair of conjugate roots of Eq. (5.4.2) corresponding to this factor gives rise to the term 


AX\+ B 
O= al +P 


in the partial fraction decomposition since 
(A — a — 58)(A — a + §8) = (A— a)? + B. 
The expansion of the fraction F = P/Q can be rewritten as 
A(A—a)+aA+B 
O= al +x 
From formulas 9 and 10, Table 280, and the shift rule (5.2.6), we obtain the inverse transform: 


AA+ B 


“1 A+B 
[ eens 


| =e | A.c0s 6 + aa sin Gt} H(t). (5.4.3) 
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If the polynomial Q(A) has the repeated complex factor [(A — Ao)(A — Ao)]?, then the sum of the form 


AX\+ B CA+D 
(D—aP + PPT nara 


corresponds to this factor in the partial fraction decomposition of F' = P/Q. The last fraction is in the form that 
appears in Eq. (5.4.3). To find the inverse Laplace transform of the first factor, we can use formulas 17 and 18 from 
Table 280 and the shift rule (5.2.6). This leads us to 


A\+B at E 
=e 


lem soos —tsin ft+ coe (sin Bt — Bt cos Bt)| H(t). 


28 26? 
Example 5.4.1: To find the inverse Laplace transform of 
(a #5) 
FE UEEW GEER a 
(A — a)(A — b) , 


where a and 0 are real constants, we expand the given fraction into elementary partial fractions: 


1 A B 


DO) Cae)” Oo) 
The coefficients A and B are determined by multiplying each term by the lowest common denominator (A—a)(A—}). 
This leads to the equation 
A(A — b) + B(A-— a) = 1, 
from which we need to determine the unknown constants A and B. We equate the coefficients of like power terms 
of A to obtain 


A+B=0, —(Ab+ Ba) =1. 
Thus, A = —B = 1/(a— b). Now we are in a position to find the inverse Laplace transform of this fraction: 
1 1 1 1 
=1 -1 at bt 
ae a Pe —e) H(t 
. —S rs = — eagle eae), 


where H(t) is the Heaviside function (5.1.5). We multiply the last expression by H(t) because the inverse Laplace 
transform vanishes for negative values of the argument t, but the functions e® and e” are positive for all values of 
t. Mathematica confirms our calculations: Apart [1/((lambda - a) (lambda - b))] 


Example 5.4.2: In a similar way, the fraction 


1 
Tare F) 


can be expanded as 
1 id 1 1 
(2 + a2)()2 +62) bb? — a? | 2 +02 2 + 82 |” 


To check our answer, we type in Maple: 
convert (1/((lambda’2 + a°2)*(lambda’2 + b°2)) ,parfrac, lambda) 


Using formula 8 from Table 280, we obtain the inverse Laplace transform: 
fot a ee = eae aes = a 
(A? + a?)(A\2 + 67) b? — a? +a? 2+ 62 
1 1 1 1 
— | en a 
b? — a? Ferd b? — a? Ferd 


1 sinat — sinbt 
za | ab | #0. 


where H(t) is the Heaviside function. The last expression is multiplied by H(t) because the Laplace transform deals 
only with functions that are zero for negative values of an argument, but the functions sin at and sin bt are not zero 
for negative values of t. 


5.4. The Inverse Laplace Transform 


Example 5.4.3: Find 


pt (As 
M2+2A\4+5]- 


Solution. We first expand this ratio into partial fractions as follows: 


A+5 A+1+4 A+1 2 


SSS SS SS SSS = SSS SSS SS 2 . 
M+2A4+5 (A41)? 42? Aries melee 
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Since we know from Table 280 that £7! |>4.5| = cos2t and £7! |=%,5] = sin 2t, the attenuation rule (5.2.7) 
N242 42 


yields 


co goes = [et cos 2t + 2e* sin 2t] H(t). 


Example 5.4.4: Find the inverse Laplace transform of the function 


4X+5 


F(\) = —————_.. 
(A) 4A\2 + 12+ 13 


Solution. We rearrange the denominator to complete squares: 
AN + 12\ 413 = (2A)? +. 2-24-3437 — 37 413 = (2A +3)? +4. 


Hence, we have 
F(.) = AN+6—-1 20A+8) _ 1 
— (2A4+3)? +4 (2443)? +4 (2443)? +4° 


Setting s = 2+ 3 gives 
F(Ms)) = yy - 
Pe tA gt’ 


The inverse Laplace transform of the right-hand side is 
1 
2cos 2t H(t) — 5 sin 2t H(t). 
Application of Eq. (5.2.15) yields 
1 t 1 t t 
-1 _ +, at/2 By a deg pot Vee 
Lo |fAIO 5 © 200s (23) 5 sin ( 5) | (5) 
3t/2 1 
e€ cost — 7 sint| H(t) 
because H(t/2) = H(t). 


Example 5.4.5: Find the inverse Laplace transfrom of the fraction 


242 + 6A+ 10 
(A — 1)(A? + 4A + 18)" 


Solution. Since the denominator (A — 1)(A? + 4\ + 13) = (A — 1)(A + 2)? + 3?) has one real null 4 = 1 and two 


complex conjugate nulls \ = —2 + 3j, we expand the given function into partial fractions: 


2\? + 6A + 10 1 A+2 1 


O=D08+4418) ial” Gada” aoe es 
With the aid of Table 280 and the attenuation rule (5.2.7), it is easy to find the inverse Laplace transform: 


2\? + 6+ 10 


Jat 
O=Deetaai| °° 


1 
L* —2t cos 3t + ae sin 3t, t>0. 


As usual, we should multiply the right-hand side by the Heaviside function. 
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Example 5.4.6: Find the inverse Laplace transform of 


a 


F(A) = OA 1) e€ 


Solution. The attenuation rule (5.2.7) suggests that we first find the inverse Laplace transform of the fraction 


A+3 


We expand F(A) into partial fractions: 
A+3 A B C 


Miai) 2 ok A 
To determine the unknown coefficients A, B, and C we combine the right-hand side into one fraction to obtain 


Me ADK De DO=)+Ccr 
A—1) PA—D 


Therefore, the numerator of the right-hand side must be equal to the numerator of the left-hand side. Equating the 
power like terms, we obtain 
0=C+B, 1=A-B, 3=-A. 
Solving for A, B, and C in the system of algebraic equations, we get A = —3, B = —4, and C = 4. To check our 
answer, we use the following Maxima command: 
partfrac((lambdat3) /((lambda-1) *lambda‘2) ) , lambda) ; 
Thus, the inverse Laplace transform of fF’ becomes 


A+3 


fo=l-* cer = H(t) (4e’ — 3¢ — 4). 


Now we are in a position to find the inverse Laplace transform of F2(\). Using the shift rule (5.2.6), we have 


fo(t) 


L~* [Fo(A)] = H(t — 2) (4e*~? — 3(¢ — 2) — 4) 
H(t — 2) [4e*-? — 3¢ 4 2]. 


5.4.2 Convolution Theorem 


Suppose that a function F(A) is represented as a product of two other functions: F(A) = Fy(A)- F2(A). Assume 
that we know the inverse Laplace transforms f(t) and f2(t) of these functions F(A) and F2(A), respectively. Then 
the inverse Laplace transform of the product Fy (A) - F2(A) can be defined according to the convolution rule (5.2.1), 
page 282: 


t 
Lo" [Fi(a) Fo(A)] (t) = £7" [Fi(Q)] * £77 LQ) = (fi * fa) = if Fir) fa(t — 7) dr. 
0 
It turns out that one can calculate the inverse of such a product in terms of the known inverses, with the convolution 
integral. 
Example 5.4.7: Find the inverse Laplace transform of the function 
1 1 1 


Ay ea 


Solution. The function F' is a product of two functions F(A) = Fy(A)F2(A), where 
F(A) =A, F(A) = (A- a), 
with known inverses (see formula 1 from Table 280 on page 281) 


Alt)=L°{A()}=H), felt) = {FO} = He", 
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where H(t) is the Heaviside function (recall that H(t) = 1 for positive t, H(t) = 0 for negative t, and H(0) = 1/2). 
Then their convolution becomes 


f(t) = (fi * fay(t =f nt-nAte i (r) e*7 dr 
=f He-0 ear f dr = — HW) 


a 


As usual, we multiply the result by the Heaviside function. 


Example 5.4.8: (Example 5.4.1 revisited) The function F(A) = (A—a)~!(A — b)~! is the product of two 
functions 


1 1 
fA) == and F2(\) = 1,=h 
with inverses known to be 
1 1 
—_ p-l — pat = —1 = bt 
fit) =L os = -| e“ A(t) and fo(t)=L ra = 7] e” H(t). 
From Eq. (5.2.1) on page 282, it follows that 
fe) =c7 : (fi « fa)(0) 
7 (X—a)(A—8) ee 


Straightforward calculations show that 


(fi * fay(t )= f nte-nat T) dr ay et eb dr = (t > 0) 


1 
at eet ar at bt 
- drt = _ A(t). 
e€ [« Sr [e e€ | (t) 


Example 5.4.9: Find 
1 
t) = £°-1 | ——_._—_~—____ |. 
FO) ec | 
Solution. The function F(A) is a product of two functions F,(A) = (A? + a?)~1 and Fo(A) = (A? + b?)~1. Their 
inverse Laplace transforms are known from Table 280, namely, 


1 
2 + a2 


Pie) ce ome | = * sinat, Bo=Lf- se 7 = sind, £>0. 


Hence, their convolution gives us the required function 


f(t) = (fi * fa) (t) = - | sin at sin b(t — r) dr 


1 


t 
ae I [cos(at — bt + br) — cos(at + bt — br)] dr 


[ d sin(at — bt + br) — sf d sin(at + bt — br) 


= SG (a +b) 2a wa (a — 
_ sin(at — bt + bt)+sinbt — sin(at + bt — bt) — sin bt 
7 2ab(a + b) 2ab(a — b) 
in at + sin bt in at — sin bt 
_ sina +sinbé — sinat— sin pi: 


2ab(a + b) 2ab(a—b) ’ 
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Example 5.4.10: To find the inverse Laplace transform £7! potas]: we use the convolution rule. Since 


_ = = 1 sin at 
£ 1 [A "| = t H(t), L 1 Feed => Ai H(t), 
we get 
Z i sinat 1 [* : 
fH=L aots|- re =i (t — Tr) sinat dr 


t t 1 t 
=- sin at dt — — T sinat dr 
a Jo a Jo 


tcosar|™ 1 ce ot ae 
= —-- — + —5T cosaT += cos at dt 
aq @ |;29 @ r=0 % Jo 
t sin aT t sin at 
a ee aa $20 
a Gg 3 a 
Example 5.4.11: Evaluate 
1 sinat sinat 
t) = £71 |... ] = —« . 
F(t) oo a a 
According to the definition of convolution, we have 
1 ft 
f(t)=- | sinar sina(t — 7) dr (t > 0) 
a” Jo 
os ae 
=a | cos(at — at + ar) dr — 2a | cos(at + at — ar) dr 
ge : tcos at 1. 
=F (sin at + sin at) — ae oe (sin at — at cos at) H(t). 


5.4.3 The Residue Method 


Our presentation of the residue method does not have the generality that is given in more advanced books. Nev- 
ertheless, our exposition is adequate to treat the inverse Laplace transformations of the functions appearing in this 
course. The residue method can be tied to partial fraction decomposition [16], but our novel presentation of the 
material is independent. 


P(r 
Suppose a function F(A) = > ~ is a ratio of two irreducible polynomials (or entire functions). We denote by 
Ajy G = 1,2,-+0)N, all nulls** of the denominator Q(\). Then the inverse Laplace transform of the function F' can 
be found as 
N 
_ pol _ Mt 
ft) =L{FO)}= 5° Res F(A)e™, (5.4.4) 


SAG 


* at the point 


where the sum ranges over all zeroes of the equation Q(\) = 0, and residues of the function F(A\)e* 
A=; (7 =1,2,...,.N) are evaluated as follows. 


If \; is a simple root of the equation Q(A) = 0, then 


P(Aj) 
Res F(A)e™’ = —* es 5.4.5 
ee aN) O's) (5.4.5) 
If A; is a double root of Q(A) = 0, then 
d 
Mo 4; _y)2 Mt 
Ls F(Aje™ = ae D (A— Aj)? F(A) e™*}. (5.4.6) 


54 N = 00 if Q(A) is an entire function. 
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In general, when \, is an n-fold root of Q(A) = 0 (that is, it has multiplicity n), then 


1 qv1 
At : n At 
- ae A, 
Res F(A)e Jim mae {(A — Aj)"F(A) et (5.4.7) 
Note that if A; = a + j@ is a complex null of the denominator Q(A) (the root of the polynomial equation with real 
coefficients Q(A) = 0), then A; = a — jG is also a null of Q(A). In this case, we don’t need to calculate the residue 
(5.4.4) at a@ — jG because it is known to be the complex conjugate of the residue at a + jj: 


P(X) ove POA) 5 
a5 OQ) in (Res a0) ° ). 


Therefore, if the denominator Q(A) has two complex conjugate roots \ = a+ j@, then the sum of residues at these 
points is just double the value of the real part of one of them: 
P(A) P(A) P(A) 


Res —— —— =2" Res ——. 5.4.8 
otis Q(A) | 0-38 Q(A) atid Q(A) pee) 


In the case of simple complex roots, we get 


Res F(Aje*! + Res F(A)e** = e* {2A cos Bt — 2B sin Bt}, (5.4.9) 
at+j a—Jj 


P(a +i) P(a +8) 
at J atj 
A= Res F(A) = Re ————,, B=S$ Res F(A) = In ———. 
agg PO) Q'(a + j8) 2 PO) Q'(a + j8) 
Example 5.4.12: We demonstrate the power of the residue method by revisiting Example 5.4.1. Consider the 


function 


1 
= =a aay 


According to Eq. (5.4.4), we have 


At ert 


sy 1 = : . 
f(t)=L Fenrent = O00). 5 Canc 


We denote P(A) = e*/(A—b) and Q(\) = A—a when evaluating the residue at the point \ = a because the function 
P(A) is well defined in a neighborhood of that point. Then from Eq. (5.4.4), it follows that 


ert P(a) ert 
P ae 


hg 2-8 
because Q’(a) = 1. Similarly we get 


e€ e€ 
= (0-60 p 1s 


yp OG 


Adding these expressions, we obtain 


1 1 
Loi — at _ bt H(t). 
lacaaca| pope 
Note that we always have to multiply the inverse Laplace transformation by the Heaviside function to ensure that 


this function vanishes for negative t. However, sometimes we are too lazy to do this, so the reader is expected to 
finish our job. 


Example 5.4.13: Consider the function 
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Using the residue method, we get its inverse Laplace transform 


At ert 


“1 1 = Bes e 
no =£" aE] > BS aR TES 


Since the \ = a is a double root, we apply formula (5.4.6) and obtain 


a d e* te“ (\ — b) — e** er 1 _ 
ROS. —— a a = — e ze 
rA=a (A—a)2(A—b)  =dAA-—||y_, (A — b) wa SU (a — b) 
The residue at A = 6 is evaluated according to Eq. (5.4.5): 
Mt ext cbt 
Res ——_.—__ = —. = 
Sb (A-aP—r—b) -apP|,_, ae 
Therefore, 
1 t 1 ebt 
—p-l = at at t . 
i lacaos| ab Gate | Gap 


If one prefers to use the partial fraction decomposition, then we have to break the given function into the sum of 


simple terms: 
1 1 il 1 1 il 1 


904) 1-0 Oa a ea 


Since the inverse Laplace transform of every term is known, we get the same result. 
Application of the convolution rule yields: 


Wo = £" loopy 


t t 
= te xe — | ret ebt-7) dr = a 7 elt—))t gr 
0 


0 
=. igbt HF [(-1 + (@— bjt) e-9] / 
t 1 a—b)t t 
= e Cae [(-1 + (a= bjt) e' Mt) +e? Gar 


Example 5.4.14: (Example 5.4.10 revisited) Using the residue method, find the inverse Laplace transform 


o=0 lage) 


Solution. The denominator \?(\? + a?) has one double null 4 = 0 and two simple pure imaginary nulls A = +ja. 
The residue at the point A = 0 can be found according to the formula (5.4.6): 


a . d a _ te (2 +47)-2re*¥ st 
Res « ——.——— > = lim — | — | = lim ———.——..—__ = =. 
0 | A? (A2 + a?) x30 dA | A? + a? 40 (A? + a?)? a? 


To find the residues at the simple roots \ = +ja, we set P(A) = \~? e** and Q(A) = A? +.a?. Then with the benefit 
of the formula (5.4.5), we get 


eo ; ert etdjat cos at + jsin at 
Res ¢« ——.—_~- p = _ lim ~— = : = ——___.—__.. 
+ja | A?(A\? + a?) Aotja A2-2X — 2(+ja)3 +2ja3 


Summing all residues, we obtain the inverse Laplace transform: 
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iat ewjat 
2j a 
Now we find f(t) using partial fraction decomposition as follows: 


because sin at = 


1 11 1 1 11 1 a 


A? (02 +a?) = a2 \2 a2 \2 +a? a2 \2 a> \2 +a2- 


The inverse Laplace transforms of each term are known from Example 5.1.2 (page 272) and Example 5.1.10 (page 
278) to be 
1 1 1 1 a t sin at 
Sa | nn (er || (a 
L acer we =| wat Feed ) yo 


Example 5.4.15: Find the inverse Laplace transform 


fo= 07 ?+5\—4 |. 


AB + 8A2 + 2A 


Solution. We start with the partial fraction decomposition method. The equation \? + 3\? + 2\ = 0 has three 
simple roots \ = 0, A = —1, and A = —2. Therefore, 


M7 +5\—4 A B C 


Pease, A el Ae 
with three unknowns A, B, and C. Multiplying each term on the right-hand side of the last equation by the common 
denominator, \(A + 1)(A + 2), we get 


A(A +. 1)(A + 2) + BA(A + 2) + CA(A +1) = 0? + 5A - 4, 


We equate coefficients of like powers of A to obtain A = —2, B = 8, and C = —5. Thus, the final decomposition has 


the form 
M+5A\-—4 2 8 5 


SEA OA LOD 
The inverse Laplace transform of each partial fraction can be found from Table 280. The result is 


2 
FO) = £7 |S + og ~ yay] = (At Bet 5) HO. 


Now we attack the inverse formula of the same fraction using the residue method. For simplicity, we denote the 
numerator, (A? + 5A — 4) e*’, as P(A) and the denominator, \®+3A?+2A, as Q(A). The derivative of the denominator 
is Q’(A) = 3A? + 6A +2. Substituting into this formula \ = 0,—1, and —2 yields Q’(0) = 2, Q’(—1) = —1, and 
Q’(—2) = 2, respectively. The corresponding values of the numerator at these points are P(0) = —4, P(—1) = —8e~*, 
and P(—2) = —10e~”". Since all roots of the equation Q(A) = 0 are simple, Eq. (5.4.5) can be used to evaluate 
residues 


PO) . P(-1) -,, P(-2) 
a0) O° * AP) 


f= e = (-24+ 8e'—5e-) H(t). 


Example 5.4.16: Consider the function 


From Example 5.4.12, we know that 
fQ=L% ee (e* —e") H(t) 
(A — a)(A — 0) , 
Using the shift rule (5.2.6), page 282, we obtain 


eon 


fe sos = f(t—cdH(t—c) = =a ee — oe) H(t—c). 
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On the other hand, if we apply the residue method, then formally we have 


re enon " elt—c)A e elt-c)A 
SS ~ OO) 2 OO 0) 
1 
= a(t—c) _ ,b(t—c) 
a—b le 


however, we obtain the correct answer only after multiplying this result by the shifted Heaviside function, H(t — c). 


Finally, we present two examples to show that, in general, the considered methods (partial fraction decomposition, 
convolution, and the residue theorem) are not applicable directly; however, the inverse Laplace transform can be 
found with the aid of other techniques. 


Example 5.4.17: Suppose we want to find the inverse Laplace transform £7 [in As). No methods have been 


A+2 
discussed so far for finding inverses of functions that involve logarithms. However, note that 
d, A+1 d 1 1 
— In — = — (In(A + 1) —- In(A + 2)) = —— - —— 
Ooo oo ee) a 


and that £7! |~4 — ~4,| = Je~' — e~?*] H(t). Then according to the multiplicity rule (5.2.11), page 283, we have 
MHI — W428 


ye —2t_ -t 
| hn A | ee 7 


A+ 2 t 


Example 5.4.18: In this example, we consider finding inverses of Laplace transforms of periodic functions. For 
instance, using the geometric series, we get 


—1 1—e _ pl 1 = no—n 1 = n—A,—n 
cae cereal tens Eye ener Dna ) 


n=0 n=0 
-1 1 = n ,—nr 1 = no—(n+1)r 
n=0 n=0 
-—1 1 — no—nr 1 ~ k+1,—kxr 
=f 7 S- 1)"e"™* — 7 a 1) re 
n=0 k=1 
- I 2 — no—n — n 
SE" | 55 Cite | He +2) Clan) 
n=1 n=1 


Thus, the inverse of the given function is the periodic function f(t) = 1 for0 <t< land f(t) =—-1forl<t< 2, 
with the period 2: f(t) = f(t+ 2). 


1. Determine the inverse Laplace transforms by inspection. 


(a) shy 0) 3 (©) oS @) OE 8) 
_ , ‘ A+1)° 
(f) 5343; (g) oops (h) XO} @ 5 @ “4h: 
18(A+2 dA-2 ‘ 
he O Be @& S&S: @ BG ©) D- 
2. Use partial fraction decomposition to determine the inverse Laplace transforms. 
8 . ae 200241). 9d? —36 . 
(2) opposyi  ()) sorry) og «XE D: 
(e) 5AZ—A+1, (f) 2A+1 ; ( ) Ar : (h) 10 < 
Boren cane! (©) gape! O) aatey 
(i) XOAFDOZ41)? (j) OFT? (k) XO749)? (1) 2-215? 
5, 20724244, 4a+1_. A+2 
(m) 33 (nt) gop ©) athe = () oft: 


3. Use the attenuation rule (5.2.7) to find the inverse Laplace transforms. 
2A+3 5 A-1 4x — 2 
>———. (b >. >. (d) =. 
@ yooto © wrap © Boas | DRO FI 
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4. Use the convolution rule to find se inverse Laplace transforms. 


(a) 355 (b) x PORT (c) OO (d) 
QO cm © cape © apo 
(i) CFD GQ ope () psy (1) 
(m) Cai (n) OPA eR? FD} (0) st (p) 


5. Use the residue method to determine the inverse Laplace transforms. 


4 ; 
AQP? 
D275 my 5 


XOETD?} 


8A+8 
(9\2+1)(42—1)° 


241, ee A244 : 

(a) (A=1)(A+2)? (b) 2-4) (c) AOAFDOA—4) ’ 

2\-8_, 2443, 13\2 424126, 

(d) XO2+4)? (e) XOassy? (f) T—2) (A249)? 

(g) (A+1)2 (h) (A41)744, (i) 4412 

8) 0= me o O52) 0749)" : Qz+ E418)? 

Gj) OPPO (kK) Gaya7) Q) Gata 

(m) eee ELS rena (n) ee eo (0) eevee 
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6. Apply the residue method and then check your answer by using partial fraction decomposition to determine the inverse 


Laplace transforms. 


2 : A=3 6A+4_. 
(a) X7=GA410? (b) MA oats? (c) MFFBATS? 
(qd) ae (e) (4)?? (f) seyexp 
( ) 3A+12 7 (h) A+1 (i) 6A+4_, 
& OF207=1) } X7+2\-F10' N2+2N 5° 
: 16 : 
G) ati (kK) Gem? Q) praaiop=ay! 

3Q+1) 5-10 8(A3—33A7+90A—11) 
(m) (x2 4413) (a2)? (n) (\2—4\413)(A2—4A 48)? (0) (\2—6A413)(\2 +4A—12) * 

7. Find the inverse Laplace transform. 
20—(A—2)(A+3)_, A+9 (A+2) A+1)=(A—3)(A=2). 

(a) OD 2)(A43)? (b) TA 1)(AZ 49)? (c) (Q—1OF2)0—3)? 


Qx24rA-28 1242 17(A+1) . 
(d) OF1)(—2) F4)? (e) (x2 F2A45) (A= 2)2? (f) (x2 —4\45) (42)? 
(3) Bee Oe - (hh __5A74+5A+10 (i) 5A+29 
gs OS DOF)O= 3) arms OPFBATS)(A= Des 
@ wap (Kk) era QO wrgSD 
(m) 2A+ (n) 11 é (0) A+6 
CELERANT Ty) (x2 —4n+13)(AF2)? (4x2 —4\+10)(A—2)* 


8. Using each of the methods: partial fraction decomposition, convolution, and residue method, find the inverse Laplace 


transform of the function 1/(A — 


a)(A— b)(A—c). 


9. Find the inverse of each of the following Laplace transforms. 


A+a 2+e% 

In(l+4/\*); (b) In — 

@) n+); @) BST) 
2e7 A+3 Mw +1 
d ; In ——,; f) n———. 
@) =: @) my © my y 
10. Using the ais rule (5.2.6), find the inverse Laplace transform of each of the following functions. 
we ae Beeb) He. ©) aetia e 
(d) ~—4d7410X e7 2X (e) 2d e —3A (f) 30 
(s) se (hy) See. (@) Hae 
(i) 288 (k) 3A2=2A_ ,-3X (1) 23 —y? 
J) YoOrpy: Meza Ee: (2a 45)2 © 


11. Use the convolution formula to find the inverse Laplace transform of each of the following functions. 


2 
(d) QP +4 


>) — 


a) its 0b) fs (e) 


12. Show that for any integer n > 1 and any a £0, 


e724 


eo 


1)? 


L 
eat , 


-1 1 ae oe 
. Larrea =a lara « 


13. Use the formula from the previous exercise to show that 


i? ! ace fife [os tdtdt---dt 
(A? + a?)r+1 — 2ran! 0 Ff 7 sin @ . 
a 


n times 
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5.5 Homogeneous Differential Equations 


In Section 5.2, we established some properties about Laplace transforms. In particular, we got the differential rule, 
which we reformulate below as a theorem. 


Theorem 5.6: Let a function-original f(t) be defined on a positive half-line, have n — 1 first continuous 
derivatives f’, f”,..., f("—), and a piecewise continuous derivative f(”)(t) on any finite interval 0 < t < t* < oo. 
Suppose further that f(t) and all its derivatives through f(”")(t) are of exponential order; that is, there exist 
constants T,s, M such that |f(t)| < Me**, |f’(t)| < Me, ..., |f™(£)| < Me* for t > T. Then the Laplace 
transform of f(")(t) exists when RA > s, and it has the following form: 


n 


£|F™@] Q) = *LIFIO) — D> a * #0). (5.5.1) 


ki 


Here, f*-)) (+0) means the limit f-1 (+0) = lima 410 f-Y (2). 


This is an essential feature of the Laplace transform when applied to differential equations. It transforms a 
derivative with respect to t into multiplication by a parameter (plus, possibly, some values of a function at the 
origin). Therefore, the Laplace transform is a very convenient tool to solve initial value problems for differential 
equations because it reduces the problem under consideration into an algebraic equation. Furthermore, initial 
conditions are automatically taken into account, and nonhomogeneous equations are handled in exactly the same 
way as homogeneous ones. When the initial values are not specified, we obtain the general solution. Also, this 
method has no restrictions on the order of a differential equation, and it can be successfully applied to higher order 
equations and to systems of differential equations (see §8.5). 

We start with the initial value problem for a second order homogeneous linear differential equation with constant 
coefficients (a, b, and c) 


ay" (t) + by'(t) + cey(t)=0, = y(0)=yo, y'(0) =m. (5.5.2) 


To apply the Laplace transform to the initial value problem (5.5.2), we multiply both sides of the differential equation 
by e** and then integrate the results with respect to ¢ from zero to infinity to obtain the integral equation 


af “ate atte [ y' (t)e—*# atte [ y(t) e—** dt = 0. (5.5.3) 
0 0 0 
Integrating by parts (or using the differential rule (5.2.2)) yields 
| y" (te dt = | e*d(y’) = y' ae ee +2 | y'(t)e~* dt 
0 0 0 
==) +a fo e*awy) 
0 


= —y'(0) iit. Ay(t) ae + ’ i y(t) et dt 
0 


and analogously 


0 


since the terms 
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approach zero at infinity (when t + +00); the general formula for integration by parts is 


b b 
/ u(t) dv(t) = v(6) u(d) ~ v(a) uta) — f POnots 


From the integral equation (5.5.3), we get 


a i u(t) eat — do — | +b i u(t) eat ~ yo] tof y(t) e~* dt = 0. 
0 0 
If we denote by 
woy= fo wear 
the Laplace transform of the unknown function, then the integral equation (5.5.3) can be written in compact form: 
(ad? + bA + c)y” = a(Ayo + yo) + byo- (5.5.4) 


Therefore, the Laplace transform reduces the initial value problem (5.5.2) to the algebraic (subsidiary) equation 
(5.5.4) for the Laplace transform of the unknown function, y”. Solving Eq. (5.5.4) for y”, we obtain 


Ly, _. &AYo + Yo) + bYo 
ee arc re ware 


Notice the way in which the derivative rule (5.2.2), page 282, automatically incorporates the initial conditions into 
the Laplace transform of the solution to the given initial value problem. This is just one of many valuable features 
of the Laplace transform. 

Thus, the Laplace transform y” of the unknown function is represented as a fraction of two polynomials. The 
denominator of this fraction coincides with the characteristic polynomial of the homogeneous equation (5.5.2). The 
degree of the numerator is less than the degree of the denominator. Therefore, we can apply methods from the 
previous section to find the inverse Laplace transform of y”. 

To find the solution of the initial value problem, we apply the residue method to obtain the inverse of the function 
y”. If both roots \y and Az of the characteristic equation a\? + bA + c = 0 are different (that is, the discriminant 
b? — dac # 0), then we apply the formula (5.4.5) to obtain 


o) +b a o) +b 
oe c 1yo + Yo) + bYo Jaret _, AA2Y0 + Yo) + bY0 orse 


H(t). oO. 
2a, +b 2ar2 +b (t) P25) 


If the characteristic equation has one double root \ = A, (that is, b? = 4ac), then the characteristic polynomial will 
be a\? + bA +c = a(A— Aj)”, where Ay = —b/2a. In this case, we apply the formula (5.4.6) and the solution becomes 


ee 2 aAY¥o + Yo) +byo 441 14, dd i Mt 
WO) = im gy {0 — May? SEER eo) =F i Fy (love + 0) + ble 


t ae bt 
= we + “fa digo + 94) + bya)! = eH [yy + yo + th] HUD. (5.5.6) 


In formula (5.5.5), we use only our knowledge about the multiplicity of the roots of the characteristic equation, but 
do not acquire whether they are real or complex numbers. Hence, the formula (5.5.5) holds for distinct roots, which 
may be either real numbers or complex numbers. 

Let us consider the initial value problem 


Ly[Dly(t)=0, — y(0) = yo, y'(0) = 96, ---, y"2(0) = yg”, (5.5.7) 


where L,,[D] is a linear constant coefficient differential operator of the n-th order, that is, 


n 
Lg (D] = 0,0" a_iiD™ 1 + fag = So ax Dp’, D=d/dt, 
k=0 
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and ag, a1, .--, @, are some constants. To apply the Laplace transform, we multiply both sides of the differential 
equation L,,[D]y(t) = 0 by e~* and then integrate the result with respect to t from zero to infinity to obtain 


Co 


an | y”) (t) et dt + ant | y(t) et dt+---+ a | y(t) et dt =0. 
0 0 0 


We integrate by parts all integrals that contain derivatives of the unknown function y(t). This leads us to the 
algebraic (subsidiary) equation 


where 
oe) n k 
y= | y(t) e* dt, and P(\) = S- ak » Ne ye) (40), 
0 
We can rewrite the equation in the following form: 
L(A)y” = P(A), (5.5.8) 


with the characteristic polynomial L(A) = a,” + an_1A\"-1+-++-+ a9. Notice that Eq. (5.5.8) can also be obtained 
from the differential rule: 


L{L[D\y} =L pS Gk vy = y ax L [D*y] 
k 


k=0 


=0 
= D ak piv) — S- Pa yD) (40) 


s=1 


yX (A) = =. (5.5.9) 


From this equation, we see that the Laplace transform y” is represented as the fraction of two polynomials P()) 
and L(A). Therefore, to find the solution of the initial value problem (5.5.7), we just need to apply the inverse 
Laplace transform to the right-hand side of Eq. (5.5.9). Various techniques to accomplish this goal are presented in 
the previous section. 


Example 5.5.1: Solve the initial value problem 
yl" +u*y=0, y(+0)=1, y/(+0) =2. 


Solution. We transform the differential equation by means of the derivative rule (5.5.1), using the abbreviation y” 
for the Laplace transform y! = Jo. y(t) ee! dt of the unknown solution y(t). This gives 


dy” — rAy(+0) — y’ (+0) + w?2y” =0, 


and, by collecting similar terms, [A? + w?] y* = A+ 2. Division by \? + w? yields the solution of the subsidiary 
equation: 

— At+2 

Hw?! 

Thus, y” is a fraction of two polynomials. To find its inverse Laplace transform, we apply the residue method. The 
equation \? + w? = 0 has two simple roots \; = jw and \2 = —jw. Applying the formula (5.4.5) on page 308, we 
obtain the solution of the initial value problem 


y” (A) 


_ A+? M A+2 Mu 
ult) = aye a Ga 


1 1 ‘ 1 1 : 
~ E ' ia| a ? E - ia| — 
dS=jw =H jw 2 jw 2 jw 
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since the derivative of \? + w? is 2\. Next step is to simplify the answer and show that: it is a real-valued function. 
This can be done with the aid of the Euler formula (4.5.2), page 213; however, the fastest way is to apply Eq. (5.4.8), 
page 309, and extract the real part: 


1 1 P : 2. 3 2 
y(t) =2® E + =| J* —~ Rel + KR el”! =coswt+—sinwt, t>0. 


We can also find the solution using the partial fraction decomposition method. For this purpose, we rewrite the 


expression for y’ as 
y=(A) = ee 
M+w2 wr? +? 


Using formulas 7 and 8 from Table 280, we obtain the same expression. 


Example 5.5.2: To solve the initial value problem 


y" +5y'+6y=0, y(0)=1, (0) =0, 


t 


by the Laplace transform technique, we multiply both sides of the given differential equation by e*’ and integrate 


with respect to ¢ from zero to infinity. This yields 
(17 +5\ +6) y" —A\—-5=0, 
where y” is the Laplace transform of an unknown solution y(t). The solution of the subsidiary equation becomes 


A+5 A+5 


L 
A) = > CF Oo 
y) M+5A+6 (A+3)(A+2) 
We expand the right-hand side into partial fractions 


APO Ag Be 
(A+3)(A+2) A+3 5 X+2 


with two unknowns A and B. Again, by combining the right-hand side into one fraction, we obtain the relation 


A+5 AXE BBO 3) 


(A + 3)(A + 2) (A+ 3)(A+2)  — 
The denominators of these two fractions coincide; therefore, the numerators must be equal; that is, 
A+5 = A(A+ 2) 4+ BIA+3) = (At B)A+2A4 3B. 


In this equation, setting 4 = —2 and then A = —3, we get A = —2 and B = 3. Hence, 


=o 3 
L = 
. O) = sa3ty49° 


Applying the inverse Laplace transform, we obtain the solution of the initial value problem: 
y(t) = 3H (t)e"** — 2H(t) e*. 


Following the previous example, the easiest way to find the solution is to apply the residue method. Since we 


know that the denominator has two simple roots A; = —2 and Ap = —3, we immediately write down the explicit 
solution ae Sede 
bs oo oe Ait 2 Azt — [3 e—2t _ 9 e-3¢] A(t). 
y(®) ee i Pea is [3 e e | (t) 
We can also apply the convolution rule (5.2.1) if we rearrange y” as follows: 
A+5 A+342 il 2 
yA) = >a = 


——— 
(A+3)A42)  (A+3)(A4+2) A+2 °° (A4+3)(A4+2) 
Then using formula 6 from Table 280, we get 


y(t) = fot ssa + 2L7! Fe . sl => e 7 H(t) + Qe * e 2% H(t). 


318 Chapter 5. Laplace Transforms 


5.5.1 Equations with Variable Coefficients 


The Laplace transformation can also be applied to differential equations with polynomial coefficients, as the following 
examples show. 


Example 5.5.3: Let us start with the Euler differential equation 
1 
Py" +ty’—--y=0. 
4 
Although we know from 84.6.2 that this equation has solutions expressed through power functions, it is instructive 


to apply the Laplace transform, which is based on the formulas from Problem 13 of §5.2, page 290. This yields the 
Euler equation in variable 2: 


because Lity’] = —y” + AL[ty] and Lit? y"”] = 2y” — 4AL[ty] + \7 Lt? y]. We seek a solution in the form y’(A) = \*. 
Upon substitution into the differential equation, we obtain k(k—1)+3k+ 2 = 0, which has two solutions ky = —1/2 
and ky = —3/2. The obtained differential equation for y” has two linearly independent solutions y’ = \~!/? and 
on = A~3/2, Applying the inverse Laplace transform (see formula 5 in Table 280), we obtain two linearly independent 


solutions 


ny =£7 | 5] = aH: wl) =e" ssa] - ay tH. 


where [' (4) = Vm and I’ (3) = Vx/2. 
Example 5.5.4: Solve the initial value problem for the Bessel equation of zero order: 
ty” +y'+ty=0, y(0)=1, y/(0)=0. 


Solution. Using the derivative rule (5.2.11) on page 283, we find the Laplace transforms: 


d d dy” 
ty’) = —-— W) Sf y2,,L a L 2 1 
Lity"] elt =a ee A 2 eee ee 
Lly’] = Ay” -1, 
_ d _ d L 
Lity) = —Zflyl=—-qy- 


This allows us to transfer the original initial value problem to the following separable equation for the Laplace 
transform y": 
dy" d dy" 
—2ry" — P= 4+14+Ay’-1-—y"’ =0 = M+ 1) = = ry”. 
o dy ed Pa OF 
Integration yields a solution y’ = (r? + 7 ie its inverse Laplace transform is a special function, called the Bessel 
functions of zero order and denoted by Jo(t). 


For a differential equation a2(t) y”(t) +.a1(t) y’(t) + ao(t) y(t) = 0 whose coefficients are linear in t, it is sometimes 

possible to find a solution in the form 
A2 
y(t) = / v(A) e*' da, (5.5.10) 
A 

where A; and Az are constants and v(A) is a function to be determined. This form (5.5.10) resembles the inverse 
Laplace transform formula (5.4.1) on page 302. 

Let the differential equation of interest be 


(agt + bo) y” + (ait + bi) y’ + (aot + bo) y =0. (5.5.11) 


After substituting (5.5.10) into (5.5.11), it becomes 


A2 d2 
t At — 
if V(A)e* ars [ R(A) V(A) e* dA = 0, (5.5.12) 


1 M1 
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where 
V(A) = (a2d? + aA +. a9) v(A) (5.5.13) 
= bod? + 1A +6 
2 TY¥IAT YO 
= So 4 
R(A) Cea eo (5.5.14) 


Since te** = 4 e*’, integration by parts changes (5.5.12) into 


A=A2 


et V(A) a. 


dA2 
| eM V/(A) — R(A)V (A) dd = 0. 
This equation is definitely fulfilled if V’(A)— R(A)V (A) = 0 and if A; and )2 are nulls of the function V(A). Separating 


the variables in the latter equation, we get 


aad = R(A) dA = InV(A) = [ RO) dd. (5.5.15) 


Hence the function v(A) in Eq. (5.5.10) is expressed as 


ee 
a2 + ayr + ao 


Example 5.5.5: Let us solve the differential equation 
ty” — (2t—3)y’ —4y=0. 
Solution. Formula (5.5.14) gives 


3A—4 


where C is a constant of integration (which we can set equal to 1 because we seek just one of all possible solutions). 
Hence, by Eq. (5.5.15), V(A) = CA?(A — 2) and from (5.5.13), we obtain 


VQ) _¥A=%_, 


ON 5a 2) 


Formula (5.5.10) gives the solution 


1. Determine the initial value problem for which the following inverse Laplace transform is its solution. 

(a) AX-1 : (b) 9A+6 A (c) 3A+5 F (d) 4\+4 

42-441? 9X2+46A41? 9X2 —12d44? 42412049" 

Use the Laplace transform to solve the given initial value problems. 
2. y"+9y=0, y(0) =2, y/(0) =0. 
3. 4y” —4y’+5y=0, y(0) =2, y'(0) =3. 
4. y"+2y’+y=0, y(0)=—1, y’(0) =2. 
5. y” —4y'+5y=0, y(0) =0, y'(0) =3. 
6. y” —y'—6y=0, y(0) =2, y’(0) = 1. 
vd 
8 
9 


. dy" — dy’ +37y=0, (0) =2, y/(0) = -3. 

. y+ 3y’ +2y=0, y(0) =2, y'(0) =3. 

yy +2y'+5y=0, y(0)=1, y/(0)=—-1. 
10. 4y” —12y'+13y=0, y(0)=2, y/(0) =3. 
ll. y’ +4y’+13y=0, y(0)=1, y’(0) = -6. 
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12. y’ +6y'+9y=0, y(0)=1, y/(0) =—3. 

13. y¥) +y¥=0, y(0) =y'"(0) =0, (0) = y'(0) = 1/Vv2. 
14. y” —2y'+5y=0, y(0) =0, y’(0) 

15. y” — 20y'+5ly=0, y(0) =0, y’(0) = -14. 
16. 2y"” + 3y’ +y=0, y(0) = 3, y’ (0) =-1. 

17. 3y”’+8y' —3y=0, y(0) =3, y’(0) = —4. 

18. Qy” 20y’ Sly =0, y(0) = 1, y! 0) =—5. 
19. Ay” 40’ 10ly = 0, y(0) =1, y’ (0) =—5. 


20. y” + Gy’ + 34y=0, y(0)=3, y/(0) =1 
21. yy!” + 8y" +16y’ =0, y(0) =y'(0) =1, y”(0) = —-8 
22. y+ 6y” +13y’ =0, y(0) =y'(0) =1, y”(0) = -6 
23. y'” — 6y"” +13y' =0, y(0) =y'(0) =1, y"(0) =6 
24. y+ 4y"” 4+29y’=0, y(0)=1, y’(0) =5, y”(0) = —20. 
25. y+ 6y"” +25y’=0, y(0)=1, y’(0) =4, y”(0) = —24. 
26. y'” — 6y” +10y =0, y(0)=1, y’(0) =3, y”(0) =8 
! 


a7. y + 13y" + 36y=0, y(0)=0, y'0)=-1, y"(0)=5, yO) = 19. 

The following second order differential equations with polynomial coefficients have at least one solution that is integrable 
in some neighborhood of t = 0. Apply the Laplace transform (for this purpose use rule (5.2.11) and Problem 13 in §5.2) to 
derive and solve the differential equation for y” = L[y(t)], where y(t) is a solution of the given differential equation. Then 
come back to the same equation and solve it using the formula (5.5.10). 


28. y—ty=0; 29. tyt+y+ty=0; 30. ty+(1—t)y+2y =0; 
31. %—2ty+2ay=0; 32. ty—(t-4)y—3y=0; 33. te+(1—2)y—(2t+1)y =0; 
34. tt+y+y=0; 35. tj—(2t-ly—y=0; 36. t#+59—(t4+3)y =0: 


37. ty+3y—ty=0; 38. 2ty+(2t+3)y+y=0; 39. ty—(4t—3)y+ (3t—7)y=0. 


5.6 Nonhomogeneous Differential Equations 
Let us examine a second order nonhomogeneous differential equation with constant coefficients: 
ay” (t)+by'(t) + cy(t) = f(t), 0<t<o, (5.6.1) 
subject to the initial conditions 
y(0)=yo, y'(0) = yo. (5.6.2) 
Equation (5.6.1) together with the condition (5.6.2) is usually referred to as an initial value problem (IVP). We always 


assume that the unknown function y(t) and the nonhomogeneous term f(t) are function-originals, and, therefore, 
their Laplace transforms 


vo= fo y(t) e* dt, ray= fo fije ade (5.6.3) 


exist. From Theorem 4.20 (page 224), we know that the solution of the driven equation (5.6.1) is the sum of a 
particular solution of the nonhomogeneous equation and the general solution of the corresponding homogeneous 
equation. For simplicity, we split the problem under consideration into two problems, namely, 


ay’ (t) + by'(t) + cy(t) = f(t), (0) =9, y(0) =0 (5.6.4) 


and 

ay’ (t) + by'(t) + cy(t) =0, y(0) = yo, y'(0) = yo. (5.6.5) 
Then the solution y(t) of the initial value problem (5.6.1) and (5.6.2) is the sum of y(t), the solution of the problem 
(5.6.4), and y(t), the solution of the problem (5.6.5) for the homogeneous equation, that is, 


y(t) = Yp(t) + yn(t). (5.6.6) 


First, we attack the problem (5.6.4) for the nonhomogeneous equation (5.6.1) because the IVP (5.6.5) was considered 
in the previous section. We know how we would have to apply the Laplace transform to find the solution of this 
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initial value problem. We multiply both sides of Eq. (5.6.1) by e~** 


zero to infinity to obtain 


and integrate the result with respect to t from 


I [ay” (t) + by'(t) + cy(t)] e~* dt = [ five de. 
0 0 


Integrating by parts (or applying the derivative rule (5.2.2) on page 282) and using the notations (5.6.3), we get 
ad yh + bAyt + cy = f¥. 
Solving the above algebraic equation for ae we obtain 
=r CO). (5.6.7) 

where 
- 1 
~ ad +bA+e 
is the reciprocal of the characteristic polynomial. Its inverse Laplace transform G(t) = £~![G*(A)] is called the 
Green function and it describes the impulse response of the system because L [d(t)] = 1. 

Note that the function yp (A) in Eq. (5.6.7) is the product of two functions, f£() and G(X). When the explicit 
formula for f” is known, application of the inverse Laplace transform to y’ = f’ G"(A) produces the solution yp(t) 


of the initial value problem (5.6.4). If an explicit formula for f” is not available, we have no other option than to 
use the convolution rule: 


Cy) (5.6.8) 


uplt) = (fF «G)(t) = f(r) Ge— 7) ar. 65.6.9) 


The Green function G(t) can be defined directly without using the Laplace transformation. Recall that the Laplace 
transform of the 6-function is 1 since 


L{6|(A) = | (Qe t=<." |. =e a1, 


Therefore, the Green function G(t) for the given differential equation (5.6.1) is the solution of the following nonho- 
mogeneous equation: 


aG"(t) + bG'(t) +cG(t) = 6(t). 


Since the equation involves differentiation in the sense of distribution (recall that the d-function is a distribution but 
not an ordinary function), it is more practical to define the Green function as the solution of the following initial 
value problem: 

aG" (t) + bG'(t) + cG(t) =0, G(0)=0, G’(0)=1/a. (5.6.10) 


Therefore, the initial value problem (5.6.5) can be solved using techniques disclosed in §5.5. Application of the 
Laplace transform to the initial value problem (5.6.5) leads to the following algebraic equation for y/, the Laplace 
transform of its solution: 

ad? yk — adyn(0) — ay;,(0) + bAy’ — by, (0) + cyy = 0. 
Solving for yk, we obtain 
i adyo + ays + byo 
A) = 5.6.11 
Un) = aT B\te een 


Any method presented in §5.4 can be used to determine the inverse Laplace transform of yt , which gives the solution 
yn(t) of the problem (5.6.5). We demonstrate the Laplace transform technique in the following examples. 


Example 5.6.1: Find the Green function for the given differential operator 


4p* + 12p? + 5D? -—16D+5, D=d/de. 
Solution. The Green function for this operator is the solution of the following initial value problem: 
4G + 12G"" +5G” —16G'+5G=0, G(0)=0, G’(0) =0, G’(0) =0, G’”" (0) = 1/4. 
The application of the Laplace transform yields 
(4A* + 1243 + 5A? — 16\ +5) G? = 1. 
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Therefore, G’(\), the Laplace transform of the Green function, is the reciprocal of the characteristic polynomial, 
that is, 


G*(y) : 


~ 4+ 1D + BM? — 1614 5° 
The characteristic equation Q(A) = 4+ + 123 + 5A? — 164 +5 = (2\ — 1)?(A? + 44+ 5) = 0 has one double root 
A, = 1/2 and two complex conjugate roots A2,.3 = —24 


t j. Application of the residue method gives us 
d (A-1/2)?e%% 
( /2)*€ + cS 


Git) dy Q(A) a Q’ (Ak) 


eret 


d ert e(-2+i)t 
— ——— + 2k —~—__,, 
dA 4(A2 + 4\ + 5) a=i Q’(-2 +j) 
where it does not matter which complex root Ay, (k = 2,3) will be chosen. Since 
d ee _ ie” (2+ 4) et 
dd 4(\2 +4445) 4.02 +4445) 4(\2 + 4445)? 
and for k = 2,3, 


Q' (Ax) = (Ax — 1)? Gag + 4) = (2A, — 1)? 2 (Ay + 2 


) = (-—4+ 2j — 1)? 2 (4)) 
= (—5 $ 2j)? 2 (4j) = 2(25 — 4 + 20j) (4)) = 2(20 + 215), 
1 7 1 _  20F214j 10 2tj 
(2A, —1)2 (2A, +4) 2(204 21j) ——«-2(212 + 202) 841 * 1682’ 
we have 


_{t 20 -\ t/2 1 _o ‘ 
(i= (5 an)e H(t)+ se (20 cost + 21sint) H(t). 
negative f. 


As usual, we have to multiply this result by the Heaviside function to guarantee vanishing of the Green function for 


On the other hand, using partial fraction decomposition, we obtain 


4 A 
CNHs as ° 


_ 7 z 20A + 61 
~ 29(2X—1)2 841 (2\—1) 


841 (A? + 445) 
Rewriting the last fraction as 


20\+61 _ 20(A+2) +21 


M2 +4445  (A4+2)2? 41 


and using the attenuation rule (5.2.7), we get the same result produced by the residue method. 


Example 5.6.2: Solve y” — y' — 6y = e* subject to the given initial conditions y(0) = 1, y/(0) = —1 
Solution. The application of the Laplace transform gives us 


Lly"] — Lly'] — 6L[y] = Le’). 
We denote the Laplace transform f° e~~ y(t) dt of the unknown function y(t) by y” = Ly. Then from the differential 
rule (5.5.1), it follows that 


Lly’] Ay” — y(0) = Ay” — 1; 


Lly"] = My” — dry(0) — y'(0) = Ny” —A+1. 
The Laplace transform of the function f(t) =e!’ is f£(A) = Lie“] = 


x7 (see formula 6 in Table 280). Therefore, 


OF =r —6)y” = fF +A-2. 


My! —Ab1— Ay’ 41-69" =f? or 
This algebraic equation has the solution y’ = ie + yk, where 


1 A= 5 
L L DL 
My ja—-y-6 2) MO UR ee 
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Hence, we need to find the inverse Laplace transform of each of these functions, ye and yk, where Yp is the solution 
to the nonhomogeneous equation y”” — y’ — 6y = e’ subject to the homogeneous initial conditions y(0) = y'(0) = 0, 
and yp, is the solution of the homogeneous equation y” — y’ — 6y = 0 subject to the nonhomogeneous conditions 


1 
y(0) = 1, y'(0) = —1. If we know the Green function G(t) = £7! | 


woacwe could find y, as the convolution: 


i t 
Yyp(t) =Gx ft) =f*G) = if G(t—7) f(r) dr = | G(r) e®7 dr. 
iw 0 
On the other hand, the Green function can be obtained either with partial fraction decomposition: 


G(t)=£> wos =f" a 7 2 a : [e* — e-?"] H(t), 


or with the residue method: 


At ert 


CO NS ae ag Se 


dM M 
i [e** — e~*"| H(t). 


AD Dheetgs Da aig” 5 


With this in hand, we define a particular solution of the nonhomogeneous equation as the convolution integral; that 
is, 


ae Be 1 1 1 
Yp(t)=Gef= 7 | [ee —er*"| ede = (3 ert + TA e 7 3 ‘) H(t). 


Note that the function yp(t) satisfies the homogeneous initial conditions y,(+0) = 0, y,,(+0) = 0. 
The function y,(t) could also be found in a more straightforward manner since the Laplace transform of the 
forcing function is known: 


y — Lot —_—_—_ SOC — -1 are oo woven 7 
: M-A-6 A-1 (A = 3)(A + 2)(A— 1) 
After application of the partial fraction decomposition method, we obtain 


1 3t 1 t 
ee eer ee ay 
ie aoe ge 


va pd | Se. —— 
Up = Sx ax 61 


1 i ic 4 tego 


The function y;, can be discovered using the residue method, rather painless, that 


7 (A — 2) e** (A — 2)e** 
pe RES Wa er Ve er 
(A — 2) eS (A — 2) et Me ay 3t 
aj. + Ser Pers. ESO 
WEA eg Oe ae 


The function y;,(t) satisfies the homogeneous equation y” — y’ — 6y = 0 and the given initial conditions y(+0) = 1, 
y' (+0) = —1. We check the answer with Maple: 


sol := dsolve({ics, ode}, y(x), method = laplace) odetest(sol, [ode,ics]) 
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5.6.1 Differential Equations with Intermittent Forcing Functions 


In this subsection, we demonstrate the beauty and advantages of the Laplace transformation in solving nonhomo- 
geneous differential equations with discontinuous forcing functions. This topic has many applications in engineering 
and numerical analysis when input functions are approximated by piecewise continuous or step-functions. 


Example 5.6.3: We start with a simple problem for the first order differential equation 
ytay=f),  y(0) =9, 


where a is a real number, f(t), called an input, is a given function; we will refer to the output as the solution of this 
problem. After applying the Laplace transformation, we get the algebraic equation for y”, the Laplace transform of 


the unknown solution y(t): 
1 
L L 


According to the convolution rule (5.2.1), the inverse Laplace transform yields the solution 


hy tay" =f"() = oy 


1 
Ata 


y(t) =£71 | 0) = [ e Ut) f(r) dr. (5.6.12) 


Let us consider a periodic input modeled by function f(t) = sint. Then we get 


1 i 1 
= fob | _— S  .. = —at 
ae besbirsl @+1° ~ a+ 


(a sint—cost), t20. 


Now approximate sint with a step-function so that the total energy (which is the integral) on each subinterval (0, 7] 
and [7,27] remains the same: 


2 1 if0<t<7 
a(t) = — : a(t) = fr(t+2 ’ 
Fr(t) ae iia <t< 27, Fat) = Fal m) 


Expanding f,(¢) periodically with period 27, we rewrite it using the Heaviside function: f,(t) = = >> inf — 1)" 
[H(t — ka) — H(t — ka —7)]. The Laplace transform of f,(t) is 


_ p-aAt 
Llfa) == 7-1)" fe — reer] = 2 LEE tanh (=) | 


mA 1+e>%* wr 


m= 6 [Fe] = DCD alt — be) — 90k) 
where 
gt) = = (1) HO 
1 
5 1 20 
=| 


Figure 5.24: Example 5.6.3: approximation f,(t). Figure 5.25: Example 5.6.3: approximation f,/4(t). 
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Similarly, we can truncate the interval [0,27] into eight subintervals of length 7/4 and approximate sin t on each 
interval keeping the integral unchanged: 


fr/a(t) = hi 42. (-1)* H(t — km) + So (-1 y [a (: a kn H(t-4 kn) | 


k>1 k>0 


+h, H(t) + he S\(-1)' la (+ 4 — kr) H(t - kn) |, 


k>0 


where hy = Ly es sintdt = = 4 2V2 w 0.37 and hy = 2 ef st nid: = ava ~ 0.9. As it is seen from Fig. 5.26(b), 
the corresponding solution y, alt ) approximates the suet solution (5.6.12) for f(t) = sint better than y,(t). 


-0.64 


(a) (b) 


Figure 5.26: Example 5.6.3. (a) The output (black line) y,(¢) to the input (dashed line) and (b) output y,/4(t), 
plotted with Maple. 
Example 5.6.4: Use the Laplace transform to solve the initial value problem 


Vee eee | Oe, § OCtS a, eh ets 
y -¥'-8y=f0={ 5 ei y(0) =1, y') =-1. 


Solution. Following the same procedure as in the previous example, we apply the Laplace transform to the given 
initial value problem to obtain 


y” = yh + UK 
where y” is the Laplace transform of the unknown function y(t) and 
1 0 
def DL _ ie def DL _ 
Ll¥p| = Yp = (Crs ea f and Llynl = on = —A—6" 


The function y;,, was found in Example 5.6.2. We analyze here only the inverse of the function ies Since the Green 
function was defined in Example 5.6.2, we express yp as the convolution integral 


Yp(t) = («n= | G(t—T)f war= f G(r) f(t — 7) dr 


10 i [e3" — e-?"] sin 4(t — 7) dr, if0<t<z7, 
10 {, let" e772)" | sind dy, if 7 <t, 
8 3 __ A sindt +2 cos4t—2e-", if0<t<n, 


8 3! [1 — e387] + 26 [e?™ — 1], ifm <t. 
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However, we can find the solution y, of the nonhomogeneous equation in another way. The Laplace transform of f 
is 


rR) -| e*¢ F(t) a= 50 f e—** sin 4t dt = ao. [1—e7*"]. 
Then 
ogo tae ee ee 
where 
ee 1 200 200 egeese 


M—A-6 M416 (AF2)A—3)02 + 16)’ 


Since o is the difference of two similar expressions, one of which differs from the other only by the exponential 
multiplier, e~*”, it will be enough to find the inverse Laplace transform of one term, namely, xo(t) = £7' Ee 
Of course we can apply the method of partial fraction decomposition, but the procedure to determine coefficients 


in the expansion 
1 200 A B Cr+ D 


W—A-6N+16 A-3 ATID’ WHE 
becomes tedious (unless you use a computer algebra system). 
The most straightforward way to determine the function zo(t) is to apply the residue method because the 

denominator (which we denote by Q(t)) has four simple different roots; that is, 

Q(A) = (A? — A 6)(A? + 16) = (A — 3)(A +. 2)(A + 45) (A — 45). 
The derivative of Q()) is 

Q’(A) = (A + 2)(A? + 16) + (A — 3)(A? + 16) + 2A(A — 3)(A 4 2). 
Hence, the explicit expression for the function 29(t) becomes 


200e7" =200e7%% — 200eN* = 200 e St 


9 = Oey OC * em * VCH 
= ; i aie (2 aT est + 2(2 a em 
= se 26% +X cosdt — = sin dt H(t) 
because 1G = ao = ss (2+ 11j). Note that instead of evaluating residues at two complex conjugate nulls 


\ = +4j, we can use the formula (5.4.8) on page 309. We must multiply the right-hand side by the Heaviside function 
H(t) since x(t) should be zero for t < 0. 
The inverse Laplace transform of the function x’ = 2} e~** can be found with the aid of the shift rule (5.2.6) 
on page 282; namely, 
a,(t) = L7* [al e*"] = x(t — 7) H(t — 7/). 


Therefore, the solution y,(t) of the nonhomogeneous equation with the homogeneous initial conditions is 


yp(t) = ao(t)—2,(t) = vo(t)H(t) — wo(t — 7) H(t — 7) 
_— E ext rem H(t) E e3t—37 Qe 2tten H(t _ T) 


+ E cos 4t — < sina [H(t) — H(t—7)]. 


Example 5.6.5: Use the Laplace transform to solve the initial value problem 


y” +4y = f(t) = H(t-— 7) -— H(t-—37), y(0)=0, y’(0) =0. 
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Solution. Since the input function f(t) is the difference of two shifted Heaviside functions f(t) = fi(t) — fo(), 
with f; = H(t—7) and fo = H(t — 37), we apply the Laplace transform and obtain 


Ny” + 4y” = f = ft — FY, 


where y” = Ly], fi =Lfi, fo = Cfo, and 


eo) ie) a l 
fr) = i; f(t) ee / en dt = 7 edt = x le" — aa . 


By straightforward algebraic operations we find the Laplace transform of the unknown solution to be 
1 1 1 
L L L. ¢k —Ar _ ,—3\0 L_ok 
\) = ——.. fl =G". fe = —— . = = =yl— 
y (A) aa f N+4 x le € | YW Y2 > 


where yh = A71(A? + 4)~1e79* and yF = \~1(\2 + 4)~te-%™. Let us introduce an auxiliary function g(t), which 
we define through its Laplace transform: 
ee 
J N08 +4) 


Using the residue method, we find the explicit formula for g(t): 


At At 


e e 
= _—_ 2 __ 
ay = ea ee) 


1 erst (; 1 ) 

—-+2R——— = ( —— = cos2t ) H(2). 

i*’® apap aa @ 

Since the Laplace transform of the unknown solution y’(\) is expressed through g"(A) as 
y (A) = g"(A)e*™™ — g*(A) e-™*, 


the above expression calls for application of the shift rule (5.2.6) on page 282. This yields 


(1 — cos 2t)[H(t — 7) — H(t — 3n)], 


Ble 


y(t) = g(t — 7) — g(t — 37) = 


and yi(t) = g(t — 7), yo(t) = g(t — 3m). We check this answer by substituting y(t) into the given equation and the 
initial conditions. First, we need to find its derivatives (using the product rule): 


yo = 7(1 — cos 2t)/|H(t — 7) — H(t — 37)] 4 : (1 — cos 2t)[H'(t — 7) — H’(t — 37)] 
- 5 einen Hea) : (1 — cos 24)[6(t — m) — 6(t — 3m) 
2 5 sin 2¢ [H(t — m) — H(t —3n)| + : (a ee : Hi ecseneGBn) 
Z 5 sin 2t [H(t — m) — H(t —3m)], 

gf ; (sin 2)! [H(t — r) — H(t —3m)] + 5 sin 2t [H’(t — r) — H'(t — 3m)] 


= cos2t|H(t— 7) — H(t — 37)| + = sin 2¢ [d(t — 7) — 6(t — 32)] 


Z 
2 
because H’(t — a) = 6(t — a) and f(t) d(t — a) = f(a)d(¢ — a) for any a > 0 and smooth function f (see Eqs. (5.3.5) 


and (5.3.7) on page 299). Then we substitute these derivatives into the given differential equation to verify that it 
is satisfied. Since the function y(t) = yi (t) — ya(t) is identically zero for t < 7, the initial conditions follow. 
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Example 5.6.6: Solve the initial value problem with intermittent forcing function 


0, ee eer 
t — 2, 2<t<3, 

wb + 8y=fOH=) Gia geeca viO=9, ¥'(0) =2. 
0, 4<t<om, 


Solution. We rewrite the function f(t) via Heaviside functions: 


f(t) = (¢— 2) [H(t — 2) - H(t - 3)] - ¢- 4) [H(t - 3) - H(t - 4)] 
= (t — 2)H(t — 2) — 2(t - 3) H(t — 3) + (¢- 4) H(t — 4). 


Using the shift rule (5.2.6), page 282, its Laplace transform becomes 
1 
~ 2 


The application of the Laplace transform to the given initial value problem yields the algebraic equation (A? — 5A + 
6)y" —2 = f” with respect to y’ = Ly. Hence, the Laplace transform of the unknown function y(t) is 


le=* Se aes oP | : 


P(X) 


2 - ft" 
M2—-5A+6 A2—5A+6°- 


Now we apply the inverse Laplace transform to obtain 


y(t) = yrnlt) + y(t), 


yA) = 


where 


n=" era] = ay] =P OHO. 


yi =o- SSeS = oO =3) jeqs" fe 4 =| . 


aX 


According to the shift rule (5.2.6) on page 282, multiplication of the image by e~%* causes shifting its function-original 


by a. Let g(t) be the function defined by the inverse Laplace transform: 


g(t) =£" pases | 


Since the denominator, \?(\ — 2)(A — 3), has one double null A = 0 and two simple nulls \ = 2 and \ = 3, we find 
its inverse Laplace transform using the residue method to be 


At At ert 


e e 
t) = Res ————___ ee = =, 
ot) = Res ay ayn a) + BO aos) + POSTS) 
From formula (5.4.6) on page 308, we get 
e 


At d ert 
“1 MODs) ak (= —DA— 5) 


seg 8S (27 — 5A +6) en: 


Residues at simple nulls are evaluated with the help of Eq. (5.4.5): 


ert ert 1 
Res =~ = => = —-e, 
2 A2(A—2)(A-—3) =A2(A—3)|,_. 4 
M Mt 
ey ee as e, 
3 M(A-—2)(A—3) MWA-2)I|,_5 9 
et . t+5 1 y 1 x . : 
Combining all these formulas, we obtain g(t) = reg + 9° H(t). Therefore, the required solution be- 


comes 
y(t) = ya(t) + g(t — 2) — 2g(t — 3) + g(t — 4). 
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Example 5.6.7: A mass of 0.2kg is attached to a spring with stiffness coefficient 104newtons per meter. Suppose 
that at ¢ = 0 the mass, which is motionless and in equilibrium position, is struck by a hammer of mass 0.25kg, 
traveling at 40m per sec. Find the displacement if the damping coefficient is approximately 0.02 kg/sec. 

Solution. The equation of motion of this mechanical system is modeled by 


my” + cy’ + ky = po(t), 


where m = 0.2 is the mass of the object, c = 0.02 is the damping coefficient due to air resistance, k = 104 is the string 
coefficient, and p is the momentum of the hammer to be determined. For more details, we recommend consulting 
[14]. 

The momentum of the hammer before the strike is pp = mv = 0.25 x 40 = 10 kilogram meter per second. The 
law of conservation of momentum states that the total momentum is constant if the total external force acting 
on a system is zero. Hence po = Pw + Ph = MwVw + Mvp, where Pw, Vw and pp;, vp denote the momenta and 
velocities of the mass and hammer, respectively. The law of conservation of energy is also applicable: the total 
kinetic energy of the system after the hammer strikes is equal to the kinetic energy of the hammer before it struck. 
So mu? = mv? +mMyv?2,, where vp, and vy» denote the velocity of the hammer and the mass after the strike. Excluding 
vp, and vy from the above two equations, we obtain 


2Mwy qd MmM—- My 
Ww eu" an h, = . 
Pp mela” Pr Tae 
The momentum imparted to the mass by the collision is 
_ @my — 2x02 ee 
Pu m+my?  02+0257° 045° 45. 9" 


Thus, the displacement y satisfies the IVP 
1 / 4 80 / 
0.2y" + 0.01 y' + 10 y= 9 O(t); y(0) = y/(0) = 0. 


Taking the Laplace transform, we get 


o 400/9 400/9 _ 0.2 x 223.6 


Yo 3+ 0IA+5x 102 (0 +0.05)2 +5 x 10?— 25x 10-4 (A+. 0.05)2 + 223.62" 


It follows that the displacement of the mass is approximately y(t) © 0.2 e~ °°! sin(223.6t), t > 0. 


Problems| I all problems, D = d/dt stands for the derivative operator, while D°, the identity operator, is omitted. 


1. Obtain the Green function for the following differential operators. 
(a) D?+2D4+5; (b) D?4+2D4+1; (c) 2D?+D-1; 
(d) D?+6D+10; (e) D?+5D+6; (f) (D?—4D+20)?; 
(g) D?+2D4+17; (h) 6D?+5D+4+1; (i) D*+1; 
(j) 4D?+7D+3; (k) 2D?+5D-3; (1) 4D°—D. 


2. Solve the initial value problems using the Laplace transformation. 
(a) ¥+2y+ 3y=9t, y(0)=0, ¥(0) =1. 
(b) 4y+16y+17y=17t-—1, y(0)=-—1, y(0) =2. 
(c) §+5y+4y=3e%, y(0) = —-y(0) =-1. 
(d) ¥-4y+4y=t?e", y(0)=1, y(0) =2. 
(e) y+9y=e™, y(0) = -4, 9(0) = a: 
(f) 2y-—3y+17y=17t-1, y(0) =—1, 9(0) =2. 
(g) y a, 2y ee y= e*, y(0) = 1, y(0) =—-l. 
(h) f-2t+5y=2+t, y(0)=4, HO)=1L (i) Wey=e"?,  y(0)=-1. 
(j) y+8y+20y=sin2t, y(0) =1, y(0) = —4. 
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(n) 3§+5y—2y=7e**, (0) =3, 9(0) =0. 


3. Use the Laplace transformation to solve the following initial value problems with intermittent forcing functions (H (é) 
is the Heaviside function, page 274). 


(a) y +y=H(t)-H(t-2), yO)=1; (b) y'-2y=4t[H(t)- H(é-2)],  y(0) =1; 
(c) Pines 24 sint[H(t)+ H(t—7)], y(0)=0, y’(0) =0; 

(d) y"+2y/’ A(t)—H(t—1),  y(0)=1, y'(0) =-1; 

(e) pals ae cost[H(t) — H(t—7/4)], y(0) =1, y’(0) = -1; 

(f) y" +5y' + 6y = 36t[H(t)—- H(t—1)],  y(0)=—1, y'(0) = -2; 

(g) y” +4y’ + 13y = 39H (t) — 507(t — 2) H(t— 2), y(0) =3, y/(0) = 1; 

(h) y” + 4y = 3[H(t) — H(t— 4)] + (2¢-5)H(t—4), (0) = 3/4, y'(0) = 2; 

(i) 4y" + 4y’ + 5y = 25t[H(t) — H(t — 7/2)], (0) =2, y'(0) =2; 

Gj) y” + 4y' + 3y = H(t) - H(t—1)+ H(t—2)—- H(t—3),  y(0) = —2/3, y(0) = 1; 


4. Use the Laplace transform to solve the initial value problems with piecewise continuous forcing functions. 


he Hepes y, 
() w-W=1 9 ost y(0)=-6, (0) =1. 
0, ote i, 
(b)  y” —3y'+2y=4 1, Ie 2; y(0) =3, y’(0) =-1. 
i, $30, 
1, 0<t<2, 
(c) W439! +2y=f ai eS y(0)=0, y'(0)=0 
t, O<t<n7, 
(a) ytu={%, 0S! y(0) =0, y'(0) =0. 
Si USS, 
(e) of tay={ ee SESE yoy=0, vO) =0 


5. Solve the IVPs involving the Dirac delta-function. The initial conditions are assumed to be homogeneous. 


(a) y+ (20)? y = 36(t — 1/3) — 6(¢ — 1). (b) yy” +2y' + 2y = 36(t — 1). 
(c) yy’ +4y' + 29y = 5d(t m2) —5d(t- 27). (d) y+ 3y’+2y =1-6(t—-1). 
(e) 4y”+4y' +y=e 7 6(t—1). (ff) y” ~ Ty! + 6y = 5(t 1). 


1 
6. Consider an RC-circuit and associated equation Rq + G q(t) = E(t), where E(t) is the electromotive force function. 


Use the Laplace transform method to solve for the current I(t) = q and charge q(t) in each of the following cases 
assuming homogeneous initial conditions. 


(a) R= 10ohms, C = 0.01 farads, E(t) = H(t — 1) — H(t — 2). 
(b) R= 20ohms, C = 0.001 farads, E(t) = (t— 1)[H(t — 1) — H(t — 2)] 4+ (8 — t)[H(t— 2) — H(t— 3). 
(c) R=1ohm, C = 0.01 farads, E(t) = t]H(t) — H(t — 1). 
(d) R= 2ohms, C = 0.001 farads, E(t) = (1 — e~*)[H(t) — H(t — 2)]. 
7. Consider an LC-circuit and associated equation LI + = a(t) = E(t), where E(t) is the electromotive force function. 


Assuming that q(0) = 0 and I(0) = 0, use Laplace transform methods to solve for I(t) = q and q(t) in each of the 
following cases. 


(a) L=1henry, C = 1farad, E(t) = H(t — 1) — H(t — 2). 
(b) L=1henry, C = 0.01 farads, E(t) = (t — 1)[H(t — 1) — H(t — 2)] + (8 -t)[H(t — 2) — H(t - 3)]. 
(c) L = 2henries, C = 0.05 farads, E(t) = t]H(t) — H(t — 1)]. 

(d) L =2henries, C = 0.01 farads, E(t) = (1 — e~‘)[H(t) — H(t — x)]. 


8. Consider an RL-circuit and associated equation LI + RI = E(t). Assume that I(0) = 0, that L and R are constants, 
i. OSiek 


Solve for [(E). 
he De, 


and that E(t) is the periodic function whose description in one period is E(t) = 
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10. 


11. 


12. 


13. 


14. 


. Suppose that a weight of constant mass m is suspended from a spring with spring constant & and that the air resistance 


is proportional to its speed. Then the differential equation of motion is my + cy + ky = f(t), where y(t) denotes the 
vertical displacement of the mass at time ¢ and f(t) is some force acting on the system. Assume that c? = 4mk and 
sint, O<t<q, 


that y(0) = y(0) = 0. Find the response of the system under the following force: f(t) = ‘ Zi 
; TK<t. 


Suppose that in the preceding mass-spring problem m = 1, c = 2, k = 10; and y(0) = 1, y(0) = 0; and f(t) is a periodic 
10, O0<t<q, 
T 


Find y(t). 
eri t <2n. ag) 


< 
function whose description in one period is f(t) = = 
A weight of 2kilograms is suspended from a spring of stiffness 5 x 10* newtons per meter. Suppose that at t = 2 the 
mass, which is motionless and in its equilibrium position, is struck by a hammer of mass 0.25 kilogram, traveling at 20 


meters per second. Find the displacement if the damping coefficient is approximately 0.02 kg/sec. 


The values of mass m, spring constant k, dashpot resistance c, and external force f(t) are given for a mass-spring-dashpot 
system: 


(a) m=1,k=8,c=4, f(t) = H(t) — H(t— 7). 

(b) m=1,k=4,c=5, f(t) = H(t) — H(t —- 2). 

(c) m=1,k=8,c=9, f(t) =4sint[H(t) — H(t—7)]. 

(d) m=1,k=4,c=4, f(t) =t[A(t) — H(t — 2)}. 

(e) m=1,k=4,c=5, f(t) is the triangular wave function with a = 1 (Problem 11, §5.2, page 290). 


méi+cét+ka = f(t), x(0)=0, «(0) = v0. 


Solve the initial value problems for differential equations of order higher than 2. 


(a) yy" +y"+4y'+4y=8, y(0) =4, y/(0) =—3, y”(0) = -3. 


(b) y'” — 2y"” —y'+ 2y=4t, y(0) = 2, y’(0) = -2, y”(0) = 4. 
(c) y” —y" +4y' —4y =8e"* —5e', y(0) =2, y/(0) =0, y(0) =3. 
(d) y” —5y"”+y' —y=2t-10-#, (0) =2, y'(0) =0, y"(0) =0. 


Use the Laplace transformation to solve the following initial value problems with intermittent forcing functions (H(t) 
is the Heaviside function, page 274) for the differential equations of the order higher than 2. 


(a) y — 5y" + 4y = 12[H() — A(é—-1)], y(0) =0, y'(0) =0, y”(0) = 0, y”(0) = 0; 
(b) y — 16y = 32[H(t)-— H(t—7)], y(0) =0, y’(0) =0, y(0) =0, y'”(0) = 0; 

(c) (D-1)*y =6e* [A(t)— H(t-1)], y(0) =0, y'(0) =0, y"(0) =0; 

(d) (D° —D)y = H(t)— H(t—2), (0) =0, y/(0) =0, y"(0) =0. 


Summary for Chapter 5 


Li, 


The Laplace transform is applied to functions defined on the positive half-line. We always assume that such functions 
are zero for the negative values of its argument. In Laplace transform applications, we usually use well-known functions 
(such as sint or t?, for example) that originally are defined on the whole axis, but not only on the positive half-line. 
That is why we must multiply a function by the Heaviside function to guarantee that such a function is zero for negative 
values of the argument. 


. For a given function f(t), t > 0, and some complex or real parameter X , if the indefinite integral ie f(t)e~ dt exists, 


it is called the Laplace transform of the function f(t) and denoted either by f” or L[f]. The imaginary part of the 
parameter 2 does not affect the convergence of the Laplace integral. If the integral converges for some real value A = pu, 
then it converges for all real \ > y as well as for complex numbers with a real part greater than ju. 


There is a class of functions, called function-originals, for which the Laplace transformation exists. The Laplace 
transform establishes a one-to-one correspondence between a function-original and its image. Each function-original 
possesses the property (5.1.8), page 277, that is, at each point t = to, a function-original is equal to the mean of its 
left-hand side and right-hand side limit values. 
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4. A computer algebra system is useful for a quick calculation of Laplace transforms. For example, Maple can find the 


Laplace transform of the function f(t) = e”* sin(3t) in the following steps: 
with(inttrans): f:=exp(2*t)*sin(3*t) ; 
F:=laplace(f,t,lambda); F:=simplify(expand(F)) ; 
Mathematica can do a similar job: 

f = Exp[2*t]*Sin[3*t] ; 

F = LaplaceTransform[f,t,lambda] 

Simplify[Expand[ F ]] 

When using Mazima, we type 
laplace(exp(2*t)*sin(3*t) ,t, lambda) 

Application of Sage yields 

t,lambd,a = var(’t,lambd,a’) 

f = exp(a*t)*sin(3*t) 

f.laplace(t, lambd) 

3/(a°2 - 2*a*lambd + lambd*2 + 9) 


Below are the most important properties of the Laplace transformation: 


1° The convolution rule, Eq. (5.2.1), page 282. 

2° The differential rule, Eq. (5.2.2), page 282. 

3° The similarity rule, Eq. (5.2.5), page 282. 

4° The shift rule , Eq. (5.2.6), page 282. 

5° The attenuation rule, Eq. (5.2.7), page 282. 

6° The integration rule, Eq. (5.2.8), page 283. 

7° Rule for multiplicity by t”, Eq. (5.2.11), page 283. 

8° The Laplace transform of periodic functions: If f(t) = f(t +w), then 


fi) =—— [ e™ F(t) dt. (5.2.13) 


~ [—e-¥A 


The property 
Hs s) = [Far 


can be used to obtain an antiderivative of a given function f(t), namely, 
: 1 
[ soar=co [Ero]. 
0 aN 


The full-wave rectifier of a function f(t), 0 <¢< T is a periodic function with period T that coincides with f(t) on 
the interval [0, T]. 

The half-wave rectifier of a function f(t), 0 <t < T is a periodic function with period 2T that coincides with f(t) 
on the interval [0,7] and is zero on the interval [T, 2T]. 


The Dirac 6-function is a derivative of the Heaviside function, namely, 


im eal eRe S|, 
eoO0 € 
1 a+e 
where the limit is understood in generalized sense as / 6(t — a) f(t) dt = lim — f(t) dt = f(a) for every smooth 


esoO0 € ai 


integrable function f(t). 


8. The Dirac delta-function has two remarkable properties: 6(t — a) x f(t) = f(t — a) and L[6](A) = 1. 


We present three methods to determine the inverse Laplace transform, £~', of rational functions P(A)/Q(A), where P(\) 


and Q(A) are polynomials or entire functions. 


A. The Partial Fraction Decomposition Method is based on the expansion of P(A)/Q(A) into a linear combination 


of the simple rational functions 
1 A\+B 


t-a Gap 
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or their powers. Table 280 suggests that 


co A\+B Aa + 8 
(A= ays 6? B 
where H(t) is the Heaviside (see Eq. (5.1.5) on page 274). The rule (5.2.11) of multiplicity by t” may be used to find 
the inverse Laplace transform for powers of simple rational functions. 


= [4 e™ cos Bt + e“ sin Bt| H(t), 


B. Convolution Theorem 
LO* {Fi (A) Fa(a)} = (fi * fa) (0); 
where F(A) = L[fi] = fi’ and Fo(A) = L[fe] = fr’. 
C. The Residue Method. For a function F(A) = P(A)/Q(A) which is a fraction of two entire functions 


F(t) =£ FO) = D [Res F)e™, 


At 


where the sum covers all zeroes ; of the equation Q(A) = 0. The residues Res), F(A)e*" are evaluated as follows: 


If A; is a simple root of Q(A) = 0, then 


Res F(A)e*’ = er. 
Aj 0) Q'(Aj) 
If A; is a double root of Q(A) = 0, then 
Mo d 42 Mt 
Res F(A) = im ay {(a AS FiAle \ 


In general, when ; is an n-fold root of Q(A) = 0, then 


fa rojo Ss te x lai {0 — AORN a 
dj AA; (1 — 1)! dAn-1 : ‘ 


9. To check your answer, you may want to use a computer algebra system: 
Maple: with(inttrans): invlaplace(function, lambda, t); or 
with(MIM): ilaplace(function(lambda), lambda, t); 
Mathematica: £ = InverseLaplaceTransform[function,lambda,t] // Expand 
Maxima: ilt(function,lambda,t); 
Sage: f.inverse_laplace(lambd,t) 


Although the ideas of Chapter 4 could be used to solve the initial value problems, Laplace transform provides a convenient 
alternative solutions strategy. Applying the Laplace transform to the initial value problem for second order homogeneous 
linear differential equations with constant coefficients 


ay" (t)+by'(t)+ey(t)=0, = y(0) = yo, (0) =m 


includes the following steps: 


1. Multiply both sides of the given differential equation by e*! 


to infinity. This yields the integral equation. 


and then integrate the result with respect to t from zero 


2. Introduce the shortcut notation for the Laplace transform of the unknown function, say y”. 


3. Integrate by parts the terms with derivatives y’ and y” or use the differential rule (5.2.2). This leads to the algebraic 
equation with respect to ye: 


(ad* + bA + c)y” = a(Ayo + y1) + byo- 


; ; ; Ayo + y1) + byo 
4. Solve this algeb ‘ci y ost Ope Ss 
Solve this algebraic equation for y” to obtain y” (A) at ee 


5. Apply the inverse Laplace transform to determine y(t). 


A similar procedure is applicable to initial value problems for n-th order differential equations. 
The application of the Laplace transform to initial value problems for nonhomogeneous differential equations 


ay" (t) + by'(t) + cy(t) = f(t), (E> to) y(to) = yo, y'(to) = yo (*) 
or, in the general case, 
def n n-— n-1 
Liy] = any +--+ a0y =f, y(to) =o, «5 9 (to) = 90", (**) 
with linear constant coefficient differential operator L of the n-th order and given function f, consists of the following steps. 
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1. Set to = 0 by making a shift and consider the initial value problem for this case. 


2. Apply the Laplace transform using the differential rule (5.2.2); that is, substitute into Eq. (*) or Eq. (**) the algebraic 
expression 


Lily} = \ryk _ yF-ly, yea) a aes yo 
for the k-th derivative of unknown function y‘”). The result should look like this: 
LOA)y" = f* + San [Ayo + PU? + — yh] . 
k=0 
3. Divide both sides of the last algebraic equation by the characteristic polynomial L(A) = anX” +--+ + a0 to obtain 
Y= Vp +n 
where 
yeA)=G"(A)-f" with G’O)=—. 
the Laplace transform of the Green function, and 
L 1 . k-1 k—-2. (1) 0) 
A) =— |X aN tee ‘ 
uh) = Ty (son yo + *y3” + — yp 


4. Determine y;(t), the solution of the homogeneous equation with the given initial conditions, as the inverse Laplace 
transform of yx (A). For this purpose, use partial fraction decomposition or the residue method. For details, see §5.4. 


5. To find y(t), the solution of a nonhomogeneous equation with homogeneous initial conditions (at ¢ = 0), there are two 
options: 


(a) Determine the explicit expression f” of the Laplace transform of the function f(t) by evaluating the integral 


saya [roe ar 


For this purpose we can use any method that is most convenient, for instance, Table 280 can be utilized. 


With this in hand, find the function yp(t) as the inverse Laplace transform of the known function yf (see §5.4). 


(b) If the explicit expression f® is hard to determine, use the convolution rule. First, find the Green function 
G(t) = £7'[G"] (again use the methods of §5.4). Then apply the convolution rule to obtain 


w(t) =f Ge-nF()ar= f clot one 


6. Add the two solutions yp(t) and yp(t) to obtain the solution of the given initial value problem with initial conditions 
at t = 0. 


7. Return to the original initial conditions at t = to by using the shift rule (5.2.6). Namely, let z(t) be the solution of the 
given initial value problem with the initial conditions at t = 0. Then the solution y(t) for the same problem with the 
initial conditions at t = to is y(t) = H(t — to)z(t — to). 


Review Questions for Chapter 5 


Section 5.1. 


1. Find the Laplace transform of the given functions. 


t 

@ fo-{% 333 m f0={3 $33 oe 
Ls 0<t<l, ie 0<t<l, 

(d) f)=4 &, 1<t<2, (e) f)=¢ 1, LatS 2, 
0, t > 2; (t— 1)’, t>2. 


2. Let a be a positive number. Find the Laplace transform of the given functions. A preliminary integration by parts may 
be necessary. 


(a) e* sinhat; (b) e™ cosht; (c) Vti+t?; (d) (t¢+1)’; 


(e)  sin(2t) cos(3t); (fF) sinh? 2¢; (g) 4439/2 — 8¢5/?, (h) cosh 3¢ sinh 4¢; 
(i) [sint + cos¢]?; (j) sintcosh(2t); (k) (t+1)? cos3#; (1) Vte™. 
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ie) 


. If f(t) and f’(t) are both Laplace transformable and f(t) is continuous on (0, 00), then prove that im AF” (A) = f (+0). 
—0o 


4. Find £ ler sin t] by performing the integration ie (e* sin t) e™ dt. 
5. Which of the following functions have Laplace transforms? 

(@) gio (b) Ap (© Les @) 222; () 3 () e&®. 
6. Which of the following functions are of exponential order on [0, 00)? 

(a) Ini+ry (by) #7: (c) tln¢; (d) t* cost; 

(e) emt, (f) sin(t2); (g)  sinh(t2); | (hh) cos (c*). 


Section 5.2 of Chapter 5 (Review) 

1. Find the Laplace transform of the following functions. 
(a) sin?t; (b) cos*t; (c) sin(t+a); (d) |sinat|; (e) ¢t? sinat; (f) t* cosat. 

2. Find the Laplace transform of the given periodic functions. 
@fO=(9 gore, f0+9=10; @& fo-[$ TEES  fe+9=10 
(c) ft)=1-t, f0<t<2, f(t +2) = f(t). 


(d) f(t) = sin(t/2), O<t<2n, 


d t+ 47) = f(t) for allt >0. 
0, 27 <t<4r, — A m= eee 


3. Find the Laplace transform of the given functions. 
(a) « sin 2¢; (b)  (t—3) cos 2t; (c) t?+3cos3t; 
(d) 2¢?cosh3t; (e) S(?-2t+5)e"; (f) [t+cos¢]’. 
4. Use the attenuation rule and/or the similarity rule to find the Laplace transformations of the following functions. 
(a) te; (b) Pe) VER 
(d) ec“ cosBt; (e) te (Ff) eo % (4? 4+3t-6). 


5. Find the Laplace transform of the following integrals. 


t t t 
(a) / t cosh 2t dt; (b) / e”* sin 3t dt; (c) F te** sin t dt; 
9 t Oat ae t 
(d) e * | tsin5tde; (e) ef t cos 2t dt; (f) =| e’ sin 2t dt; 
Ont it 
(g) te ** / a (e’ sin 3t) dt; (h) / te * sintdt; (i) a e’ cosh 3¢ dt. 
o dt 0 dt Jo 


6. Compute the indicated convolution product (H is the Heaviside function) 
(i) by calculating the convolution integral; 
(ii) using the Laplace transform and the convolution rule (5.2.1). 
(a) t* H(t—1); (b) (#-—1)* H(t-1); 
(c) t"* H(t—a); (d) [H(t— 2) — H(t— 3)] *[H(t—1) — H(t 4)]; 
(e) H(t—1)*H(t—3); (f) [H(t)— H(t—3)]*[H(t-1)- HE — 2); 
(g) ? sint* H(t—a); (h) 2sint*sint[H(t) — H(t—7)j. 


7. Find the convolution of the following functions. 


(a) ae xt; (b) ¢?x¢; (c) #2 *t?; 
(d) *(t +1)’; (e) tee: (f) tx sinh dt; 
(g) cot * ny (h) exe", afb; (i) et xe™ 
(j) cost*xe™; (k)  sinat *¢; (1) cosat * ¢t; 
(m)  sinatxcosat; (n)_ sinat «sin dt; (o) e' *cosht; 
(p) e 'xcosht; (q) tr xt™ *t*; (r) sinat * sin at * cos at. 


8. Suppose that both f(t) and fi f(r) dz are Laplace transformable, where a is any nonnegative constant. Show that 


el f se )ar| = $ £11) - xf Je~*" dr. 


9. Let f(t) = |t| be the floor function (the greatest integer that is less than or equal to t). Show that 


L\t| = > . (e™ _ ene) : 
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10. 


11. 


12: 


13. 


14. 


Review Problems 


Find the Laplace transform of the exponential function e’ by integrating term by term the Maclaurin series e' = ons can 
and adding the results. Hint: The sum of the geometric series is ae, OS 


Find the Laplace transform of the function cos(2V)/7t by expanding cosine into the Maclaurin series, cosu = 


n 


ease al” a and integrating term by term. 


Let x(t) be a piecewise continuous function. Show that the unweighted average of x(t) over the interval [t — a,t+ a] 


can be written as a 


— a(t+7)dr = 


—a 


1 

7a [((H «x x)(t+a)—(H*2x)(t—a)]. 

Give a formula for the convolution of n Heaviside functions: H « H *«...* H. 
— 


Evaluate each of the following convolutions using the Laplace transformation. 
(a) txcosat; (b) txsinat; (c) ¢?*sinh3t; (d)  ¢°®*cosh3¢. 


Section 5.3 of Chapter 5 (Review) 


1. 


Sketch the graph of the given function and determine whether it is continuous, piecewise continuous, or neither on the 
intervalO <t <4. 


2t, 0<t<2, i-*, 0<t<1, 
(a) f(t)=<¢ 4-t, 2<t<3, (b) f@)=¢ ((-1)', 1<t<3, 
3, 3<t<4 1, 3<t<4 
i 0<t<2, t, 0<t<2, 
(c) ft=¢ 4, 2<t<3, (d) f@=4 5-t, 2<t <3, 
5-t, 3<t<4 t-1, 3<t<4. 


2. Write each function in terms of the Heaviside function and find its Laplace transform. 


2, O<t<4, 0, O<t<3, 
@ i=) >. po: O) 705) as, 

ty, O<7¢<1, _f 0, O0<t<3, 
omy a iOa, > 7 y 

_ f sin2t, O<t<nd, _ f 0, 0<t<3n/4, 

©) fo={ § t>n. (f) f= { dean, t> 3/4. 

2 Ox<t<1, 1, Dete 
(gs) f=4 0, 1<t<2, (h) f@)=4 t 1<t<2, 

a) a . Det. 


3. Find the convolutions: 


(a) (e — 2) * 6(t — In3); (b) sin(t) * 6(¢ — 3); (c) H(t—1) * 6(t — 3). 


4. Find expressions for the functions whose graphs are shown in Fig. 5.27, 5.28 in terms of shifted Heaviside functions, 


and calculate the Laplace transform of each. 


Figure 5.27: Problems 4(a) and 4(b). 


Section 5.4 of Chapter 5 (Review) 


1. Find the inverse Laplace transform of the given functions. 


A+4 4 A+1., A+1. 65. ¥ 
(a) xin (b) XE (c) xp) (d) Gym nay 
(e) a3) (f) D332? (¢) swipe «6((h) SS 


: 30\ 
(i) syztexcs- 
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Figure 5.28: Problems 4(c) and 4(d). 


2. Find the inverse of each of the following Laplace transforms. 


2 d\+a)2 =3 2 
(a) Inset; (b) nS, (ce) mast; (d) arctansh; (ce) ngAis: (f) In(1+4). 
3. Use the convolution rule to find the inverse Laplace transforms. 
6» i 1 : 7 . 
(a) VF? (b) r2(A—1)? (c) (X—2)(A+5)? 
(@) xq2gr ©) mis 0 wea 
XA—3)2? O—2)2(AF2)? O—920A4F1)2? 
(g) TAEDA (hh) em i) COLESITOSEE ye 
: 17 : k 162 2 1 2 i 
@ apo © wep () xoean 
6d? gas 82+246 
(m) wry (2) or (0) Brox 


4. Apply the residue method and then check your answer by using partial fraction decomposition to determine the inverse 
Laplace transforms. 


37A—74 8(A+1) B(A+1) 
(a) (42 +1)(A2—9)* (b) (x2 —4A+5)(A2 4A 13)" (c) seq: 
(d) 28—2(A—3)(A-1) (e) 3A244A49 (f) 3A-7 
AFD A—3)A+F4)* (\2+2A+5)(A4+1) ° X2-+8A+13° 
(g) 3A+1 (h) 3-A-)OAt)) (i) 2A+3 
N24" O+4)A—-DA—2)* (Q-1)2+4° 
5. Use the convolution formula to find the inverse Laplace transform of each of the following functions. 
C246 = Oo. © 
241° +4" ¢ (A —2)2° M24+2A42° © 


6. Show that for any integer n > 1 and any a £0, 


“1 A Bf Ast r 
. laces al later | a 


7. Use the attenuation rule (5.2.7) to find the inverse Laplace transforms. 
3 6(A + 2) aN 


ees by. eS ee 
2) VFA +T (b) OFF D+ 13) (<) FORTS 
d ——__—_.. — f os ee 
ia) a? + d+ 10 (ec) 33 Fon p29 () DN? + GA+ 10 
eo es, YG eee ee, eee 
@ pera © oon) 63) =U 


8. Using the shift rule (5.2.6), find the inverse Laplace transform of each of the following functions. 


2e~? 2re7 2% 


(a) 3h (b) 3x: (c)  2AUEBAR HA eA, 

(d) OPO ae (e) SLO eee (f) <a + | ew. 
(2) wth: (hb) gia (lte™). @) py l-e). 

G) yag(e™-e°"). (k) >. () aftae™. 


9. Use the convolution theorem to find the inverse Laplace transform of the given functions. 
r a 
ST b; b) —— 
(a) (2 + a2)(02 + B2) a # b; (b) (2 + a2)? 
Section 5.5 of Chapter 5 (Review) 
1. Solve the initial value problems by the Laplace transform. 


(a) ¥+4y=0, y(0)=2, 910) =2. (b)¥+y+y=0, y(0) =1, y(0) =2. 
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(c) ¥+4y+3y=0, y(0) = 3, yO) =2. 

(d) y+ 4y+13y=0,  y(0) = 1, y(0) = -1. 

(e) g+3y+2y=0, y(0) =1, y(0) = 2. 

(f) $+ 6y+18y=0, y(0)=2, y/(0)=-3. 

(g) ¥+2y+10y=0, y(O)=1, 9(0)=2. (bh) ¥—3y+2y=0, y(0) = 1 
(i) G+ 6y+8y=0, y(0)=0, y(0) = 2. Gj) 24-1y—6y=0, y(0) = 3, y(0) =5. 
(k) 4y—39y—10y=0, — y( 
(1) 29 —27y —45y=0, — ( 


2. Let D = d/dt. Solve the given differential equations of order greater than 2 by the Laplace transform. 


(a) D’y—5D*y+4y=0, y(0) =y'(0) = y"(0) =y'"(0) = 1. 
(b) D’y+2D?y—Dy—2y=0, y(0) =y'(0) =y"(0) =1. 


(c) y + 8y” + 16y=0, y(0)=y'(0) =y"(0) = 1, _y'"(0) = 16 
(d) y"+y" —6y’ =0, y(0)=y'(0)=0,  y"(0) = 15 

(e) y”—y" +9y'-9y=0, y(0)=1, y'(0)=0, y"(0)=—-9. 
(f) y"+y"+4y'+4y=0, y(0)=2, y/(0)=1, y"(0)=—3. 
(g) y"—2y”—y'+2y=0, y(0)=3, y/(0)=2, y"(0)=6. 


The following second order differential equations with polynomial coefficients have at least one solution that is integrable 
in some neighborhood of t = 0. Apply the Laplace transform (for this purpose use rule (5.2.11) and Problem 13 in §5.2) 
to derive and solve the differential equation for y* = L[y(t)], where y(t) is a solution of the given differential equation. 
Then come back to the same equation and solve it using the formula (5.5.10). 

(a) ty+2(t+1l)yt+y=0; (b) ty+3y—4ty =0; (c) (2¢4+1)y—ty+3y =0; 

(d) tyt+(1+t)y+2y=0; (e) (¢-1g-Zty=0; (f) (-l)¥—(t+1y=0. 


. Determine the initial value problem for which the following inverse Laplace transform is the solution. 


4a, BAT3 444, A+ 
@) gop «6©0) Opes «© Oma 6) Ope 


Section 5.6 of Chapter 5 (Review) 


1. Let D = d/dt be the operator of differentiation. Obtain the Green function for the following differential operators. 


(a) D?+4D+13; (b) D?—6D+25; (c) D?+3D?+D—5; 
(d) D?—6D+18; (e) D?+D-—12; (f) (D?—2D+426)?; 
(g) D?+8D+17; (h) 6D?+D-—1; (i) D*+16; 

(j) 4D?-D-3; (k) 2D?+D-3; (1) 9p*—D. 


2. Solve the initial value problems by Laplace transform. 


(a) y+y=2sint, y(0)=2. (b) ¥+4y+3y=4e%, y(0) =0, 9(0) =2. 
(c) y+ 2y=1, y(0)=3. (d) y+ 1l6y=8sin4t, y(0)=1, 9(0) =-—-1. 
(e) 6§-y-—y=3e", y(0) =0, 9(0) =0. 
(f) 7+4y+13y=120sint, y(0) =0, 9(0) =3. 
(g) J+ y=8cos3t, y(0) =0, (0) =—6. 
(h) 29+77+5y=27te-*, (0) =0, 9(0) =3. 
(i) 49 +y=t, y(0) =0, 9(0) =2. (J) G+ 2 +5y=5e-™*, (0) =0, y(0) =5. 
(k) 7+5y+6y=2e', y(0) =0, 9(0) =3. 
(1) 2+y=68sin2t, (0) = —17, 9(0) =0. 
(m) 4j-—4y+5y=4sint, y(0) =0, y(0) =1. 
(n) y+ 2y+y=6sint—4cost, y(0) = —6, y(0) = 2. 


3. Use the Laplace transformation to solve the following initial value problems with intermittent forcing functions (H (é) 


is the Heaviside function, page 274). 
(a) y' + 2y = 4¢[H(t)— H(t—2)], y(0) =1. 
(b) 4y” — 12y’ + 25y = H(t)-2H(t—1), y(0)=1, y’(0) =3. 


Review Questions for Chapter 5 339 


(c) y” + 25y = 13 sint[H(t)+ H(t—57)], y(0) =0, y’(0) =0. 

(d) y"—2y' + 2y = H(t)— A(t—7/2),  y(0) =0, y’(0) = 1. 

(e) y” +2y' +10y =e [H(t) — H(t—77/3)],  y(0) =1, y’(0) = —2. 
(f) y” —4y' +53y =t[H(t — 1) — H(t — 2)] 

(g) y” +4ry' + An?y = An? [H(t) — H(t — 2) 
(h) y" +4y'+3y= A(t—2), y(0)=0, y’(0) =1. 

(i) y+ 2y’+y=e*H(t-1), (0) =1, y’(0) =1. 

Gj) y”—y' —6y =e'H(t—2), y(0) =0, y’(0) =1. 

(k) 2y” + 2y' + y = (cost)H(t— 7/2), y(0) =0, y/(0) = 1. 

(1) y” +2y + Sy = (t- 2) H(t—- 2) -(¢-2-—k) H(t-2-k), y(0)=1, y'(0) =0. 


4. Use the Laplace transformation to solve the given nonhomogeneous differential equations subject to the indicated initial 


conditions. 
6, O<t<l, 
(a) §+4y+3y=f()= 4 0, 1<t<2,  yO)=1, y'(0)=-4. 
6, 2<t, 
0, O0<t<l, 
(b) ¥+y=f()=4 2, 1<t<2,  y(0)=1, y'(0)=0. 
0, t>2, 
0, 0<t<l, 
. _ - J t-1, 1<b< 2, Oy Ge, 3 
(c) y-4y=fO= 3-t, 2<t <3, y(0) =0, y’(0) = 1. 
0, 3<t, 
ae oe _ft, 0<t<2, pee iee  * 
(a) #59 +6y=FH={ f OSES™ — (0)=1, HO =0. 
t?, 0<t<l, 
(e) 39 -—5y-2y=fH=<¢ et," 1<t<2,  y(0)=1, 9(0) =2. 
cos(t — 2), 2<t, 
0, O0<t<l, 
(f) j-4y+1l3y=ft)=¢ eb 1, 1<t<3,  y(0)=1, 9(0) =2. 
t—3, 3<t, 
0, 0 < t a TT, 
(g) j-Tyt+6y=f® =e sin2, aw<t<2n, y(0)=1, 9(0)=6. 


t-2n7, @<t, 
5. Solve the initial value problems for the differential equations of order higher than 2. 
(a) yo!” + 6y” + 12y' + 8y = 4913 cos$, (0) = y/(0) = (0) = 0. 
(b) y+ 3y" + 4y' + 2y = 10sint, y(0) =—3, y/(0) =—1, y"(0) =3. 
(c) y” —3y" +3y'-y=Pe*, y(0)=y'(0) =0, (0) = 2. 
6. Solve the [VPs involving the Dirac delta-function. The initial conditions are assumed to be homogeneous. 


(a) y” + (2n)?y = d(t — 1) — 36(t — 1/2). (b) yy” 4+2y' +y = 36(t — 2). 
(c) oo +y" +y' +y =6(t— 2) —6(t— 4m). (a) yO +4y' + 4y = 5(t-2). 


7. Use Laplace transforms to solve the IVP y+ 2y+ 5y = f(t), y(0)=0, y(0) =1, where f(t) = 


and f(t) is periodic on [0,0o) with period 2. Hint: L{[f](A) = 10[tanh(A/2)]/. 


8. Consider a simple electrical circuit with current satisfying LI + RI = 6(t —5), I(0) =0. Use the Laplace transform 
method to find the response of the system. 


9. Suppose that a weight of constant mass m = 1 is suspended from a spring with spring constant k = 5 and that the 
air resistance is proportional to the speed. Thus, the differential equation of motion is y+ 2y + 5y = f(t), where y(t) 
denotes the vertical displacement of the mass at time t and f(t) is some periodic force acting on the system whose 
description in one period 7 is f(t) =sint, 0 <t< 7. Find y(t) subject to the initial conditions y(0) = y’(0) = 0. 
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10. Consider a mass-spring-dashpot system that is modeled by the following differential equation: 


y” + 4y' + 20y = f(t) = 20H(t) +405 °(-1)" H(t — nn), 


n>1 


where f(t) is an external force acting on the system and y(t) is the displacement of the mass from the equilibrium 
position. Assuming that the system is initially at rest at equilibrium (y(0) = y’(0) = 0), find the position function y(t). 


11. Find the Laplace transform of Jo(t) using the following steps. 
a) Apply the Laplace transform term-by-term to its Taylor expansion 
(a) Apply Pp y y Pp 
- ea” “| =5 (=1)* (2k)! 


kik! 22k kl kl \2k+1" 
k>0 


LlJo(t)] =L | 


k=0 


(b) Since (2k)! = 2° k![1-3-5...(2k — 1)] = 2" k! (2k — 1)! conclude 


k —1/2 
Loli] = 5 iS =< (143) . 


2k klA2* v2 
because 
2)-1/2 —1/2\ x wa (—L)* (2k\ on al ae Lae 
area ye (YP) aa SP) a= 
k>0 k>0 k>0 
(c) Hence 
1 
L{Jo(t)) = ———. 


12. Use rule (5.2.12) for division by t to obtain the Laplace transform of the Bessel function L [.Jo(t)] = (A” + 1) 


-1/2 


Chapter 6 


|meo 


Two pendula connected with a spring 


Introduction to Systems of ODEs 


Many real-world problems can be modeled by differential equations containing more than one dependent variable 
to be considered. Mathematical and computer simulations of such problems can give rise to systems of ordinary 
differential equations. Another reason to study these systems is the fact that all higher order differential equations 
can be written as an equivalent system of first order equations. The inverse statement is not necessarily true. For 
this reason, most computer programs are written to approximate the solutions for a first order system of ordinary 
differential equations. 


6.1 Some ODE Models 


Differential equations might not be so important if their solutions never appeared in physical models. The electric 
circuit model and mechanical spring-mass model are two classical examples of modeling phenomena using a system 
of differential equations that almost every differential equation textbook presents. We cannot avoid a description of 
these models mostly because of tradition, their simplicity, and their importance in practice. Thus, we will start with 
these two models and then present other models of interest that also use systems of ordinary differential equations. 


6.1.1 RLC-circuits 


A typical RLC electric circuit consists of several loops or devices that can include resistors, inductors (coils), capaci- 
tors, and voltage sources (batteries). When charge, denoted by q(t), flows through the circuit, it is said that there is 
a current—a flow of charge. The unit of electric charge is the coulomb’ (abbreviated C). The conventional symbol 
for current” is J, which was used by the French scientist André-Marie Ampére (1775-1836), after whom the unit of 
electric current is named. 


55Charles Augustin de Coulomb (1736-1806), French physicist and engineer who was best known for developing Coulomb’s law that 


describes force interacting between static electrically charged particles. 
561¢ originates from the French phrase intensité de courant or in English current intensity. 
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The circuits we will consider involve these four elements (resistor, inductor, capacitor, and voltage source) and 
current, as flows through each of these elements causes a voltage drop. The equations needed to estimate the voltage 
drop across each circuit element are presented below. We are not going to discuss the conditions under which these 
equations are valid. 


e In a resistor, the voltage drop is proportional to the current (Ohm’s law, 1827): 
AVyes = Rls. 


The positive constant R is called the resistance and it is measured in ohms®’ (symbol: Q) or milliohms (with 
scale factor 0.001). 


e In a capacitor, the voltage drop is proportional to the charge difference between two plates: 


AVeap = ~ : 


The positive constant C is the capacitance and is measured in farads®® (F) or microfarads (symbolized by 
uF, which is equivalent to 0.000001 = 10~° farads). 


e In an inductor or coil, the voltage drop is proportional to the rate of change of the current: 


dlina 


AVin =L 
Vind dt 


The positive constant L is called the inductance of the coil and is measured in henries®? (symbol H). 
e A voltage source imposes an external voltage drop Vext = —V(t) and it is measured in volts®° (V). 


A current is a net charge flowing through the area per unit time; thus, 


where the dot stands for the derivative with respect to t and D = d/dt. The differential equations that model 
voltage analysis of electric circuits are based on two fundamental laws®! derived by Gustav R. Kirchhoff in 1845. 
Kirchhoff’s Current Law states that the total current entering any point of a circuit equals the total current 
leaving it. Kirchhoff’s Voltage Law states that the sum of the voltage drops around any loop in a circuit is zero. 


Example 6.1.1: (A Simple LRC Circuit) In Fig. 6.1 on page 343, we sketch the simplest circuit involving a 
resistor, an inductor, a capacitor, and a voltage source in series. According to Kirchhoff’s current law, the current 
is the same in each element, that is, 


Tresistor = Lcoil = capacitor = Lsource = dy 


Next we apply Kirchhoff’s voltage law to obtain the following system of differential equations: 


dq 
— - 7 
dt ; 
0 = Vresistor + Veoil + Veapacitor = Veource 
dl gq 
= RI+L—+—-Vi(t). 
gage 


57George Simon Ohm (1789-1854) was a German physicist and mathematician who taught mathematics to Peter Dirichlet at the 
Jesuit Gymnasium in Cologne. 

58 Michael Faraday (1791-1867) was an English scientist who contributed to the fields of electromagnetism and electrochemistry. 

59 Joseph Henry (1797-1878) from the United States, who discovered electromagnetic induction independently of and at about the 
same time as Michael Faraday. 

5°Named after Italian physicist Alessandro Volta (1745-1827), known for the invention of the battery in the 1800s. 

61 German physicist Gustav R. Kirchhoff (1824-1887) who first demonstrated that current flows through a conductor at the speed of 
light. Both circuit rules can be directly derived through approximations from Maxwell’s equations. 
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H 6 0000 
C 
V(t) 
: Ore 
L 
Figure 6.1: LRC-circuit. Figure 6.2: A two-loop circuit. 


Example 6.1.2: (A Two-Loop Circuit) Figure 6.2 on page 343 shows a circuit that consists of two loops. We 
denote the current flowing through the left-hand loop by J; and the current in the right-hand loop by Ig (both in 
the clockwise direction). The current through the resistor R, is I; — Iz. The voltage drops on each element are 


AVr, = Ri(h—I2), AVer, = Rolo 


cal dls 
Ke rah Vi Fr 
The voltage analysis using Kirchhoff’s law leads to the following equations: 
V@ = a + Rit; — Ry Io (left loop), 
dle ‘ 
0 = Rolh +R me Ih (right loop), 
dq 
— = i. 
dt . 


Figure 6.3: A three-loop circuit. 


Example 6.1.3: (A Three-Loop Circuit) To handle the three-loop circuit sketched in Fig. 6.3, we apply 
Kirchhoff’s laws to every loop: 


1 
VO+hh+aq = 0, 
1 
—Gqait le (DIg) + Ro(I2 — Is) = QO, 
V(t) — Reo — Is) + Ls (DI3) = OQ, 
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M, M, 


Figure 6.4: Spring-mass system. 


where D = d/d¢t is the differential operator. From the first equation, it follows that 


q q V(t) 
Rh=VvV@-+ => h=- 
ie O-G Ch.’ Ri 
Hence, 
def dq q V(t) 
bg@=g=— = h-h= Db, 
: dt ee aie Re 
: dy i Ro Ro 
Dh2#h=—~ = eps tf L 
a) ae Ci, i) in 
is dls R3 Roy V(t) 
Diz = [3 = a 
5 8 at fa? abe Ee 


6.1.2 Spring-Mass Systems 


When a system with masses connected by springs is in motion, the springs are subject to both elongation and 
compression. It is clear from experience that there is some force, called the restoring force, which tends to return 
the attached mass to its equilibrium position. By Hooke’s law, the restoring force F has a direction opposite to the 
elongation (or compression) and is proportional to the distance of the mass from the equilibrium position. This can 
be expressed by F = —kx, where k is a constant of proportionality, called the spring constant or force constant, 
and x is the displacement. The former has units kg/sec? in SI (abbreviation for the International System of Units). 


In Figure 6.4, the two bodies of masses m1 and mz are connected to three springs of negligible mass having spring 
constants k,, kz, and ks, respectively. In turn, two of these three springs are attached to rigid supports. Let 2 (t) 
and £2(t) denote the horizontal displacements of the bodies from their equilibrium positions. We assume that these 
two bodies move on a frictionless surface. 

According to Newton’s second law of motion, the time rate of change in the momentum®? of a body is equal in 
magnitude and direction to the net force acting on the body and it has the same direction as the force. 

Let us consider the left body of mass m,. The elongation of the first spring is x,, which therefore exerts a force 
of —kx, on mass m, by Hooke’s law. The elongation of the second spring is x2 — x, because it is subject to both an 
elongation and compression, so it exerts a force of k2(a2 — 21) on mass m1. By Newton’s second law, we have 


du 
ae = —kj a, + ko (x9 a 


where v, is the velocity of the mass my . 
Similarly, the net force exerted on the mass mz is due to the elongation of the second spring, namely, x2 — 71, 
and the compression of the third spring by x2. Therefore, from Newton’s second law, it follows that 


where v2 is the velocity of the mass mg. 


62 Momentum of a body of mass m is the product mv, where v is its velocity. 
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Since velocity is the derivative of displacement, we conclude that the motion of the coupled system is modeled 
by the following system of first order differential equations: 


(6.1.1) 


6.1.3. The Euler-Lagrange Equation 


Many interesting models originate from classical mechanical problems. The most general way to derive the corre- 
sponding systems of differential equations describing these models is the Euler-Lagrange equations. The method 
originated in works by Leonhard Euler in 1733; later in 1750, his student Joseph-Louis Lagrange substantially 
improved this method. There are two very important reasons for working with Lagrange’s equations rather than 
Newton’s. The first is that Lagrange’s equations hold in any coordinate system, while Newton’s is restricted to an 
inertial frame. The second is the ease with which we can deal with constraints in the Lagrangian system. 

We demonstrate the Euler-Lagrange method in some elaborative examples from mechanics. We restrict ourselves 
to conservative systems. Let £ = K — II be the Lagrange function (or Lagrangian) that is equal to the difference 
between the kinetic energy K and the potential energy II. 

For the spring-mass system from 86.1.2, the kinetic energy is given by 


1 1 
K= 5 m4? + 5 m3, 


where « = da/dt is the derivative of x with respect to time. The potential energy is proportional to the square of 
the amount the spring is stretched or compressed, so 


1 1 1 
T= 5 kya? + 3 ko(a1 _ ar)” + 5 k3x3. 


The Euler-Lagrange equations of the first kind for a system with two degrees of freedom are 


00. Ob. 
dt 02; Ox; : 


i=1,2, (6.1.2) 


where 21, 2 are (generalized) coordinates. When the kinetic energy does not depend on displacements, and the 
potential energy does not depend on velocities, the Euler-Lagrange equations (6.1.2) become 


d ok Ol 


— —4+— = } = 1,2. 6.1.3 
dt Ox; . Ox; a; : : ( ) 
OL OK d OL OL Ol 
Hence, Oa; = ag, 7 dt Od; = Mj, %, 7= 1,2; and we have on = Oe Since 
Ol oll 
a (x1 — £2), Chee (a1 — %2) + k3 a2, 
the Euler-Lagrange equations are read as 
my X1 = ky Ly ko (v1 x2) and mg Le = kg v4 ko (a1 x2), 


which coincide with Newton’s second law equations (6.1.1) found previously. 
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6.1.4 Pendulum 


A bob of mass m is attached to one end of a rigid, but inextensible weightless rod (or shaft) of length @. The other 
end of the shaft is attached to a support that allows it to rotate without friction. If the bob’s oscillations take 
place within a plane, this system is called an ideal pendulum. The position of the bob is measured by the angle 
@ between the shaft and the downward vertical direction, with the counterclockwise direction taken as positive. 

To analyze the plane motion of an oscillating pendulum, it is convenient to reformulate Newton’s second law into 
its rotational equivalent. In science and engineering the usual unit of angle measurement is the radian, abbreviated 
by rad. It was abolished in 1995 and can be dropped—it is dimensionless in SI (from French: Systéme international 
d’unités). 

Velocity is a measure of both the speed and direction that an object is traveling. When we consider only rotational 
motion, the direction of the object is known and its velocity is defined by its magnitude—speed, and its direction is 
determined by right hand rule. Therefore, we can drop vector notation and operate only with scalar quantities. The 
angular velocity (or instantaneous angular speed) of a body rotated about a fixed axis is the ratio of the angle 
traversed to the amount of time it takes to traverse that angle when the time tends to zero: 

i AOé b dé 
= ato At dé’ 
where @ is an angle in the cylindrical coordinate system and in which the axis of rotation is taken to be in the z 
direction. The unit of angular velocity is a radian per second (rad/sec, which is 1/sec) and it is a measure of the 
angular displacement per unit time. Since the velocity vector is always tangent to the circular path, it is called the 
tangential velocity. Its magnitude v = rd@/dt = rw is the linear velocity, where r = |r| and w = |w|. In words, 
the tangential speed of a point on a rotating rigid object equals the product of the perpendicular distance of that 
point from the axis of rotation with the angular speed. Note that the tangential velocity and angular velocity only 
refer to its magnitude; no direction is involved. Although every point on the rigid body has the same angular speed, 
not every point has the same linear speed because r is not the same for all points on the object. 
The angular acceleration is the rate of change of angular velocity with time: 


Aw dw d°6_ 


a= 


Atso At dt dé 


It has units of radians per second squared (rad/sec?), or just sec”. Note that @ is said to be positive when the 
rate of counterclockwise rotation is increasing or when the rate of clockwise rotation is decreasing. When rotating 
about a fixed axis, every particle on a rigid object rotates through the same angle and has the same angular speed 
and the same angular acceleration. That is, the quantities 6, w, and a characterize the rotational motion of the 
entire rigid body. 

Angular position (0), angular speed (w), and angular acceleration (a) are analogous to linear position (a), linear 
speed (v), and linear acceleration (a) but they differ dimensionally by a factor of unit length. The directions of 
angular velocity (w) and angular acceleration (a) are along the axis of rotation. If an object rotates in the xy plane, 
the direction of w is out of the plane when the rotation is counterclockwise and into the plane when the rotation is 
clockwise. 

A point rotating in a circular path undergoes a centripetal, or radial, acceleration a, of magnitude v?/r directed 
toward the center of rotation. Since v = rw for a point on a rotating body, we can express the radial acceleration of 
that point as a, = rw?. 

Suppose a body of mass m is constrained to move in a circle of radius r. Its position is naturally described by 
an angular displacement @ from some reference position. Such a body has mass moment of inertia® 


J=mr? [kg-m’). 


It is a measure of an object’s resistance to changes to its rotation rate—the angular counterpart to the mass. If a 
rotating rigid object consists of a collection of particles, each having mass m;, then J = 5>,mj,r?. The rotational 
kinetic energy of a rigid body is 


1 il 
K= 3 domariw® =5 ar 


63We assume that the body has a point mass. The moment of inertia for a continuously distributed mass is the integral of density 
times the square of the radius. 
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Recall that the angular momentum of a body (particle) of mass m is 
A=rxmv= Jw, 


where v is the velocity vector of the particle, r is the position vector of the particle relative to the origin, mv is the 
linear momentum of the particle, and x denotes the cross product. For a system of particles, the angular momentum 
is the vector sum of the individual angular momentums. The time derivative of the angular momentum is called 
torque: 


EE an ot agen 
dt = dt dt dt 
because 
rxmv=vxmv => lv x mv| = m|v|- jv] sin 0° = 0. 


Torque or moment is a measure of how much a force acting on an object causes that object to rotate. Suppose 
that a force F acts on a wrench pivoted on the axis through the origin. Then torque is defined as 


M=rxF = M = |M| =rFsing = Fran’, 


where Fign is the component of the force tangent to the circle in the direction of an increasing @ in counterclockwise 
direction, r is the distance between the pivot point and the point of the application of F, and ¢ is the angle between 
the line of action of the force F and the line of the radius r. The torque is defined only when a reference axis is 
specified. 

Imagine pushing a door open. The force of your push (F) causes the door to rotate about its hinges (the pivot 
point). How hard you need to push the door depends on the distance you are from the hinges (r). The closer you 
are to the hinges (i.e., the smaller r is), the harder it is to push. 

Thus, Newton’s second law when applied to rotational movement becomes 

2 
(i 


re (6.1.4) 


that is, the time rate of change of angular momentum about any point is equal to the resultant torque exerted by all 

external forces acting on the body about that point. The torque acting on the particle is proportional to its angular 

acceleration, and the proportionality constant is the moment of inertia: M = Ja. i] 

For the pendulum, there are two forces (we neglect friction resistance force at the supported end of the shaft) 

acting on it: gravity, directed downward, and air resistance, directed opposite to the direction of motion. The latter 

force, denoted Fresistance, has only a tangential component. This damping force is assumed to be approximately 
proportional to the angular velocity, that is, 
Fresistance =—-K0 


for some positive constant «. We also assume here that @ and 6 = d@/dt are both positive when moved in counter- 
clockwise direction. From this assumption, the moment of the damping force becomes 
Myesistance = —K00. 


The tangential component of the gravitational force has magnitude mgsin@ and acts in the clockwise direction. 
Therefore, its moment is 
Meravity = —£mg sin@. 


From Newton’s second law (6.1.4), it follows that 


d’0 do A 
2 —_—_- = — —_— 3] 2 gi = male 
me 2 Ke a émg sin 0 or 6+70+u* sind = 0, (6.1.5) 
where i j 
eae 2_7 
i err and wW a 


If the damping force (due to air resistance and friction at the pivot) is small, it can be neglected; so we can set 7 = 0 
and the pendulum equation becomes ; 
6+w” sind = 0. (6.1.6) 
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Figure 6.5: An oscillating pendulum. 


Equation (6.1.6) has no solutions in terms of elementary functions. If 9 remains small, say |6| < 0.1 radians, we may 
replace® sin@ by @ in the pendulum equation and obtain the following linear differential equation: 


6+w?6 =0. a 
We can also derive the pendulum equation (6.1.5) using the Euler-Lagrange equation of motion (with n degrees of 
freedom) 
d /0L OL 
Sele ee. PSi ee 6.1.7 
dt (si) Oak ” (6.1.7) 


where £ is the Lagrange function, that is, £ = K —II, K is the kinetic energy, and II is its potential energy. Here gy, 
represents the generalized coordinates of the system. For the pendulum n = 1 and gq, = 6. Since the linear velocity 
of the bob is £0, the kinetic energy is 


K = im (06) 
==-m : 
sme) 
For the potential energy expression, we assume that datum to be the lowest position of the bob, that is, when 6 = 0. 
When the pendulum is at some angle 0, the position of the mass above the datum is @(1—cos@). Hence the potential 


energy is 
II = mg €(1 — cos @). 


Therefore, the Lagrange function becomes 


1 do \” 1 \2 
c= 5m (e2) — mg &(1 — cos) = 5m (£6) — mg €(1 — cos 6), 


and we have 


a6 dt \'a6 
OL Oll . 
3 7907 —mgé sin 8, 


which leads to Eq. (6.1.5). 


64Since the Taylor series for sin@ about @ = 0 is the following sign alternating series 


a 6 
a Se re 


the first term @ is a good approximation of the function sin @ for small values of 6. 
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So far we have considered only an ideal pendulum that can oscillate within a plane. Now we relax this condition 
and consider a pendulum consisting of a compact mass m on the end of a light inextensible string of length @. 
Suppose that the mass is free to move in any direction (as long as the string remains taut). Let the fixed end of 
the string be located at the origin of our coordinate system. We can define Cartesian coordinates, (x, y, z), such 
that the z-axis points vertically upward. We can also define spherical coordinates, (r, 0, 6), whose axis points along 
the —z-axis. The latter coordinates are the most convenient since r is constrained to always take the value r = . 
However, the two angular coordinates, 9 and @, are free to vary independently. Hence, this is a two degrees of 
freedom system. 

In spherical coordinates 

z=fsindcosd, y=fsindsingd, z= —écos8, 


the potential energy of the system becomes II = mg(é+ z) = mgé (1 — cos@) . Since the velocity of the bob is 
eart+yt ear (@ + sin? 60”) 
the Lagrangian of the system can be written as 
£=K-I= i,m (# + sin? 06°) + mgé (cos@—1). 
The Euler-Lagrange equations give 


d ; : 
a (me6) — mé? sin 8 cos 6 ¢* + mgé sind = 0, 
» (me? sin? 66) =0 
dt , 
Example 6.1.4: (Coupled pendula) Consider two pendula of masses m; and mz coupled by a Hookian spring 
with spring constant k (see figure on the front page 341). Suppose that the spring is attached to each rod at a 
distance a from their pivots, and that the pendula are far apart so that the spring can be assumed to be horizontal 
during their oscillations. Let 61/02 and ¢;/€2 be the angle of inclination of the shaft with respect to the downward 
vertical line and the length of the shaft for each pendulum. Its kinetic energy is the sum of kinetic energies of two 


individual pendula: ‘ 2 
K=" (ah) +2 (abr) 


The potential energy is accumulated by the spring, which accounts for & (a sin @, — a sin 62)”, and by lifting both 


masses 2 


k 
II = mg; (1 — cos 01) + magég (1 — cos 62) + > (sin 0, — sin 02)” : 
Substituting these expressions into the Euler-Lagrange equations (6.1.3), we obtain the system of motion: 


m,026, + mygey sin 6, + ak (sin 61 — sin 62) cos 4, = 0, (6 1 8) 
m020> + mogta sin 02 — ak (sin 6; — sin 62)) cos 62 = 0. -_ 


6.1.5 Laminated Material 


When a space shuttle enters the atmosphere to land on Earth, air friction causes its body to be heated to a high 
temperature. To prevent the shuttle from melting, its shell is covered with ceramic plates because there are no 
metals that can resist such a high temperature. 

Let us consider the problem of describing the temperature in a laminated material. Suppose that the rate of 
heat flow between two objects in thermal contact is proportional to the difference in their temperatures and that 
the rate at which heat is stored in an object is proportional to the rate of change of its temperature. The net heat 
flow rate into an object must balance the heat storage rate. This balance for a three-layer material is described as 


dT 


aa pi2(T2 — T1) + poi(To — Ti), 
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c2— = pi2(Ti — T2) + po3(T3 — Ta), 


ara = po3(T2 — T3) + ps3a(T4 — Ts), 


where the c’s and p’s are constants of proportionality, and Ty) and T, are the temperatures at the two sides of the 
laminated material. Here T; is the temperature in the k-th layer (k = 1, 2,3). 


6.1.6 Flow Problems 


Examples of leaking tanks containing liquids constitute only a portion of flow problems; however, they model a whole 
host of familiar real-world phenomena. For example, “liquid” can also be heat flowing out of a cooling cup of coffee, 
a cold soda warming up to room temperature, an electric charge draining from a capacitor, or even a radioactive 
material leaking (decaying) to a stable material. The thermal counterparts of water/liquid volume, water height, 
tank base area, and drain area are heat, temperature, specific heat, and thermal conductivity. The corresponding 
electrical units are charge, voltage, capacitance, and the reciprocal of resistance. 

We start with a simple example of two tanks coupled together. We assume that all tanks are of the same 
cylindrical shape and have a volume with base area 1; this allows us to track the tank’s water volume using water 
height y(t). If k denotes the cross-sectional area of its drain, we choose the units so that the water level y satisfies 
y’ = —ky when each tank is in isolation. To finish the physical setup, we put two tanks one above the other so 
that water drains from the top tank into the bottom one, and the pump returns to the top tank water that has 
gravity-drained from the bottom tank. This leads to the following system of differential equations: 


y= —kiyi + keys, yo = kiys — key. (6.1.9) 
By adding a third tank, the corresponding vector differential equation becomes 


1 = —kiyi + Oy2 + kz ys, 
yo = kiys — kayo + Oys, (6.1.10) 
ys = Oy1 + kaya — kgys. 


A similar system of differential equations can be used to model a sequence of n connected tanks (n is a positive 
integer). 


Example 6.1.5: (Cascading Tanks) Suppose there are three cascading tanks in sequence. Tank A contains 
2001liters (1) of a salt solution with a concentration of 100 grams per liter. The next two tanks, B and C, 
contain pure water with volumes of 100 liters and 50 liters, respectively. At instant t = 0, fresh water is poured 
into tank A at arate 101/min. The well-stirred mixture flows out of tank A at the same rate it flows into tank B. 
The well-stirred mixture is pumped from tank B into tank C' at the same rate of 101/min. Tank C' has a hole that 
allows brine to run out at a rate 101/min. Find the amount of salt in each tank at any time. 

Solution. We denote the amount of salt in the containers A, B, and C by Q4(t), Q?(t), and Q@(t), respectively. 
Initially, 

Q4(0) = 200 liters x 0.1kg/liter =20kg, @Q?(0)=0, and Q°(0)=0. 


The rate in and rate out for each of the tanks are 


RA ee es RA, — = kg ) . (1 wee a QA(t) kg 


min’ out \ 200 liters min 20 min’ 

Bt); ee liters QFP(t) kg 

RP =RA,, RB = Q"( 10 = 

- ute que 100 liters : min 10 min’ 
c k lit Bit) k 

RE = Roy Ry = (SOE —) x (10 SE) — SO 
50 liters min 5 min 


The balanced equation for each tank is 


dQ*(t)_ _ Q4*() 


pores A 
7 5 7 70.05 2A (0), (6.1.11) 
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A B C 


Figure 6.6: Cascading tanks. 


M, k M, 


Figure 6.7: Problem 1. 


ie) = 0.05 Q4(t) — 0.197 (8), (6.1.12) 
eo) = 0.19?) —0.2Q°(), (@.113) 


Since Eq. (6.1.11) is a linear differential equation, the solution that satisfies the initial condition Q4(0) = 20 is 
Ora) = 20 e005. 


Equations (6.1.12) and (6.1.13) are nonhomogeneous linear differential equations. We recall from §2.5 that the initial 
value problem 
dQ 


= +aQh) =f, Q0) =0 


has the explicit solution (see Eq. (2.5.9) on page 88) 
t 

=| fae Yd. 
0 


With this in hand, the solutions of equations (6.1.12) and (6.1.13) that satisfy the homogeneous initial conditions 
are 


t 
QP (t) = 0.05 | [20 ener e 7 9-1(t—7) dr = 20 lens = aoe 
0) 


t 
and Q°(t) = Q1 i: 20 lemnee _ en e7 0-2(t—7) dr 
0 


40 ene _ eo | — 20 le -_ ml 
3 : 


1. Two cars of masses m1 and m2 are connected by a spring of force constant k (Fig. 6.7). They are free to roll along the 
abscissa. Derive the equations of motion for each car. 


2. Three cars of masses m1, m2, and m3 are connected by two springs with spring constants ki and kz, respectively (see 
Fig. 6.8). Set up the equations of motion for each car. 
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Figure 6.8: Problem 2. 


X| X 


k 
=. 2 
m --VVL4 


F(t) 


Figure 6.9: Problem 3, dynamic damper. 


Figure 6.10: Problem 4. 


Figure 6.11: Problem 5. 
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Figure 6.12: Problem 6. 


Figure 6.14: Problem 8. Figure 6.15: Problem 9. 


3. Set up the system of differential equations that simulate a dynamic damper with F(t) = sin(wt + a); see Fig. 6.9. 


4. Derive the system of differential equations that simulate the mass-and-spring system shown in Fig. 6.10. 


5. Consider the mechanical system of three springs with force constants ki, k2, and k3 and three masses m1, m2, and m3 


(see Fig. 6.11). They are free to slide along the abscissa. The left spring is attached to the wall. Set up the equation 
of motion. 


. A pendulum consists of a rigid massless rod with two attached bobs of masses m1 and mz (see Fig. 6.12). The distance 
between these bobs is L2 and the bob of mass mz is attached to its end. The total length of the rod is L. Find the 
equation of motion when friction is neglected. 


. A pendulum consists of a bob of mass m attached to a pivot by a rigid and massless shaft of length L (see Fig. 6.13). 
The pivot, which is the axle of a homogeneous solid wheel of radius R and mass M, is free to roll horizontally. Set up 
the equation of motion for the system. 


. A pendulum consists of a bob of mass m attached to a pivot by a rigid and massless shaft of length L (see Fig. 6.14). 
The pivot is attached to a block of mass M that is free to slide along abscissa. The block is attached to a wall to the 


AVM 


@: 


Figure 6.16: Problem 10. 
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° F(t) 
k 


Figure 6.17: Problem 11. Figure 6.18: Problem 12. 


Figure 6.19: Problem 13. Figure 6.20: Problem 14. 


left by a spring with a spring force constant k. Derive the equation of motion. 


A pendulum consists of a bob of mass m attached to the lower end of a massless shaft of length L (see Fig. 6.15). If 
the spring has force constant k and is attached to the pendulum shaft at a distance a < L, find the equation of motion. 


A pendulum consists of a rigid rod of length L, pivoted at its upper end and carrying a mass m at its lower end (see 
Fig. 6.16). Two springs with force constants k; and kz, respectively, are attached to the rod at a distance a from the 
pivot and their other ends are rigidly attached to a wall. Find the equation of motion. 
Find a system of equations for the mass m attached to a spring of force constant k via a pulley of radius R and mass 
M (see Fig. 6.17). 
Derive an equation of motion for a uniformly thin disk of radius R that rolls without slipping on the abscissa under the 
action of a horizontal force F’ applied at its center (see Fig. 6.18). Let k be the coefficient of friction. 
A uniform rod of length Z and mass M is pivoted at one end and can swing in a vertical plane (see Fig. 6.19). A 
homogeneous disk of mass m and radius r is attached by a pivot at its center to the free end of the rod. Ignoring 
friction, set up the equation of motion for this system. 
Two pendula of masses m and M, respectively, each of length L, are suspended from the same horizontal line (see 
Fig. 6.20). Their angular displacements are 6(t) and y(t), respectively. The bobs are coupled by a spring of force 
constant &. Derive an equation of motion. 
(a) Show that the functions 

x(0) = 3 cos@+cos36, y(@) =3 sind — sin30 


satisfy the initial value problem 
xv’ —2y'+3x2=0, y” +22’ +3y =0, x(0) =4, y(0) = 2'(0) =y'(0) =0. 


(b) Verify that these differential equations describe the trajectory (x(t), y(t)) of a particle moving in the plane along 
the hypocycloid traced by a point P(x, y), fixed on the circumference of a circle of radius r = 1, and that rolls around 
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Figure 6.21: Problem 18. Figure 6.22: Problem 19. 


inside a circle of radius R = 4. The parameter @ represents the angle measured in the counterclockwise direction from 
the abscissa to the line through the center of the small circle and the origin. 


16. A ball of mass m and radius r rolls in a hemispherical bowl of radius R. Determine the Lagrangian equations of 
motion for the ball in spherical coordinates (9 and ¢); denote by g the acceleration due to gravity. Find the period of 
oscillations. To avoid elliptic integrals, keep @ small and replace sin@ by 0, and cos@ by 1 — 6? /2. 


17. Solve the previous problem when the ball slides from the edge to the bottom. Find the time of descent. 

18. An electrical circuit consists of two loops, as shown in Fig. 6.21. Set up the dynamic equations for the circuit. 
19. An electrical circuit consists of two loops, as shown in Fig. 6.22. Set up the dynamic equations for the circuit. 
20. Solve the problem from Example 6.1.5 when all three cascading tanks contain 200 liters of liquid. 


21. A shaft carrying two disks is attached at each end to a wall. The distance between the disks and between each disk and 
the wall is 2. Each disk can turn about the shaft, but in so doing it exerts a torque on the shaft. The angular coordinates 
01, 02 represent displacements from equilibrium, at which there is no torque. The total potential energy of the system 


is TI = . (0? + (01; — 62)? + 65], where k = C/€ and C is the torsional stiffness of the shaft; the kinetic energy is K 


= 3 [di + 3263], where J; and J2 are two moments of inertia. Set up the Lagrangian differential equations for the 
system. 


22. The Harrod—Domar model”? is used to explain the growth rate of developing countries’ economies based on saving 
capital and using that capital as investment in productivity: K= s P(K,L), where K is capital, L is labor, P(K, L) 
is output, which is assumed to be a homogeneous function P(aKk,aL) = a P(K, L), s is the fraction (0 < s < 1) of the 
output that is saved and the rest is consumed. Assuming that the labor force is growing according to the simple growth 
law L = rL, derive a differential equation for the ratio R(t) = K(t)/L(t). 


23. Consider two interconnected tanks. The first one, which we call tank A, initially contains 50 liters of a solution with 
50 grams of salt, and the second tank, which we call tank B, initially contains 100 liters of a solution with 100 grams of 
salt. Water containing 6 gram/1 of salt flows into tank A at a rate of 101/min. The mixture flows from tank A to tank 
B at a rate of 61/min. Water containing 10 gram/1 of salt also flows into tank B at a rate of 121/min from outside. 
The mixture drains from tank B at a rate of 71/min, of which some flows back into tank A at a rate of 31/min, while 
the remainder leaves the system. 


(a) Let Q4(t) and Q(t), respectively, be the amount of salt in each tank at time t. Write down differential equa- 
tions and initial conditions that model the flow process. Observe that the system of differential equations is 
nonhomogeneous. 


(b) Find the values of Q4 and Q® for which the system is in equilibrium, that is, does not change with time. Let Qa 
and Q& be the equilibrium values. Can you predict which tank will approach its equilibrium state more rapidly? 


(c) Let ai(t) = Q4(t) — Q# and x2 = Q?(t) — QZ. Determine an initial value problem for 21 and a2. Observe that 


the system of equations for x1 and x2 is homogeneous. 


24. In 1963, American mathematician and meteorologist Edward Norton Lorenz (1917-2008) from Massachusetts Institute 
of Technology introduced a simplified mathematical model for atmospheric convection 


b=o(y—2), y=pr—y—az, 2=ay— Bz, 


where o, p, and 6 are constants. Use a computer solver to plot some solutions when o = 10, p = 28, and 6 = 8/3. 


65 The model was developed independently by an English economist Roy F. Harrod (1900-1978) in 1939 and in 1946 by Evsey Domar, 
a Russian American economist Evsey David Domar/Domashevitsky (1914-1997), who immigrated to the US in 1936. 
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6.2 Matrices 


Some quantities may be completely identified by a magnitude and a direction, as, for example, force, velocity, mo- 
mentum, and acceleration. Such quantities are called vectors. This definition suggests the geometric representation 
of a vector as a directed line segment, or “arrow,” where the length of the arrow is scaled according to the magnitude 
of the vector. Observe that the location of a vector in space is not specified; only its magnitude and direction are 
known. Fixing a Cartesian system of coordinates, we can move a vector so that it starts at the origin of a rectangular 
coordinate system. Then each vector can be characterized by a point corresponding to the end of the arrow. This 
endpoint is uniquely identified by its coordinates. 

There are two ways to write coordinates of a vector: either as a column or as a row. Correspondingly, we get 
two kinds of vectors: column vectors, denoted by lower case letters in bold font (as x), and row vectors, denoted by 
lower case letters with arrows above them (as #). A transformation of a column vector into raw vector or vice versa 
is called transposition. In applications, it is a custom to identify a vector with its coordinates written in column 
form. For example, a vector u in fixed Cartesian 3-space can be written as a three-dimensional column vector: 


U1 U1 
= = Ts 
u= | ue or u= |u2 or u’ = (tu, U2, U3), 
U3 U3 


where T denotes transposition (see Definition 6.3, page 358) and tw = (uj, ug, u3) is a row vector. In this text, we 
mostly use column vectors. 


Definition 6.1: The scalar or inner product of two column vectors x? = (a1,%2,...,2n) and y? = 


(Y1,Y2;+++;Yn) of the same size n is a number (real or complex) denoted by (x,y) and is defined through 
the dot product: 


where X is a complex conjugate of the vector x. The Euclidean norm, or length, or magnitude of a vector x 
is the positive number 


xl] = (x,x)!/? = (Rx 


Thus, the Euclidean norm of a vector with real components is the square root of the sum of the squares of its 
components. If, for example, a vector x has complex components x? = (x1, 22,...,%n), Te = aptjby, k =1,2,...,n, 
then its norm is 


[xl] = [a2 +02 +02 402 +---+02 +02)”. 


There are known other equivalent definitions of the norm; however, this one is the most useful in applications. 


Example 6.2.1: In MATLAB®, we can define a vector of 20 random elements and find its Euclidean norm: 
vect=rand(1,20); norm(vect,2); 

To find the sum of maximum entries (Manhattan norm), just type norm(vect, 1). Maple” has a similar command 
with(LinearAlgebra): VectorNorm( <2,-1>, 2); which gives ,/2? + (—1)? = V5. Similarly, the input 
VectorNorm( <2,-1>, 1) will give the Manhattan norm 3 = 2+|—1| as output. Mathematica® also has a dedicated 
command: Norm[vect]. To operate with vectors in Maxima, one needs to load the package load(vect). If your 
vector is (—3,2,4) and you need to find its square norm, type in Maxima: 

lsum(x? , x, [-3,2,4]); 
Sage uses: v.norm() or v.norm(2) for the Euclidean norm of vector v. 


Definition 6.2: Two vectors x and y are said to be orthogonal if their scalar product is zero: 


(x,y) =0. 
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A matrix is a rectangular array of objects or entries, written in rows and columns. The word originated in ancient 
Rome from Latin matr-, mater (womb or parent). The term “matrix” was first mentioned in the mathematical 
literature in a 1850 paper®® by James Joseph Sylvester. In what follows, we use numbers (complex or real) or 
functions as entries, but generally speaking these objects can be anything you want. The rectangular array is 
usually enclosed either in square brackets 


x 

Q11 a12  *'* Gin 
a21 a22  *'* G2n 

A= [ais] = 
y Am1 aAm2 as. Amn 

Figure 6.23: Orthogonal vectors 
or in parentheses 
a11 a12  "'* Gin 
a21 a22  *'' G2n 
A = (aij) = fay ‘ 
aml Am2 ars Amn 


and consists of m rows and n columns of mn objects (or entries) chosen from a given set. In this case we speak 
about an m x n matrix (pronounced “m by n”). In the symbol a;;, which represents a typical entry (or element), 
the first subscript (i) denotes the row and the second subscript (7) denotes the column occupied by the entry. 

The number of rows and the number of columns are called the dimensions of the matrix. When two matrices 
have the same dimensions, we say that they are of the same shape, of the same order, or of the same size. Unless 
otherwise indicated, matrices will be denoted by capital letters in a bold font. 

Two matrices A = [a;;] and B = [};,] are equal, written A = B, if and only if they have the same dimensions 
and their corresponding entries are equal. Thus, an equality between two m x n matrices A and B entails equalities 
between mn pairs of elements: a1, = 611, G12 = 612, and so on. Two matrices of the same size may be added or 
subtracted. We summarize the basic properties of arithmetic operations with matrices. 


e Equality: Two matrices A = [a;;] and B = [b;;] are said to be equal if a;; = b,,; for all indices i, j. 


e Addition and Subtraction: For matrices A = [a;;] and B = [};,] of the same size, we have 


AtB= (ai; =e bij]. 


e Multiplication by a constant: For any constant c, we have c- A = [c- a,j]. 


Example 6.2.2: The equality between two 3 x 2 matrices 


Qi aj12 L, 2 
21 a22 = 2 3 
a3i 32 3.4 


is equivalent to the six identities ayy = I, aiy2= 2, a2, = 2, ag2 = 3, a31 = 3, a32 = 4, 


Example 6.2.3: The sum and difference of two matrices 


1 
A= | 2 and B= 2 —2 
5 


New 


66 Sylvester (1814-1897) was born James Joseph to a Jewish family in London, England; later he adopted the surname Sylvester. He 
invented a great number of other mathematical terms such as graph and discriminant. 
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are 


0 4 2 2 
A+B=] 4 2 and A-—B=] 0 6 
8 8 2 6 


The product of a scalar (number) a with an m x n matrix A = [a,j] is an m x n matrix denoted by aA = 
[aa,;]. The scalar multiplies through each element of the matrix. Usually, we put the scalar multiplier on the left of 
the matrix, but Aa means the same as aA. Also, (—1)A is simply —A and is called the negative of A. Similarly, 
(—a)A is written —aA, and A + (—B) is written A — B. 


Definition 6.3: A matrix that is obtained from an m x n matrix A = [a;,;] by interchanging rows and columns 
is called the transpose of A and is usually denoted by A? or A‘ or even A’. Thus A? = [a,;]. A matrix is 


called symmetric if A7 = A; that is, Qij = A745. 
For any two matrices of the same size, A and B, we have 
(AT)) =A, (A+B)? =A7™+BT. 


If X is a constant, then 


(\A)* = AAT. 
Example 6.2.4: Let 
1 3 -1 1 
A=|2 4 and B= 2 -—2 
5 7 3 1 
Then 
T 1 2 5 pr | -l 2 3 
elas and B* = 1 21 
From Example 6.2.3, it follows that 
T 
0 4 
AGB =| 4.9) 2] * * |) aarp 
8 8 4 2 


Definition 6.4: The complex conjugate of the matrix A = [a;;], denoted by A, is the matrix obtained from A 
after replacing each element a;; = a+ jG by its conjugate @;; = a — j8, where j? = —1. 


Definition 6.5: The adjoint of the m x n matrix A is the transpose of its complex conjugate matrix and is 
denoted by A* or A#, that is, A* = A? = A’. A matrix is called self-adjoint or Hermitian if A* = A; 
that is, Gij = Aji. 


Note that if A is a real-valued matrix, then its adjoint is just its transposition because the complex conjugate 
operation does not change real entries. If A and B are two n x n matrices and is a (complex) scalar, then the 
following properties hold: 


0 (A)=A, (AT)JT=A, (A*)* =A. 
e (A+B)=A+B, (A+B)? =A7+B’, (A+B)* = A*+B*. 
e OA) =NA, (AA)F =AA™, (AA)* =X A". 

Example 6.2.5: Consider the following two matrices with complex entries: 


_|iti j _|2 j 
a= [tid] aa w=[2, 
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Their complex conjugate, transposed, and adjoint matrices can be written as follows: 


aly? al eae, a) 


is denoted by the symbol I (or I,, when there is a need to emphasize the dimensions of the matrix) and is called 
the identity matrix (or unit matrix). The matrix with all entries being zero is denoted by 0 and is called the 
zero matrix. 


The product AB of two matrices is defined whenever the number of columns of the first matrix A is the same 
as the number of rows of the second matrix B. If A is an m x n matrix and B is an n x r matrix, then the product 
C = AB is an m X r matrix whose element cj; in the 7-th row and j-th column is defined as the inner or scalar 
product of the i-th row of A and j-th column of B. Namely, 


n 
Cig = 4101; + Gigbey + +++ + Gindar = s Aik dj. 
k=1 


Every n xX r matrix B can be represented in column form as 


B= [b;,b2,...,b,] 


with b;,..., b, being column vectors of size n: b; = (b1;,baj,.--,bnj)*. Similarly, the transpose n x m matrix AT 
can also be rewritten in column form: 
A? =[aj,a9,..., am] 
with n-vectors ay = (ap1,442,-.-,@kn)’, k = 1,2,...,m. Then the product AB of two matrices becomes 
AB = |[ax- bj] (KH 152) oie pS Tr) 
The product of m x n matrix A and an n-column vector x = (x1,22,...,@n)? (which is an n x 1 matrix) can be 
written as 
Ax =a, + tag +-+++%nan, 
where A = [aj,a,...,a,] is the matrix comprised of column vectors of size m. 


It is easy to show that matrix multiplication satisfies the associative law (AB)C = A(BC) and the distributive 
law A(B + C) = AB + AC, but, generally speaking, it might not satisfy the commutative law, that is, it may 
happen that AB 4 BA. 

Some properties of matrix multiplication differ from the corresponding properties of numbers, and we emphasize 
some of them below. 


e The multiplication of matrices may not commute even for square matrices. Moreover, the product AB of two 
matrices A and B may exist but their inverse product BA may not. In general, 


ABBA. 


e AB = 0 does not generally imply A = 0 or B= 0 or BA = O. 
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e AB = AC does not generally imply B = C. 


The identity matrix I commutes with any other square matrix: I-A = A-I. However, there may exist a matrix 
B #1 such that BA = A and AB = A for a particular matrix A. 


Example 6.2.6: For the 2 x 2 matrices below, we have 


Pra} [a V]-[foo} [rt] [o s}-[2 s]-2lt 3] 
fri] [a a}-[2 a]-2[3 T], 


The following properties of matrix multiplication hold: 


and 


(AB)=A-B, (AB)? =B7A‘, (AB)* =B*A*. 


Definition 6.7: A square n x n matrix A is called normal if it commutes with its adjoint: AA* = A*A. 


Definition 6.8: The trace of an n x n matrix A = [ ai; ], denoted by tr (A) or tr A, is the sum of its diagonal 
elements, that is, tr (A) = a1, + dog + +++ + nn. 


Theorem 6.1: Whenever a is a number (complex or real) and A and B are square matrices of the same 
dimensions, the following identities hold: 


e tr(A +B) = tr(A) +tr(B); 
e tr(aA) = atr (A); 
e tr(AB) = tr(BA). 


Example 6.2.7: In Maple, you may define a matrix in a couple of different ways. However, before using matrix 
commands one should invoke one of the two linear algebra packages (or both): LinearAlgebra (recommended) or 
linalg (an old version, but still in use). A colon (:) at the end of each command is used when the user does not 
want to see the results of its execution on the screen. To minimize the possibility of mixing data from different 
problems, start every problem with 
restart: with(LinearAlgebra): or/and with(linalg): 

There are a few examples of entering the matrix with the same output: 
with(LinearAlgebra) : 
A := Array(1..2, 1..3, [ [a,b,c], [d,e,f] ] ); 


or A := Matrix( 2, 3, [ [a,b,c],[d,e,f] ] ); or A := Matrix( 2, 3, [a, b, c, d, e, f ] ); 


which yields 
a be 
an [$26] 


Note that the linalg package uses similar commands but they all start with lower case letters, for example, matrix 
instead of Matrix. The identity 3 x 3 matrix can be defined with the command: 
M:=Matrix(3,3,shape=identity) : 
The entries of a matrix may be variables and not only numbers. 
with(LinearAlgebra) : 
A := <<1,5,w>|<2,6,x>1<3,7,y>|<4,8,z>>; 
or 
linalg [matrix] (2,3, [x,y,z,a,b,c]); 
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The entermatrix procedure prompts you for the values of matrix elements, one by one. 

To find a transpose or Hermitian to the matrix A, type in 

transpose (A) ; Transpose(A) or HermitianTranspose(A) 

The evalm function performs arithmetic on matrices: evalm(matrix expression); also, the dot as A.B or 
the command Multiply(A,B) is used within the LinearAlgebra package. When the package linalg is loaded, 
multiplication of two or more matrices can be obtained by executing the following command: multiply (A,B). Recall 
that the matrix product ABC of three matrices may be entered within the evalm command as A &* B &* C or as 
&* (A,B,C), the latter being more efficient. Automatic simplifications such as collecting constants and powers will 
be applied. Do not use the * to indicate purely matrix multiplication, as this will result in an error. The operands 
of &* must be matrices (or their names) with the exception of 0. Unevaluated matrix products are considered to be 
matrices. The operator &* has the same precedence as the * operator. So the LinearAlgebra package makes matrix 
operations more friendly, and its commands are similar to the ones of Mathematica (see Example 6.2.9). 

In Maple, the trace of a matrix A can be found as’ linalg[trace](A); or just trace(A); if either 
of the packages linalg or LinearAlgebra was previously loaded. Actually, Maple has a dedicated command 
MatrixFunction, which is a part of the LinearAlgebra package: 
with(LinearAlgebra) : 

A:=Matrix(2,2,[9,8,7,8]): 

LinearAlgebra[MatrixFunction] (A,cos(t*sqrt (lambda) ) , lambda) 
MatrixFunction(A, exp(t*lambda), lambda) 
MatrixExponential(A, t) 


Example 6.2.8: In MATLAB, a matrix A can be entered as follows 
A = [1,2,3,4;3,4,5,6;5,6,7,8] or A=[1234;3456;567 8) 


Then on the screen one can see 


12 3 4 
A=]3 4 5 6 
5 6 7 8 
1 
If you type A(:,1), then on the screen you will see the first column, namely, 3 . The command A(:,2) will invoke 
5 


the second column, and so on. If you want to see the first row of the matrix A, type A(1,:) to observe 
1234. 


To display the diagonal of the matrix A, type diag(A) or diag(A,0) to observe [1 ; 4 ; 7]. The next diagonals 
can be retrieved by 
diag(A,1) shows [2,5,8]; diag(A,-1) shows [8,6]. 


To find the transpose matrix of matrix A, just type A.’ (with dot) at the MATLAB prompt. To get the adjoint matrix 
A*, you need to type A’ (without dot); if the matrix A has only real entries, then A’ gives you its transposed matrix. 


Example 6.2.9: In Mathematica, a 2 x 3 matrix can be defined as follows: 
441,298) 4-2, 0581) (* or *) {{1,2,3},{-2,0,3}} // MatrixForm 


To multiply a matrix by a number, place the number in front of the matrix. A dot is used to define multiplication 
of matrices: (A.B). The operators Transpose[A] and ConjugateTranspose [A] are self-explanatory. 
In Mathematica, the output of IdentityMatrix[n] gives the n x n identity matrix. 


Example 6.2.10: In Maxima (or Sage), a matrix is defined as a collection of row vectors: 
A: matrix([1,2],[-3,1]); 

transpose(A); /* to find a transpose matrix */ 

A.A; /* or to square the matrix A */ Aw72; 

mattrace(A); /* trace of the matrix A */ 


362 Chapter 6. Introduction to Systems of ODEs 


6.3 Linear Systems of First Order ODEs 


A system of differential equations is a set of equations involving more than one unknown function and their deriva- 
tives. The order of a system of differential equations is the highest derivative that occurs in the system. In this 
section, we consider only first order systems of differential equations. As we saw in §6.1, certain problems lead 
naturally to systems of nonlinear differential equations in normal form: 


da,/dt = gi(t,21,%2,..-,2n), 


day/dt = go(t,21,%2,.--,2n), 6.3.1) 


da,/dt = gn(t,21,22,---,2n), 


where gz(t,21,22,---,2n), kK =1,2,...n, is a given function of n+ 1 variables. Instead of dx/dt we will use either 
of shorter notations x’ or the more customary notation « to denote a derivative of x(t) with respect to variable t 
associated with time. Note that we consider only the case when the number of equations is equal to the number of 
unknown variables, which is called the dimension of the system. A system of dimension 2 is called a planar system. 
If the right-hand side functions g), g2, .--,; Jn do not depend on t, then the corresponding system of equations is 
called autonomous. 

When we say that we are looking for or have a solution of a system of first order linear differential equations, 
we mean a set of n continuously differentiable functions x1(t), xvo(t), ..., @n(t) that satisfy the system on some 
interval. In addition to the system, there may also be assigned initial conditions: 


i(to) = 210, e(to) = 220, .--,  tn(to) =Xno, 


where to is a specified value of t and x19, %29, .--, Zno are prescribed constants. The problem of finding a solution to 
a linear system of differential equations that satisfies the given initial conditions is called an initial value problem 
or a Cauchy problem. 


When the functions g,(t,71,%2,...,%n), k = 1,2,...n, in Eq. (6.3.1) are linear functions with respect to n 
dependent variables x1, x2, ..., YZ, we obtain the general system of first order linear differential equations in normal 
form: 

&1(t) = pir 21(t) + pio Fa(t) +--+ + pin en(t) + fild), 
&o(t) = por £1(t) + poo £2(t) + +++ + pan tn(t) + fo(t), (6.3.2) 


En(t) = Pni xy(t) + Pn2 x(t) ates + Pnn L(t) ae frit). 


In this system of equations (6.3.2), the n? coefficients p1i(t), ..., Pnn(t) and the n functions fi(t) ,..., fn(t) are 
assumed to be known. If the coefficients p;; are constants, then we have a constant coefficient system of differential 
equations. Otherwise, we have a linear system of differential equations with variable coefficients. The system is said 
to be homogeneous or undriven if f(t) = fo(t) =... = fn(t) = 0. If at least one of the components of the vector 
function f(t) = (fi(t),..., fr(t))” is not identically zero, the system is called nonhomogeneous or driven and the 
vector function f(t) is referred to as a nonhomogeneous/driving term or forcing function or input. 


Example 6.3.1: The planar linear nonhomogeneous system of equations 


Ly =%1+ 222 + sin(¢), 
b2 = 34a, +422 +t cos(t) 


is in normal form. However, the system 


a +4¢@9 = 27, 4+22%9+ sin(t), 
241 — &o = 241 +422 +t cos(t) 


is not in normal form. The system 
Ly = 122, fg = 2%, + @2 


is an example of a nonlinear system. 
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We can rewrite the system (6.3.2) much more elegantly, in vector form. Let x(t) and f(t) be n-dimensional 
vectors, and P(t) denote the following square matrix: 


x(t) pi(t) pio(t) .-. pin(t) Si(t) 
= 7  PW= pail?) Paalt) “ Pan(t) = ae 
En(t) Pritt) Pralt) <<. Pant) Fn(t) 


a(t) pi(t) pi2(t) ... pin(t) a(t) fi(t) 
d | a(t) pai(t) poo(t) ... pon(t) £(t) fa(t) 
Gla |) fF F % 3 2 YT]: 
Lp(t) Drill) Pna(t) ... Pan(t) En(t) In(t) 
or simply 
x(t) = P(t)x(t) + f(t). (6.3.3) 


If the system is homogeneous, its vector form becomes 
x(t) = P(t)x(t). (6.3.4) 


It is often a useful trade-off to replace a differential equation of order higher than one by a first order system at 
the expense of increasing the number of unknown functions. Such a transition to a system of differential equations 
in normal form (6.3.1) is important for numerical calculations as well as for geometrical interpretations. Any n-th 
order linear differential equation is equivalent to the system (6.3.3), meaning that their solutions are the same. If 
such an n-th order equation is given in normal form, 


yo) = any) + an_ay +++ tary’ +a0y + G(t), (6.3.5) 


(n-1) 


then introducing n variables regarding y, y’, ..., y and renaming them 


er(t)= y(t), #2(t)= yO), w(t) Sy"), otal) Sy" VO), 


0 1 0 0 ry 0 
dx 0 oO 1 0 Xr 0 
—=Px+f, P= ,x=|.|/, f=]. |. (6.3.6) 
dt : 

ag a, a2 eee An-1 In g(t) 


Note that if the linear operator associated with the given n-th order equation, L[D] = D” — a@n_1D"~! — an_2D"~? — 
a,D — ag, where D is the derivative operator, has constant coefficients, then the determinant of the corresponding 
matrix AI — P is 

det(Al =P) =A" = 9,230" = god SA = ap: 


This means that the characteristic polynomial of the operator L[D] coincides with the characteristic polynomial of 
the constant matrix P. (The definition of a determinant is given in §7.2, page 380.) The general case is considered 
in the following statement. 


Theorem 6.2: Any ordinary differential equation or a system of ordinary differential equations in normal form 
can be expressed as a system of first order differential equations. 


Any n-th order differential equation, linear or nonlinear, can be written in the form 


y= FG UY piney), (6.3.7) 
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where derivatives are with respect to t. If we now rename the derivatives 
r=, way Si, Wey Hatha a, 
then Eq. (6.3.7) can be rewritten as 


SU ig: ice fn 
oe ae dt 


= F(t, wi, U2, .--, Un), 
which is a system of n first order differential equations. A similar proof is valid for any system of ordinary differential 
equations. 


Example 6.3.2: Suppose that a mechanical system is modeled by the forced damped harmonic oscillator 


H(t) — 2B y(t) + w°y(t) = fd), 


where 3, w are positive constants and f(t) is a given function. In order to convert this second order differential 
equation to a vector differential equation in standard form, we introduce a vector variable whose components are y 


and y. In particular, 
x(t) = bee = | or x! = (w1, a2)" = (y,y). 


a : : 
Then its derivative is 7 — | j | = | e I From the oscillator equation, we get ¥ to be y¥ = f(t) +26y—w?y = 
2 


f +2822 —w?2x,. Now we can transform the given second order equation into standard normal form: 
y r2 


dx da] dy} _ _ 
ee at || pe ee, | 


0 1 0 0 1 neal 0 
= x(t) + = + . 
[te oe] *0+| sty [=| ce oe LE] +L so 
1. Reduce the given differential equations to first order systems of equations. If the equation is linear, identify the matrix 
P(t) and the driving term f (if any) in Eq. (6.3.3) on page 363. 


(a) ty" 4+3ty t+y=t; (b) ?t?y"” —6ty’ +sin2ty = Int; 
(c) y"4+3y'+y/t=t; (d) yy’ +ty’ —y Int = cos2t; 
(e) By” — 2ty’ +y=t'; (f) yn + 3ty” +3? y +y=7; 
(g) y” =fly)+9(6); (h) By” +3ty’ +y=sint; 


(i) Py’ +5t7*y’-2ysint=t; (ji) y(t) Py’) 4 2y(t) = sine. 
2. In each problem, reduce the given linear differential equation with constant coefficients to the vector form, x = Ax+f, 
and identify the matrix A and the forcing vector f. 
(a) y+ 2y’+y=1; (b)  y” — 2y' + Sy =e’; 
(c) y! —3y'- Ty =4; (d) oy!" + 8y" + 3y' +y = 5; 
(e) 3y”+5y’ —2y= 307; (f) yo” = 2y” — 4y' +sint. 
3. Rewrite the system of differential equations in a matrix form x = Ax by identifying the vector x and the square matrix 
A. 


(a) t=x-2y, y=3x—4y. (b) é1 =$2,+ 322, é2 = 4201-322. 
(c) 2 —x+2y=0, y’ty—-2=0. 

(d) « +5a”—2y=0, y'+2x-—y=0. 

(e) 2’ —3r+2y=0, y’—2+3y=0. 

(f) 2 +x2-z2=0, y-ytar=0, 2 +242y—3z=0. 

(g) e =-0.52+2y—-32z, y=y-052, 2 =-2x+2z. 


4. Verify that the system 
& = —ay — (1—a)sint, y = ax —a°t+(1—a) cost 


has a solution of the form x = at+cost+ccosat, y = sint —1+c sinat for some constant c. 


5. Show that the system « = ty, y = —tz has circular solutions of radius r > 0. Hint: Show that «z+ yy = 0. 
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6.4 Reduction to a Single ODE 


The Gauss elimination method for solving systems of algebraic equations can be adapted to systems of linear 
differential equations, not necessarily in normal form. In many cases, it is possible to eliminate all but one dependent 
variable in succession until there remains only a single differential equation containing only one dependent variable. 
When this single differential equation can be solved, other dependent variables can be found in turn, using the 
original system of equations. Such a procedure, called the method of elimination, provides an effective tool for 
solving systems of differential equations. The solution obtained may contain the sufficient number of constants of 
integration to identify it as the general solution. However, the eliminating procedure may not lead to an equivalent 
single differential equation, and some solutions could be missing. We shall illustrate the elimination method in 
examples, as it will enable us to anticipate the form of the solution. 


Example 6.4.1: Suppose that there are two large interconnected tanks feeding each other; one of them we call tank 
A and the other one tank B. Suppose that initially tank A holds 60 liters of a brine solution, and tank B contains 
40 liters of the same solution. Fresh water flows into tank A at a rate of 2 liters per minute (1/min), and fluid is 
drained out of tank B at the same rate. Also, 11/min of fluid are pumped from tank B to tank A, and 21/min 
from tank A to tank B. The liquids inside each tank are kept well stirred so that each mixture is homogeneous. If, 
initially, the brine solution in tank A contains xo kg of salt and that in tank B contains yo kg of salt, determine the 
amount of salt in each tank at time t > 0. 

Solution. Note that the total volume of liquid remains constant at 100 liters because of the balance between 
inflow and outflow volume rates. However, the volume of liquid in tank B increases at a rate of 1 liter per minute, 
while the volume of liquid in tank A decreases at the same rate. Therefore, the volume of liquid in tank A at time 
t is 60 — t, and the volume of liquid in tank B becomes 40+ ¢. Let x(t) denote the amount of salt in tank A at time 
t and y(t) in tank B. 

To formulate the equations for this system, we equate the rate of change of salt in each tank with the net rate at 
which salt is transferred to that tank. The salt concentration in tank A is «(t)/(60 — t)kg/1, so the salt is carried 
out of tank A at a rate of 27/(60 — t)kg/min. Similarly, the salt in tank B is transferred to tank A in the amount 
of y/(40 +t) kg/min. The fresh water has no salt and its input just maintains the total volume. Since the difference 
between the input rate and output rate gives the net rate of change, we get the following system of equations: 


a2 p de yt 
Em ES AO — Bae 
. def dy 22 3y 


dt 60-t 40+¢t’ 

where D is the derivative operator. The system of linear differential equations obtained has variable coefficients. If 
the rate of exchange between the two tanks remains the same, say at 2 liters per minute, we get instead a constant 
coefficient system of differential equations: 


1 1 
ee 2e Ye (+5) 0+ g)e= = 
40 60 20 30° _, 10 30 600 
pee eee ge ee 
Y= 60 40 30 10’ wA0p N30 = 600" 
Example 6.4.2: Solve the following system of differential equations by reducing it to a single second order equation. 

x’ = 3y, 
y! = —a — Ay. 


Solution. Differentiating the second equation yields 


Mu / 


y" =a! —4y =-3y-—4y0 =o" +: 4y' + 38y = 0. 


This homogeneous linear differential equation in y has the general solution y = c,e~' + c2e~*. To find z(t), 
we substitute y(t) into x’ = 3y to obtain the equation 2’ = 3c;e~* + 3cge~**, which can be easily integrated: 
x(t) = —3c,e~* — cge~** +c, where c1,c2, and ¢ are arbitrary constants. Substituting x back into the equation 
y’ = —x — Ay gives the condition c = 0. Therefore, the given system of equations has the general solution 


v= —3c,e'—c ae y=C1 e '+c9 eres 


with two arbitrary constants c, and c2. 
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It is convenient to introduce the derivative operator, commonly denoted by D. Therefore, Dy means y’ or y, the 
derivative of the function y. The composition of D with itself, when it operates on functions, is naturally denoted 
by D?, which gives the second derivative when it is applied to a function. This allows us to define a polynomial in 
variable D. For example, (D? + 2D —1)y= 4 +2y—y. Note that we consider only constant coefficient polynomials. 


Example 6.4.3: Solve the system of differential equations 


/ 
Ly = «+ 2%0, 


I 


x 224 = 2x2, 


subject to the initial conditions 


Solution. Using the notation D = d/dt for the derivative operator, we rewrite the given system in the form 
& g y' 


(D—1)a1 = 2x22 =0, 
—24, + (D+ 2) a2 =0. 


To eliminate x, from these equations, we multiply the first equation (D — 1) a1 — 2x2 = 0 by 2, which leads to 
2(D — 1) a, —4a2 =0. 


Then we operate on the equation —22, + (D+ 2) #2 = 0 with D— 1 to obtain 


2(D — 1) 21 + (D— 1)(D + 2) a2 = 0. 


Adding these equation eliminates x; and yields 


4x9 +(D—1)(D+2)a2=0 or (D?+D—6)22=0. 


This constant coefficient differential equation has the characteristic polynomial .(A) = A? + A — 6 = (A +3)(A — 2) 
with roots A; = —3 and Az = 2. Therefore, its general solution is 


xo(t) = c1e** +. co e* 

for some arbitrary constants c; and cg. The initial condition x2(0) = 0 is not enough to determine these two arbitrary 
constants—there remains a relation c; = —c2 between constants. From the equation —2 21+ (D+2) x2 = 0, it follows 
that 


| 


(D+ 2) a2 = (D+2) [ey e+e e**] 


= D [er eer. C2 e”*|] +2 [er é ea c2 e**| 


224 


or 
22, =-3c, por we 2c2 eo 2c1 BoP hep 202 ew! = —C) mae 4 co et. 


Hence, 


1 
x(t) = oa e+ 2 ee, 
Using the given initial conditions, we obtain the system of algebraic equations 
1 
=eeut ta; cy tcp = 0. 


Consequently, cj = —2 and cy = 2. Therefore, 


gy=e tt 4+4e%, gy =—-2e7%* 4+ Qe". 


6.4. Reduction to a Single ODE 367 


Example 6.4.4: Let us consider the problem of finding solutions x(t) and y(t) for the system of differential equations 
of second order 

ve’ +3a-y' =t, 

8a +y"—3y=3. 


Since we don’t know any method of solving a system of differential equations, it is reasonable to attack this system 
by eliminating one dependent variable in order to reduce it to a single equation in the other dependent variable. 
We denote by D the operator of differentiation, that is, Du(t) = du/dt = u(t). This operator assigns the derivative 
to every smooth function u(t). With this in hand, we rewrite the given system of differential equations in operator 
form: 


(D? + 3)x — Dy =t, 
8Dx + (D? — 3)y = 3. 


To eliminate the dependent variable 2, we apply 8D to the first equation, D? + 3 to the second equation, and then 
subtract the results to obtain 


[—8D? — (D? — 3)(D? + 3)] y = 8Dt — (D? + 3)3 
7 [—p* — 8D? + 9]y(t) = -1 = [D* + 8D? — 9Jy(t) = 1 


because D?t = 0, Dt = 1, and D3 = 0. A particular solution can be found using the method of undetermined 
coefficients. In our case, we choose a constant function, y = a. Evaluation of [D+ + 8D? — 9](a) = 1 yields a = —1/9. 

The general solution of this equation is a sum of the general solution of the corresponding homogeneous equation 
and a particular solution of the nonhomogeneous equation. Since D4 + 8D? — 9 = (D? — 1)(D? + 9), we have 


1 
y(t) = ~9 +a,e' +a,e~* +43 sin 3t + a4 cos3t, 


for some constants a1, a2, a3, and a4. The function y(t) may be eliminated in a similar manner, which leads to the 
equation 
[(D* + 3)(D? — 3) + 8D?] a(t) = (D? — 3) + D3 
or 
[p* + 8D? — 9] a(¢) = —3¢. 


Therefore its general solution is 
t 
x(t) = = +b, e' +b, e~* + bg sin 3t + b4 cos 3t. 


The constants a; and b; i = 1,2,3,4, in these expressions for x(t) and y(t) should be chosen to satisfy the given 
system of differential equations. We substitute these functions into the corresponding homogeneous system to obtain 


(4a, — b,)e* + (4az + b2)e~* — 3(2a3 — b4) sin 3t — 3(2a4 + b3) cos 3t = 0. 
The linearly independence of the functions e’ ,e~* , sin 3t, and cos 3t requires that 
4a, —b, =0, 4ag+b2=0, 2a3— 64, 2a4+63 = 0. 


This allows us to express the b’s in terms of the a’s and create a general solution that contains four arbitrary 
constants: 


t 
z(t) = 3 t 4a e' — dage~' — 2a3 sin 3t + 2a3 cos 3t, 
1 
y(t) = 9 ta,e’ + age’ + a3 sin3t + a4 cos3t. 


We can reduce the given system of equations to an equivalent normal system of differential equations by setting 
p=w andq=y’. Then we have 


x(t) 0 0 01 0 

dx(t) _ ly(t) _ 10 _ {0 00 1 
rF =Ax+f, wherex= p(t) |? f= ae A= ee 
q(t) 3 0 3 8 0 
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Polynomial Equations 


The above elimination procedure works for any linear system of differential equations with constant coefficients 
regardless of the order of equations and the number of unknowns. We demonstrate this approach in the case of two 


equations with two unknown variables (which we usually denote by 21 and x2 or x and y) of arbitrary order. 
Let D = d/dt be the operator of differentiation and L(\) be a constant coefficient polynomial in \. Substituting 
D instead of A, we obtain a differential operator L[D] or simply L. This operator is a linear differential operator, 


meaning that the following properties hold: 
e LD\(a+y) = L[Dja+L[D]y for any two smooth functions x(t) and y(t). 
e L[D\(axz) =aL[Djxz for any constant a anda smooth function z(t). 


Let £1(A), Le(A), L3(A), and L4(A) be polynomials with constant coefficients. With these four polynomials, we 
consider the following system of two equations with two unknowns: 


se + Lo[DJa2 = fi(t), (6.4.1) 


L3[D\a1 + La[D]x2 = fo(t), 


where f(t) and f(t) are given functions and the variables x(t) and x2(t) are to be determined. Since these four 
operators L[D], L2[D], L3[D], and L4[D] are constant coefficient linear differential operators, they commute (i.e., 
[1 L2 = L2L,) and we can eliminate one of the two variables 2; or 22. To do this, we multiply the first equation by 
L4[D] and the second equation by L2[D] to obtain 


Lgl x, + Lalore = La fi, LoL3x1 + Lolyxe = Lo fo. 


Here we dropped explicit dependence on the derivative operator D and write L instead of L[D]. Subtracting these 
results, we get the equation for one unknown variable: LyL4x, — Lol30, = (I1L4 — Lel3)a, = La fy — Lo fo. 
Similarly, we obtain the equation for x2, yielding two separate differential equations: 


(£1 L4 — LoL3) x1 = Lafi — Lofe, (L3L2 — L1L4) x2 = L3 fi — Life. 


Obviously, these two single differential equations may have a solution only when L,L4 — L2L3 4 0. This forces us 
to introduce another definition. 


Definition 6.9: The operator 


D| 
4 [D] 


) = [,L4— LeL3 


is called the Wronskian determinant or operational determinant of differential operators Ly, [L2, L3, L4. If 
the Wronskian is identically zero, then the system (6.4.1) is said to be degenerate. 


Since a degenerate system may have either no solution or infinitely many independent solutions, we assume that 
L4L, — L2L3 #0. The number of arbitrary constants in the general solution of the system (6.4.1) is equal to the 
degree of its Wronskian as a polynomial in D. 


Example 6.4.5: Let DL) =D—1= Le, L3 =D+1= Lu, and we consider the degenerate system of the following 
equations: 
(D—1)z+ (D—1)y=e', (D+1)a+(D+1l)y=e"™. 


After multiplying the first equation by D+ 1 and the second equation by D — 1, we find their difference 
0 = 2e’ + 37% or 2%? = -3. 


Since the latter equation has no solutions (the exponential function cannot be negative), we claim that the given 
system of equations has no solution. 
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Example 6.4.6: The two-loop circuit considered in Example 6.1.2, page 348, leads to the following system of 
equations: 
q+ hh -Rh=V(b), 
Ri +(LD4+ Ri + Ro) =0, 
Dqg=h, 


where D = d/d¢t is the derivative operator. Eliminating [,, we get the system with two equations: 


(RiD+C) q— Rolo = V(t), 
—RiDq+(LD4+ Ri + Ro) Io =0. 


The above system of equations is of the form (6.4.1) with Ly(\A) = RiA+ C71, Le(A) = —Re, L3(A) = —R1A, and 
L(A) =LIX + Ry + Ro. Since 


Ly La — Ly L3 = RL? + X(LC71 + R7) + C7 (Ri + Ra), 


the system of differential equations with two unknowns q(t) and I2(t) is not degenerate. 


Example 6.4.7: The degenerate system 
Daz — 2Dy =e, 3Dx2 — 6Dy = 3e* 


has a solution for any choice of y(t) because one of them is a multiple of the other. Actually, this system is equivalent 
to the single equation Dx = 2Dy(t) + e~*, which has a solution x(t) = [[2y’ + e~'] dt = 2y(t) —e~' +e. 


Sometimes a system of n differential equations of the first order can be reduced to a single differential equation 
of an order less than n. 


Example 6.4.8: Reduce the system of differential equations 


to a single equation and solve it. 
Solution. Differentiating the first equation, we obtain 


ti = 92+ 93 = y(t) + ys(t) + y(t) + yo(t) = 2m (t) + mE) 
since y2 + y3 = ¥1. Hence, we get a second order equation 
Yi — Yi — 2y1 = 0 


for yi(t), which has the general solution y; = Ae~' + Be”, containing two arbitrary constants A and B. Similarly, 
we can obtain second order differential equations for y2(t) and y3(t). However, the general solution of the given 
system of equations contains three arbitrary constants. For example, y; = (e?* +2 e—*) cy + te" _ e*) (co + €3). 
Constants cz and c3 are arbitrary, but we cannot use their sum cp + c3 as a new arbitrary constant because other 
functions y2, y3 utilize these constants differently. For instance, yz = (e7 — e~‘) (c1 + ¢3) + (e + 2e) ca. 


Problems| In all problems, D stands for the derivative operator, while D°, the identity operator, is omitted. The 


derivatives with respect to t are denoted by dots. 


1. Classify each of the systems as linear or nonlinear. 
(a) #+y=sint, te+2y=—ae+y. (b) #+y=t, tytae=—Ph. 
(c) @=t+er?ty, y=P+aety. (d) t= 4+y, g=sint+e. 
(ec) ¢=a?+y2, pay?-a?. (f) g=2+y, y=y—2n. 


2. Let L =tD? + 2D, where D = d/dt is the derivative with respect to t. Find 


(a) L(1), (b) L(t), (©) Lit’), (d) Le"). 
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3. Let L =t?D+2D?, where D = d/dt is the derivative with respect to t. Find 
(a) L(1), (b) Li), (©) Lt), (a) Le"). 


4. Reduce each system to normal form with a single first derivative. 


(a) t+y=y, t&-y=a. (b) €4+2y=t, t-y=rty. 
(c) ¢-y=a+y-t, 2443¢=2¢+6. 


(d) 24-y=t, 34+2y=y. (e) 54-3y=x+y, 34-yH=t. 
(f) t-4y=0, 24-3y=t+y. (g) 34+2y=sint, t-2y=cx+tt+y. 


5. Use elimination by operator multiplication to get rid of one of the dependent variables in the following systems of 
differential equations. 


(D — 4)a + 3y =t, (D?++5D+6)¢+D(D+ 1)y =, 
(2) —6x + (D+ 7)y = 0; (b) (D+ 1)a+ (D+ 2)y =0; 

(D + 2)a + Dy = sint, 4 (D? + 2D+1)a + (D—1)y =0, 
()  ) p_3)e + (D—2)y = 0: (4) ) (o42)0— (0? +404 3 =t; 

(D? — 4)x — 90y = 4t, (D? + a+ D?y =2e-", 
fey a re (f) 2 =; 

e+ (D+2)y=1,; (D* — 1)a + Dy =0; 

(DP? = De+ (D+ 1)y =, (D? + 1) + (D? + 2)y = 2e~*, 
(©) 4 D_t)e ty =e: (h) 9 (D2 — 1x + D2y = 0; 


(i) D? — 3D—2)a+ (D? —D—2)y =0, 
"  ) (2p? — 9p —5)a + (3D? — 2D — Dy =0. 


6. Show that each of the following systems has no solution. 


(a) (D — 2)a1 + 2Dax2 = t, (b) (D — 2)a1 + 2Dr2 = t,— 
(2D —4)ay + 4Dar2 = t; (D? — 2D)a1 + 2D? a2 = 0; 

(c) Dai + (D—5)a2 =e’, (d) (D—1)ai + (D+1)a2 =e", 
2Dax1 + (2D — 10)x2 = e'; (2D — 2)a1 + (2D4 2)ao =2?. 


7. Find the general solution of the given system of differential equations by transforming it into a single equation for one 
unknown variable. 


u’ =4u—v, g£t+2—y=t, xv’ =4r 4+ 2y, 
(a) yo, (c) wt (e) 4, 
v = —4u+ 4v; e+y+3y =sint; yS=—E+y; 
w =w-y-Zz, 
(b) <4’ =y4+3z, & = 3x + 4y, x’ = 2x — 3y, 
; cs) eee 0 
z= 3yt2; y = —2x — 3y; y =x—2y. 


8. Solve the system of ordinary differential equations, where D denotes the derivative. 


(a) : —1)t14+ 22 5 0, _ (b) (D-1)a+(D ae = 0, 
(D— 1)a1 + (D+ 1)a2 = 0; Da + (D — 1)y = 0; 
(c) (2D+ 8)x 4 e Uy =0, (a) (D* — 4)a — 8y = s 
(D+ 9)x + Dy = 9; (D — 2)a — (D+ 2)y =1. 


9. Solve the initial value problem 
(D—4)zr+(D—1)y=3e’, (4—D)a+(D+2)y=e’, x(0) = y(0) =1. 


10. Transfer the following systems of equations (don’t solve them!) into equivalent systems of first order differential 
equations in normal form: 


(a) (D—2)a+Dy=0, (D?-2WD)r+y=4; (b) Dx—(D?-1)y=0, #+(D+1)y=sint. 
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6.5 Existence and Uniqueness 


Since higher order not degenerate systems of differential equations can be converted into equivalent first order 
systems, we do not lose any generality by restricting our attention to the first order case throughout. “Equivalent” 
means that each solution to the higher order equation uniquely corresponds to a solution to the first order system 
and vice versa. Recall that |a, b| denotes any interval (open, closed, or semi-closed) with endpoints a, b. 

A first order system of ordinary differential equations has the general form 


duj du 


J (8 Ui nas pote) ya = fi(t,u1,..., Un). 6.5.1 
ae Fits May. Un) qe Silt n) (6.5.1) 
The unknowns u(t), ..., Un(t) are scalar functions of a real variable ¢t, which usually represents time. The right- 
hand side functions f;(t,u1,...,Un), ---, fn(t, u1,---,;Un) are given functions of n + 1 variables. It is customary to 
denote the derivative with respect to the time variable by a dot, that is, au = wu. By introducing column vectors 
u = (uz, U2,..-,Un)", f = (fi, fo,-.-, fn)", we rewrite Eq. (6.5.1) in a vector form: 
du 
a f(t, u) or u = f(t, u). (6.5.2) 


Definition 6.10: A solution to a system of differential equations (6.5.1) on an interval |a, b| is a vector function 
u(t) with n components that are continuously differentiable on the interval |a, 6]; moreover, u(t) satisfies the given 


vector equation on its interval of definition. Each solution u(t) serves to parameterize a curve in n-dimensional 
space, also known as a trajectory, streamline, or orbit of the system. 


When we seek a particular solution that starts at the specified point, we impose the initial conditions 
ur (to) = U10; u2(to) = U20; dnard ,Un(to) = Un0 or u(to) = Uo. (6.5.3) 


Here tg is a prescribed initial time, while the column vector ug = (u10, U20,--., Uno)” fixes the initial position of the 
desired solution. In favorable situations, to be formulated shortly, the initial conditions serve to uniquely specify a 
solution to the differential system of equations—at least for nearby times. A system of equations (6.5.2) together 
with the initial conditions (6.5.3) form the initial value problem or the Cauchy problem. 


Definition 6.11: A system of differential equations is called autonomous if the right-hand side does not 
explicitly depend upon the time t and so takes the form 


du 


= = f(u) a = f(u). (6.5.4) 


One important class of autonomous first order systems is steady state fluid flows, where v = f(u) represents the 
fluid velocity at the position u. The solution u(t) to the autonomous equation (6.5.4) subject to the initial condition 
u(to) = a describes the motion of a fluid particle that starts at position a at time to. The vector differential equation 
(6.5.4) tells us that the fluid velocity at each point on the particle’s trajectory matches the prescribed vector field 
generated by f. 


Theorem 6.3: Let f(¢,u) be a continuous function. Then the initial value problem 
u=f(t,u), u(to) = uo (6.5.5) 


admits a solution u(t) that is, at least, defined for nearby times, i.e., when |t — to| < 6 for some positive 6. 


Theorem 6.3 guarantees that the solution to the initial value problem exists in some neighborhood of the initial 
position. However, the interval of existence of the solution might be much larger. It is called the validity interval, 
and it is barred by singularities that the solution may have. So the interval of existence can be unbounded, possibly 
infinite, —co < t < oo. Note that the existence theorem 6.3 can be readily adapted to any higher order system of 
ordinary differential equations by introducing additional variables and converting it into an equivalent first order 
system. The next statement is simply a reformulation of Picard’s existence-uniqueness theorem 1.3 (page 23) in the 
vector case. 
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Theorem 6.4: Let f(t,u) be a vector-valued function with n components. If f is continuous in some domain 
and satisfies the Lipschitz condition 


| £(¢, u1) — f(t, u2) || < L- || xi — xe I, L is a positive constant, 
then the initial value problem 
u=f(t,u), u(to) = uo (6.5.6) 


has a unique solution on some open interval containing the point to. Here ||x|| = \/2?7 +2}+---+ 22 is the 
norm (length) of the column vector x = (#1, %2,...,2n)?. 


It turns out that with the aid of vector notation, the proof of this theorem can be established using 
Picard’s iteration method. Rewriting the given initial value problem in an equivalent integral form 


u(t) = uo +f f(s, u(s)) ds, (6.5.7) 


to 


we find the required solution as the limit of the sequence of functions 


u(t) = lim @,(t), 


n—- co 


where the sequence {@,,(t)}n>o is defined recursively by 


t 
do(t) = Uo, on (E) = uo +f f(s, Pn—1(8)) ds, n= 1, 2, 3, see (6.5.8) 


to 


Theorem 6.5: If f(t, u) is a holomorphic function (that can be represented by a convergent power series), then 
all solutions u(t) of Eq. (6.5.2) are holomorphic. 


Corollary 6.1: If each of the components f(t, u), ..., fn(t, u) of the vector function f(t, u) = (fi, fo,..., fn)” 
and the partial derivatives Of; /Ou1, ..., Ofi1/Oun, Ofe/Ou1, ..., Ofn/OUn are continuous in an (n + 1) re- 
gion containing the initial point (6.5.3), then the initial value problem (6.5.6) has a unique solution in some 
neighborhood of the initial point. OH 


As a first consequence, we find that the solutions of the autonomous system (6.5.4) are uniquely determined 
by its initial data. So the solution trajectories do not vary over time: the functions u(t) and u(t — a) parametrize 
the same curve in the n-dimensional space. All solutions passing through the point ug follow the same trajectory, 
irrespective of the time they arrive there; therefore, orbits cannot touch or cross each other. For a linear system of 
differential equations we have a stronger result. 


Theorem 6.6: Let the n x n matrix-valued function P(t) and the vector-valued function g(t) be continuous on 
the (bounded or unbounded) open interval (a,b) containing the point to. Then the initial value problem 


x= P(t)x+g(t), x(to)=xo0, to € (a,b), 


has a continuous vector-valued solution x(t) on the interval (a, b). 
Example 6.5.1: Consider the following initial value problem: 
Since the corresponding system is autonomous, all conditions of Theorem 6.4 are satisfied. We denote its unique 


solution by «(t) = cq(t) and y(t) = sq(t), called the cosquine and squine functions, respectively. It is not hard to 
verify that cq*(t) + sq*(t) = 1 (for all t). 
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Example 6.5.2: (Jacobi Elliptic Functions) Consider the initial value problem 
£=y2, y= —xcz, z= kay, x(0) = 0, y(0) =1, 2(0) = 1, 


where dots stand for derivatives with respect to t, and k denotes a positive constant < 1. The parameter & is known 
as the modulus; its complementary modulus is k = /1 — k?. Using recurrence (6.5.8), which takes the form 


t 
Pee ee | Orton. 
t 
y(e+1)(t) =1 -f Lz) (S) 2(n) (8) ds, 
0 


t 
Z(k+1)(t) = 1 — e | (k)(S) yxy (s) ds, 


we find some first approximations: 


2 5 
tay =t, ta) =t, aig) =t—(1+k?) $+ 6k? 5, 
t? t? 2 t* 
ya) =1, Y(2) =O Oe Y(3) mice Sc a 
za) =1; za) =1—k? 5; zg) =1—k? 5 + 3k? GF. 


If we proceed successively, we find that the coefficients of the various powers of t ultimately become stable, i.e., 
they remain unchanged in successive iterations. This leads to representation of solutions as convergent power series, 
which are usually denoted by the symbols snt (or sn (t,k) or sn(t,m) for m = k?), ent, and dnt, and are called sine 
amplitude, cosine amplitude, and delta amplitude, respectively. There are some famous relations: cn?t + sn?t = 1 
and dn? = 1 — k?sn?t. These functions were introduced by Carl Jacobi®” in 1829. 


Example 6.5.3: Consider the initial value problem for the first Painlevé®* equation: 
g=6y+t — y(0)=0, y(0) =1/2. 


First, we convert the given Painlevé equation to the system of first order equations and write corresponding Picard’s 
iterations: 


d |y v : 1+?? a 
a \o| = ley2te)> =" ven = . ve(s)ds, vn41(t) = —>— +6 : yi,(s) ds, 
k =0,1,2,.... Integrating, we find a few first Picard approximations: 
a: ae er ares 
Y= >> y= 5 6” Y= 5 6 8° 


1. For what values of parameter p does the following initial value problem have a unique solution? 
#(t)=-y?"@®), gd)=27"@), = -x(0) = 1, (0) = 1. 


2. Verify that the following two initial value problems define trigonometric functions. What are they? 


&(t)=—y(t), y(t) = a(t), u(t) =v°(t), ot) = u(t) v(t), 
x(0)=1, y(0)=0; u(0)=0, v(0) =1. 


3. Verify that Jacobi elliptic functions x = sn (t,k), y = cn(t,k), and z = dn(t,k) satisfy the first order equations 
@ = (1-27) (1-k'2?), P=(l—-y’) (Kh +h y’), 2 = (1-2) (2? #7). 


4, For the following second order differential equations subject to the initial conditions y(0) = 1, y’(0) = 0, find the first 
four Picard’s iterations. 
(a) y= 2ry'—10y; (b)  y” +ay! =27y; 
(c) y=30% yy (d) yy” = (1+ 2°) y' — Gry. 


67Carl Gustav Jakob Jacobi (1804-1851) was a German mathematician who made fundamental contributions to elliptic functions, 
dynamics, differential equations, and number theory. Jacobi was the first Jewish mathematician to be appointed professor at a German 
university. 

58Paul Painlevé (1863-1933) was a French mathematician and politician. He served twice as Prime Minister of the Third French 
Republic. 
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Review Questions 


Summary for Chapter 6 


1. 


A matrix is a rectangular array of objects or entries, written in rows and columns. The numbers of rows and columns 
of a matrix are called its dimensions. 


The matrix which is obtained from m x n matrix A = [a,j] by interchanging rows and columns is called the transpose 
of A and is usually denoted by A” or A‘ or even A’. Thus A? = [a;;]. A matrix is called symmetric if A7 = A; 
that is, iz = Aji- 

The complex conjugate of the matrix A = [a;;], denoted by A, is the matrix obtained from A by replacing each element 
aij by its conjugate ai;. The adjoint of the m x n matrix A is the transpose of its conjugate matrix and is denoted by 
A*, that is, A* = A’. A matrix is called self-adjoint or Hermitian if A* = A, that is, Gj = aj:. 


. The trace of an n x n matrix A = [a;;], denoted by tr(A) or trA, is the sum of its diagonal elements, that is, 


tr (A) = a11 + a22 + +++ + ann. 


The first order system of linear differential equations in normal form: 
—=Ax-f or x=Ax+f. 


The number of equations is called the dimension of the system. The system is said to be homogeneous or undriven 
if fi(t) = fe(t) =... = fr(t) = 0. Otherwise the system is nonhomogeneous or driven and the vector function f(t) 
is referred to as a nonhomogeneous term or forcing function or input. 


A second or higher order differential equation can be reduced to the equivalent system of equations in normal form. 


The system 


Ly[D] a1 + Le[D] x2 = filt), 
L3[D] a1 + La[D] v2 = fa(t), 
where D = d/dt and L1(A), Le2(A), L3(A), La(A) are polynomials with constant coefficients, can be reduced to an 


equation with one unknown variable. We assume that Lal; — L2Ll3 #0, otherwise the method fails and the system is 
said to be degenerate. The operator W = Lal, — L2Lz3 is called the Wronskian determinant of the above system. 


The existence and uniqueness-existence theorems for a single differential equation are valid for the system of differential 
equations in normal form. 


Review Questions for Chapter 6 


Section 6.1 of Chapter 6 (Review) 


1. 


Derive the system of ordinary differential 
equations for the currents 1, Jo, and I3 
for each loop in the circuit presented at the 7 
right. 


A 


. A popular child’s toy consists of a small rubber ball of mass m attached to a wooden paddle by a rubber band of length 


£cm. When the ball is launched vertically upward by a paddle with an initial speed vo, the rubber band is observed to 
stretch up to LZ cm when the ball reached its highest point. Assume that the rubber band behaves like a spring obeying 
Hooke’s law for the amount of stretching. Let y(t) be the height of the ball with respect to the paddle, and tg and tz 
represent the times at which the height of the ball is y(te) = @ and y(tz) = L. Upon introducing the velocity v(t) = y, 
transfer the given model system to the standard system of first order: 


my =—mg, O<t<te, y(0) = 0, y(0) = vo, 
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Section 6.2 of Chapter 6 (Review) 
1. Show that the following 2 x 2 matrices do not commute. 
2 3 3-2 2 3 5 -2 
(a) i; i and > 9 | (b) ; | and 2, 9 I 
3.44 3.5 1 4 3 1 
(c) f ‘| and |: | : (d) ki | and fi | ; 


iG) i ‘ snd : als (f) k dl ng 


Section 6.3 of Chapter 6 (Review) 


So 
i) 


1. Reduce the given differential equations to the first order system of equations. If the equation is linear, identify the 
matrix P(t) and the driving term f (if any) in Eq. (6.3.3) on page 363. 


(a) y= +0; (b) (¢? +1) y” + 2ty’ +y=sint; 
" 2,,¢ 3,,—9. " i 2 ata ake 

(c) ty” + ty’ +ty =2; (d) yy” +y' Int-y=e'; 

(e) y’/t—t?y’ +ysint =0; (f) ty’ 4 (e 4) y" +y = cost; 


(g) yy! +ty" +t y' + =0; (h) yl +P y”+y’ Int+y=5; 
2. In each problem, reduce the given linear differential equation with constant coefficients to the vector form, x = Ax+f, 
and identify the matrix A and the forcing vector f (if any). 


(a) y" + by’ + 9y = 2; (b) y" —4y' + 13y=sint; (c) y+ by’ + 13y = 0; 
(d)  y” —6y"” + 12y’—-8y=0; (e) y” +2y' + l5y=¢; (f)  y' — 2y" — Sy’ + 6y = 0. 
3. By setting v = £, express the following second order nonlinear differential equations as first order systems of dimension 
2. 
(a) i+v%+2=0, (c) ¢=cotx+sing, (e) +2? =0, 
(b) ¢+4%+sinz =0, (d) #=2°4+2?, (f) @+cosx =0. 


4. A spherical pendulum is one that pivots freely about a fixed point in 3-dimensional space (see §6.1.4). To describe 
its motion, we consider a spherical coordinate system (£,6,), where @ is the length of the pendulum, 0@(t) is the azimuth 
angle of the pendulum that it makes with the downward vertical direction, and ¢(t) is a polar angle measured from a 
fixed zenith direction (usually chosen as x), at time t. With this system, the rectangular coordinates of the mass are 
then 

x=fsinOcos¢, y=fsind sing, z= —Lcosé. 


(a) Newton’s second law applied to the pendulum of mass 1 leads to # = 0, y = 0, Z = —g, where g is acceleration 
due to gravity. By differentiating x(6,¢), y(0,¢), and z(0,¢) with respect to time variable t, show that identities 


en ae ee oe 
7 cos p + 5 sind = 0, 7 cos i sing = 0 


lead to the equations 
6 cos 6 = sind (# + °) ; 
0= 2cos06¢ + sind 4, 


respectively. 


ow that these equations yield the following system of differential equations: 
b) Sh h. h i yield the following sy f diff ial i 

6 = e sin 6 cos 8 — ; sin 0, 

$ = —26¢ cot 4, 0~#nz, n is an integer. 


Section 6.4 of Chapter 6 (Review) 


1. Solve the following systems of homogeneous differential equations. 
(D? — 8)y — 28Da = 0, 


(a) : = (b) (5D | 6)x+ (2D | Sa 
y + 4a = 0; (4D + 5)a + (3D + 2)y = 0; 
(c) a ae (a) (D? —3D+2)a 4 iad + 3)y = 0, 
— A)x — 5y = 0; (3D —6)a+ (D—3)y =0 
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2. Use elimination by operator multiplication to get rid of one of the dependent variable in the following systems of 
nonhomogeneous differential equations. Then find the general solution. 


(3D? + 1)a + (D? +3)y = 0, (b) (D+ 2)" + 2Dy =sint, 


(2D° + 1)a + (D? + 2)y = 0; (D — 3)a + (D— 3)y = 0; 
ic) (D? — 4)y—a2 =0, (a) (D—3)a—y = 12e", 
3D°yta=4e™: (D — 2)y — 2a = 0; 
(e) Dax + (D— 2)y = 13 sint, (f) D°a — 4y = 85 cos 3t, 
(D+ 1)x — 2y = 0; D?y +a =0. 


3. The following systems of equations illustrate the exceptional cases. Solve them by the method of elimination. 
(D—1)?a + (D+3)?y =0, (b) (2D4+2)a+(D+1)y =sint, 
(D+ 4)?a + (D+ 2)*y =0; D+1)a—(D+1)y =0; 

(D — 4)a + (2D— 2)y =e”, 
( = 
( 


i 
T 
i 
T 


4D 
D-—3)a+(D—1)y =0; 2D —3)a—(D—1)y=0; 
3D —6)a — (D— 2)y = 16e”*, (f) 3D—3)a+(D—2)y=4e', 
(D — 2)a + (5D — 10)y = 0; (D — 1)a — (D— 2)y = 0. 


4. For the following systems of differential equations, determine whether they are degenerate or not. 


( 
( 
(d) 
( 


LE=xL+2y4+2, &£=2+2y+ 2z, Z=f£-yt+z, 
(a) y= 2y, (b) y= 2x + 3y + 2z, (c) y= —a# + 8y— 2, 
z= -x2-—3y- 2; z= —2x — 3y — 22; Z=2e4+7Ty4+ 2; 


5. When a drug is taken orally, it passes through two primary stages, first the digestive system and then the blood 
circulatory system. Assuming that on the boundary of these two stages the rate of change of drug concentration is 
proportional to the concentration present in that stage, we arrive at the following pair of differential equations 


&=—ar+ f(t),  #(0) =0, 

y = bx — cy, y(0) = 0, 
where x(t) is the drug concentration in the digestive system and y(t) is the drug concentration in the circulatory system. 
The function f(t) represents the rate at which the drug concentration is increased in the digestive system by external 


dosage. Assuming that parameters a, b, and c are constants, solve the equation for x(t) and then substitute its solution 
into the equation for y(t). Since the resulting equation in z is linear, solve it. 


Section 6.5 of Chapter 6 (Review) 
1. In each exercise, the initial conditions are assumed homogeneous, that is, y(0) = y(0) = 0. 


(a) Rewrite the given second order scalar initial value problem in vector form (6.5.1) by defining u = (ui, u2)", where 
U1 = Y, u= y. 
(b) Compute the four partial derivatives Of;,(t, ui, u2)/Ou,;, k,j = 1,2. 


(c) For the system obtained in part (a), determine where in 3-dimensional tu-space the hypotheses of Corollary 6.1 
are not satisfied. 


(a) f+ @)?+y? =t (b) 7+ (2+3y+4y)~! =sint; 
(c) ¥+sin (ty) + cosy = 0; (d) gt+y°/H@-D =e. 


2. For the following second order differential equations subject to the initial conditions y(0) = 0, y’(0) = 1, find the first 
four Picard iterations. 
(a) y"+2ty=0; (b) y” +3 y=0; (ce) y= y'—y, (d) y= tty; 
(e-) y=Ptys (Ff) yl +ty?=0;  (g) y=y'-ty (hb) y” =ty’+2y. 


Chapter 7 


Topics from Linear Algebra 


This chapter is devoted to some important topics from linear algebra that play an essential role in the study of 
systems of differential equations. Our main objective is to define a function of a square matrix. To achieve it, we 
present in detail four methods: diagonalization, Sylvester’s, resolvent, and the spectral decomposition procedure, of 
which the last three do not require any knowledge about eigenvectors. Note that while we are focusing on these four, 
there are many other approaches that can be used to define a function of a matrix [22, 35]. In the next chapter, 
we apply our four techniques to solve vector differential equations of the first order, either homogeneous y = Ay 
or nonhomogeneous y = Ay +f, and equations of the second order y + Ay = 0, with a square matrix A. In 
particular, we focus on constructing the fundamental matrix functions: a, needed for solution of the first order 
equations and eVAt, A-1/2 sin (A/?t), and cos (A1/?t), used in second order equations. This chapter contains many 
examples of matrix functions. The first four sections give an introduction to linear algebra and lay the foundation 
needed for future applications. 


7.1 The Calculus of Matrix Functions 


In this section, we use square matrices whose entries are functions of a real variable t, namely, 


a oes wa 

aqi(t) age(t) ++: aan(t 

A(t) = [aij (t)] = — ; 
Amt1(t) Am2 (t) ese Amn (t) 


The calculus of matrix functions is based on the definition of the limit: 


t-to t-to im 
This means that for every component of the m x n matrix A(t), 


t-to 


If the limit does not exist for at least one matrix entry, then the limit of the matrix does not exist. For example, 


a ; cs 
lim¢_s0 Pp Bp does not exist because one component t~! fails to have a limit. 


A matrix A(t) is said to be continuous on an interval |a, 6| if each entry of A(t) is a continuous function on the 
given interval: lim;_,,, A(t) = A(to) for every to € |a, 6]. With matrix functions we can operate in a similar way as 
with functions. For example, we define the definite integral of a matrix function as 


[ A(t)dt = i aula 
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The derivative dA(t)/dt of a matrix A(t), also denoted by a dot, A, is defined as 


| a dass(t 
Fe a fi, AGH HAC] = [im 5 (au(t+ &)— ay(t)] = [A], 


Many properties and formulas from calculus are extended to matrix functions; in particular, 


d dB dA 
7 (AB) =A + 3B. 
S(AtB)= 4S, [aw+Boya=/ aware [ Boa, 


= (CA) =C—, fe A(t)) dt = c/ A(t) dt, 


where C is a constant matrix. 


Example 7.1.1: Let 


Then 


t £ t 
e~’ cost +2e’ cost+te 
a | e* sint+2t? sint+t? | 


Differentiation gives 


cost 2t ‘dt 0 al 


a(n A 7-\ 6 | 
2t _ 2t os t t ta 
eee | 2e7* cost — e7* sint+2e' te’+e'—sint | 


dt 2e7' sint + e7! cost + 4t cost + 3t? 
Multiplication of the matrices yields 
dB [| 2c cost ef 
‘dt || 2e%* sint ¢? |’ 


dA, _ | e'—e* sint te’—sint 


dt e*' cost + 2t cost + 2t? 
Hence 4 qB dA 
q (AB) =A ha, B: (7.1.1) 
Similarly, 


cost +2sint e+ 2t 
CA> | 3cost 6t | : 


d 
dt 


3cost 6t rat 


~si t A 
(ca) =| sint + 2cost oa d 


Note that the relation 3 
A*(t 
oa) =2A 


dt dt 
is valid if and only if matrices A and aAt) commute, that is, 


dA(t)  dA(t) 
dt — dt 


In general, this is not true, as the following example shows. 


A 


7.1. The Calculus of Matrix Functions 


t sint 


sint 1 | Then 


Example 7.1.2: Let us consider the symmetric matrix A(t) = | 


dA(t) = | 1 cost 


2 . 2 : : 
aaa A?(t) _ t+sin*t tsint+sint 
cost 0 


tsint + sint sin?t+1 


Calculations show that 


a cA) _ {t+sintcost  tcost dA(t) , _ t+sintcost sint+ cost 
dt sint+cost  sintcost]’ dt - tcost sin t cost 


On the other hand, 
dA?(t) 2+2sintcost sint+tcost+cost 
dt | sint+tcost+ cost 2sint cost . 


Generally speaking, the derivative of A?(t) is not equal to 2A(t) A(t); however, 


dA?(t)  , dA(t) dA(t) 
dt a dt " dt a 


Example 7.1.3: Let A(t) = | 


dt 2t —2t 2 —2 
Therefore, 
dA(t)  dA(t) ?-—1 0 
= —— A= 2t 
dt dt 1 -3+# 
2 | (t-1) 0 
On the other hand, A* = | 2(t2 — 2) (3-42) and 
dA2(t) iat 0 dA (t) 
dt =a | 1 -3+# car 
1. For each of the given 2 x 2 matrix functions, show that _ ~2A a“ 
1 ¢ 1 + et et 
(@) k Ae (b) E Ar te) ke Pan. id) 
t 2t+1]). 2 1], P+1 t4+1). 
el @\s al es a a) | 
2. For each of the given 2 x 2 matrix functions, show that = =2A as 
t ¢t t sint t 1 e ¢t 
@ [i i Oi |: © [rif @ {ff 
Qt 3t]. i 2]. : aa ale Lt 
© [ya © [% 4): @ |? 3) ay [2 


3. Find a formula for the derivative of P?(¢), where P(t) is a square matrix. 


2/t 
—2/t 


|: 
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(7.1.2) 


4. Show that if A(t) is a differentiable and invertible (Definition 7.3 on page 382) square matrix function, then A~‘(t) is 


differentiable and (A~')'=—A~!A’A7'. Hint: differentiate the identity AT! A =I. 
5. Find lim;¢-+0 A(¢) or state why the limit does not exist. 


cot t a . 
? 


— sect 
— sint . = ‘ 
(a) A(t) = Be | } (b) AQ) = bes sinht 


(6) Aw=| 


e’ tanht 
cost cosht 


|: 
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7.2. Inverses and Determinants 


Starting with this section, we will deal with square matrices only. To each square matrix, it is possible to assign a 
number called the determinant®’. Its definition is difficult, non-intuitive, and generally speaking hard to evaluate 
numerically when the dimension of the matrix exceeds 10 x 10. Fortunately, algorithms are known for calculating 
determinants without actual application of the definition. We begin with 2 x 2 matrices 


a b 
a | c d | , 
The number ad — bc is called the determinant of the 2 x 2 matrix A, denoted by 
a b a b 
det = act] ale | =ad =e 
Note that the vertical bars distinguish a determinant from a matrix. 
Example 7.2.1: Let 
2 3 
a. 
Then its determinant is 
detA =2-8-—3-5=1. 
Permutations 
Recall that a permutation of a set of the first n integers {1,2,...,} is a reordering of these integers. For each 


permutation o, sign(c) is +1 if o is even and —1 if o is odd. Evenness or oddness can be defined as follows: the 
permutation is even (odd) if the new sequence can be obtained by an even number (odd, respectively) of switches 
of numbers starting with the initial ordering o = (1,2,...,n), which has sign(o) = +1 (zero switches). For n = 3, 
switching the positions of 2 and 3 yields (1,3,2), with sign(1,3,2) = —1. Switching once more yields (3,1,2), 
with sign(3,1,2) = +1 again. Finally, after a total of three switches (an odd number), the resulting permutation 
becomes (3,2,1), with sign(3,2,1) = —1. Therefore (3,2,1) is an odd permutation. Similarly, the permutation 
(2,3,1) is even: (1,2,3) 4 (2,1,3) 4 (2,3,1), with an even (2) number of switches. In general, there are n! 
permutations of {1,2,...,n}. 

In general, the determinant of a square n x n matrix A = [a,; | is the sum of n! terms. Each term is the product 
of n matrix entries, one element from each row and one element from each column. Furthermore, each product is 
assigned a plus or a minus sign: 


det(A) = >. sign() 14, @2ip *** Onin, 


where the summation is over all n! permutations (71, %2,...,%) of the integers 1,2,...,n and sign(c) is +1, which 
is determined by the parity of the permutation. Therefore, half of all the products in this sum are positive and half 
are negative. 


Recursive definition of determinant 


Next, we present the recursive definition of the determinant that reduces its evaluation of an n x n matrix to 
calculation of determinants of (n — 1) x (n — 1) matrices. 


Definition 7.1: Let A be an n x n matrix. The minor of the kj” entry of A is the determinant of the 
(n — 1) x (n — 1) submatrix obtained from A by deleting row & and column j: 


Minor (az; ) = det 


69°The word determinant was first coined by Cauchy in 1812. 
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The cofactor of the entry ax; is 


Cof (az; ) = (—1)**7 Minor (a;;). 


Theorem 7.1: Let A = [aij] be a square n x n matrix. For any row k or any column j we define the 
determinant of the matrix A by 


det(A) = 3 Ami Cot (@mj) = ss AkmCof (akm).- (7.2.1) 


m=1 m=1 


The appropriate sign in the cofactor is easy to remember since it alternates in the following manner: 


+ | 


det (A) = 4110422033 + @12023431 + A21432013 — 431022413 — 421412033 — 41132423. 


Theorem 7.2: If A and B are square matrices of the same size, then 
e det (AB) = det(A) det(B) = det(BA); 
e det(A) = det (A*) ; 
e det(aA) = a” det(A); 


e the determinant of a triangular matrix is the product of its main diagonal elements. 


Theorem 7.3: If A(t) = [a;;(t)] is an n x n function matrix, then 


daii(t) daj2(t) ao dain (t) 
d dt dt dt 
q (det A) =| (rn 
Gni(t) @ne(t) --- Gnn(t) 
aii(t) aig(t) --- ain(t) 
dani(t) da@na(t) |. dann(t) 
dt dt dt 
where vertical bars are used to identify the determinants. | 


Recall that the n x n identity matrix (see Definition 6.6 on page 359), denoted by I, or simply by I, has entries 
of zero in every position in the matrix except for those on the main diagonal. These diagonal entries will always 
equal one. For example, 


1 0 0 
Iz3=]0 1 0 
0 0 1 
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Definition 7.3: If for a square matrix A there exists a unique matrix B such that 


AB =BA=I, 


where I is the identity matrix, then A is said to be invertible with inverse B. We denote this relation as 
A~! =B and vice versa, B~! = A. 


Theorem 7.4: If A is a nonsingular n x n matrix, then 


1 
Tt eat [Cof (a;:)];; (transpose of the cofactor matrix). (222) 


Example 7.2.3: Let us find the inverse”? of the following matrix: 


-1 2 1 
A= 3 1 4 
—2 0 -3 


First, we calculate its determinant: det A = 7. Hence, the matrix is not singular, and its inverse matrix will be 
Cof (a11) Cof (a21) Cof (a31) 
AT! = 7 Cof (a12) Cof (a22) Cof (az2) 5 
Cof (a13) Cof (ag3) Cof (ag3) 


where entries are cofactors of the matrix A. To determine the cofactors, we eliminate the corresponding row and 
column from the original matrix A to obtain 


Cof(ai1) = Cof(—1) = det oe =§, 
0 -8 
3 
Cof(ai2) = —Cof(2 ) = ~aet | _9 4\- 1, 
1 
Cof(ai3) = Cof(1) = det E 2 9 | = 2, 
2 1 
Cof(a21) = —Cof(3) = det | — = 6, 
-1 1 
Cof(az2) = Cof(1) = det | ae | = 5, 


and so on. Therefore, the inverse matrix is 


{|e ob 7 
ae La 7 
2 -4 -7 


Theorem 7.5: If A and B are invertible matrices, then 
e (See = AY 
e (AB) '=B"!A7-!; 
e (aA) =a7!A7~! for any nonzero constant a; 


(AT) * = (A-?)* and (A*)"' = (A7!)*; 


e det (A!) = (det A) * 


“0The concept of the inverse of a square matrix was first introduced into mathematics in 1855 by the English mathematician Arthur 
Cayley (1821-1895), a closed friend of James Sylvester. 
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Definition 7.4: The resolvent of a square matrix A is the matrix R,(A) defined by 


R,(A) = (AL— A)“, 


where I is the identity matrix. 


Example 7.2.4: Find the resolvent of the matrix 

1 1 

O 1]. (7.2.3) 
1 0 


Solution. Throughout the text, we will use the notation (A) = det(AI— A). For the matrix A, we have AI— A = 
A -1 -1 
—1 . —1] . Hence, the determinant of the latter matrix is y(A) = det(AI— A) = \® —3A—2 = (A+1)?(A—2). 
-1 -1 4x 
1 rA-1 1 1 


The inverse of (AI — A) gives us the resolvent: Ry(A) = OF DAW2) 1 A-1 1 
7 1 1 rA-1 


Example 7.2.5: In MATLAB®, the command inv(A) gives the inverse matrix A~!. With det(A), MATLAB, MuPad 
provide the determinant of the matrix A. The transposition of a real matrix in MATLAB can be found by typing A’. 


Example 7.2.6: Maple” has two packages to handle matrices, linalg and LinearAlgebra. The former is depre- 
cated, but still widely used for instance in MuPad, a CAS from MATLAB; it calls for commands inverse(A) and 
det (A), respectively. The latter utilizes MatrixInverse and Determinant instead. Note that the LinearAlgebra 
package uses commands started with upper case letters similar to Mathematica while linalg package commands are 
all typed in lower case letters. To find the resolvent of the matrix A, we type: 

with(linalg): M:=Matrix(3,3,shape=identity); R := inverse(lambda*M-A) ; 


Example 7.2.7: In Mathematica®, we define the matrix, its inverse, and the determinant with the following 
commands: 

A:={{0,1,1},{1,0,1},{1,1,0}} 

Inverse[A] // MatrixForm 

Det [A] 

and then hold “Shift” and press “Enter.” The resolvent can be defined as Inverse [lambda*IdentityMatrix[3] - 
A] To solve the system of algebraic equations Ax = b, Mathematica has a special command: LinearSolve[A,b], 
which gives the vector x. 


Example 7.2.8: Mazima allows us to find the inverse and the determinant of the matrix A by typing 
B: A**-1; /* or */ invert(A); /* or */ A7~7-1, detout; 
determinant (A) ; 


In weMazima, you can also click “Algebra” and then “Determinant.” Upon typing minor(A,i,j) it will return 
the matrix with row 7 and column 7 removed from A. Sage has two dedicated commands: M.determinant() and 
M.det(), where M is defined previously matrix. SymPy uses the latter. 


7.2.1 Solving Linear Equations 


Now we turn our attention to the set of solutions of a vector algebraic equation: Ax = b when A is a square n x n 
singular matrix, and x and b are n-vectors. This equation provides us another point of view on square matrices: they 
are transformations in a finite dimensional space because matrices map a vector x into another vector b. Moreover, 
a square matrix can be considered as an example of a linear operator (Definition 4.1, page 190). 

The term linear operator means a linear transformation from a vector space to itself. For some choice of basis, 
a linear operator corresponds to a square matrix, and vice versa. Let vectors b;, bz, ..., by, form a basis in an 
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n-dimensional vector space. Then an arbitrary vector v can be written as a linear combination of these vectors, say 


Vv = v,b, + vabg +--+ + u,b,. We can identify v with the n-column vector (v1, v2,...,Un)?. 

For a given linear operator L, let u = L[v] (which we will denote as Lv for short). In the chosen basis, the vector 
u has coordinates u = (uj, U2,...,Un)?, meaning that u = u,b, + ugbg +--+ u,b,. Since L is a linear operator, 
we have 


Iv=L [uiby + vob See Unby| = v1 Lb, + vo Lbs tee t Un LD. 


By expanding each vector Lb; = aijb1 + aajb2 +--+ + anjbn (j = 1,2,...,n) as a linear combination of the basis 
vectors, we obtain a square matrix A = [aj;], called the standard matrix for the linear operator L with respect to 
the given basis {b,}. 


Theorem 7.6: If a square matrix A is not singular, then the solution of the algebraic equation Ax = b is 

x = A~'b. If det A = 0, then the vector equation Ax = b either has no solution, or it has infinitely many 

solutions. In the latter case, the vector b must be orthogonal to every solution of A*y = 0, namely, b- y = 0. 
I 

Here A*=A . 


Recall that the rank of the matrix A is the maximum number of linearly independent column or row vectors of 
A. If the rank of the matrix A is r < n, then the set of all solutions of the vector equation Ax = 0, together with 
x = 0, forms a vector space, called the kernel or null space (also nullspace). The dimension n — r of the null 
space of the matrix A is called the nullity of A. It is related to the rank of A by the equation 


rank(A) + nullity(A) = n. 


A nonhomogeneous algebraic equation Ax = b has a solution only if b belongs to the column space of the matrix 
(sometimes called the range of a matrix), which is the set of all possible linear combinations of its column vectors. 
When A is a singular matrix, a system of equations Ax = b has a solution only if b is orthogonal to every solution 
y of the adjoint homogeneous equation A*y = 0 or y*A = 0. The null space of A* is called the cokernel, and it 
can be viewed as the space of constraints that must be satisfied if the equation Ax = b is to have a solution. 


Example 7.2.9: Let us consider a singular matrix of rank 2: 
1 2 8 
A={3 2 1 
1 2. 1 
This matrix has rank 2 because its first two rows (1, 2,3) 


a 
(1,1,1) is their linear combination: (1,1,1) = 4(1,2,3) + 4 
the three equations 


nd (3,2,1) are linearly independent, but the last row 
(3, 2,1). The vector equation Ax = 0 is equivalent to 
%1+2%2+32%3=0, 34,4 2%2.4+473=0, 414+%2+23 = 0, 


where x = (#1, 22,23)" is a 3-column vector. From the latter equation, we get 73 = —2, — £2. Substituting x3 into 
the first two equations, we obtain 


Ly + 2x9 —3 (a1 +242) =0 = —-24%,-—2%2 = 0, 
3x1 + 2x2 - (21 + x2) =O <= 24,+2%2=0. 
We can determine only one variable from these equations, for example, v2 = —22,. Then x3 = —2, — 42 = 


—2x, +22, = 2. Substituting these values of x3 and x2 into x = (1, 22,73)", we get the solution of Ax = 0 to be 
(x1, 201,21)" = 2 (1,-2, oe Hence, the null space is spanned by the vector 


(1, —2, 1)" ’ 


resulting in a one-dimensional vector space (a line). 


Example 7.2.10: Let us return to a nonhomogeneous vector equation Ax = b, with the singular matrix A from 
the previous example: 


1 2 8 Ly by LY by 
3.2 1 t2| = | be}, where x= |Z}, b= jbo 
1 11 x3 b3 X3 b3 
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First, we consider the adjoint homogeneous equation: 


1 3 1 Y1 0 
Aly =0 or 2 2 1} |ye2} = JO], 
3 1 1 Y3 0) 


which can be expressed by three equations 


y1 + 3y2 + y3 = 0, 2y1 + 2y2 + y3 = 0, 3y1 + y2 + y3 = 0. 


Solving this system of equations, we get yi = y2 and ygs = —4yi1. Hence, 
Y1 i 
y=) ae | Hoey 4 
—Ay, —4 


The solution space of the equation A*y = 0 is spanned by the vector (1,1,—4)?. So the cokernel of A is a one- 
dimensional space. Now we are ready to determine the column space of A. This space consists of all vectors that 
are orthogonal to (1,1,—4)7. This leads to 


(taal (by, ba, bs) =0 or by + bg — 4b3 = 0. 


Since we have only one constraint, 6} = —b2 + 4be, this vector space is two dimensional: 
bi —be + 4be —1 4 
bo| = bo =bo{] 1} +53 |0 
bg b3 0 1 


Therefore, the algebraic equation Ax = b has a nontrivial solution if and only if the vector b belongs to the column 
space of A spanned by the two vectors (—1,1,0)7 and (4,0, 1)”. 


1. Determine all minors and cofactors of each of the given matrices. 


0 0 -4 1 2 -1 0 2 -1 
(a) 0 -4 Of]; (b) 2 1. 1/3 (c) |4 3 5]; 
-4 0 15 -1 1 -1 2 0 -4 
1 2 3 1 1 4 1 2 3 
(d) 3 2 1]; (e) |0 6 —-1]; (ff) |2 3 4]. 
1 1 3 2 0 10 3.4 4 
2. Find the determinants of the following matrices: 
t sin 2t cos 2t : t ad 
ae sin2t+2cos2t cos 2t— sin 2¢ |; i i—@ 137 |. 
3. For each of the following 2 x 2 matrices, find its inverse using Cayley-Hamilton formula A~' = =+5 [(trA)I— A]. 
0 1 1 2 2 1 2 -1 
Olea lage Clare @ ie al: 
4. For each of the following 3 x 3 matrices, find its inverse. 
1 2 -1 0 2 2 1 11 
(a) 7 a (b) {2 0 2]; (c) 1 0 14]; 
-1 1 O 2 2 0 1 1 0 
1 3 O 1 2 3 1 1 1 
(d) 0 0 24; (e) 2B Bed (f) 1 2 3°). 
O -1 5 0 1 2 2 3 ~=5 
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7.3 Eigenvalues and Eigenvectors 


For a square n x n matrix A = (a;;), we consider an associated set of simultaneous linear algebraic equations 


Q1121 1 A12%2Q Ts + A1intn = bi, 
Q2121 + A22%2 T*'* + a2ntn = bo, 
Ani, + Ang%Q + +++ + Anntyn = br, 


which can be written in the matrix form A x = b, where 


Ty by 411 G12 **' Gin 

Z2 0) 421 G22 ‘"' Gan 
x= , b= , and A= 

In bn Anl An2 > ann 


The equation Ax = b establishes a relationship between column vectors x and b. This means that A is a linear 
transformation in the vector space of all n-vectors. This transformation is one-to-one if and only if the matrix A 
is nonsingular (i.e., det A # 0). In addition, if we require that b = Ax for some scalar A, so that the matrix A 
transforms x into a parallel vector, we are led to a consideration of the equation 


Ax =x or (AI— A) x =0, 


which has a nontrivial solution if and only if the matrix AI — A is a singular matrix. That is, \ is a root of the 
so-called characteristic equation: 


A a11 —a12 ae —ain 
—a21 A—a22°*° —a2n 
det(AI — A) = det : . ; . =0. 
Anil —An2 ees. — ann 


The determinant of the matrix A — XI, where I is the identity matrix, is clearly a polynomial in 4 of degree n, 
with the leading term (—1)”A”. It is more convenient to have the leading coefficient be 1 instead of (—1)”, yielding 
the following definitions. 


Definition 7.5: The characteristic polynomial of a square matrix A, denoted ya (A) or simply (A), is the 


determinant of the matrix AI— A, 
def 


(A) & det(AI— A). (7.3.1) 


Obviously, y(A) has the leading term \”. Any solution of the characteristic equation y(A) = 0 is said to be an 
eigenvalue of the matrix A. The set of all eigenvalues is called the spectrum of the matrix A, denoted by 
o(A). 


Definition 7.6: A nonzero n-vector x such that 
Ax = \x 


is called an eigenvector of a square matrix A corresponding to the eigenvalue 4. 


For \ = 0, we have the relation Ax = 0 instead of Eq. (7.3.2). All solutions of this equation form a vector 
space, called the null space or kernel of A. Note that an eigenvector corresponding to a given eigenvalue is not 
unique; any its nonzero constant multiple is again an eigenvector. Moreover, any linear combination of eigenvectors 
corresponding to a fixed eigenvalue”! is again an eigenvector. 


The prefix “eigen” is adopted from the old Dutch and German, meaning “self” or “proper.” 
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Definition 7.7: Let N be the collection of all eigenvectors corresponding to the eigenvalue A. Since, by our 
definition, 0 is not an eigenvector, N) does not contain 0. If, however, we enlarge N) by adjoining the origin 
to it, then Ny, becomes a subspace, usually called the eigenspace or proper space. We define the geometric 
multiplicity of the eigenvalue 4 as the dimension of the subspace N). If the eigenvalue \ has multiplicity 1, it 


is said to be a simple eigenvalue. 


Definition 7.8: Let \ be an eigenvalue of a matrix A. The algebraic multiplicity of is called the multiplicity 
of A as a root of the characteristic equation: det(\AI — A) = 0. 


These two concepts of multiplicity do not coincide (see Examples 7.3.1—7.3.3 on page 388). It is quite easy to 
see that the geometric multiplicity of \ is never greater than its algebraic multiplicity. Indeed, if T is any linear 
transformation with eigenvalue A, then N) is invariant under T. If To is the linear transformation of T restricted 
to Ny only, then clearly det(AI — To) is a factor of det(AI — T). 


Theorem 7.7: Nonzero eigenvectors corresponding to distinct eigenvalues of a square matrix are linearly inde- 
pendent. 


Let v1, V2, ---, Vm be nonzero eigenvectors of a square matrix T corresponding to distinct eigenvalues 
A, A2,---; Am- Suppose ay, G2, ..., Gm are complex numbers such that 
a1V, + devo +++: +4mVm = O. 
Applying the linear operator (A2I — T) (A3I — T)--- (AmI — T) to both sides, we get 
a1(A2 — A1)(A3 — At) +++ (Am — A1)v1 = 0. 
Thus, a; = 0. In a similar fashion, a; = 0 for each 7, as desired. Oo 
Definition 7.9: The eigenvalue \ of a square matrix is called defective if its algebraic multiplicity is greater 


than its geometric one. The difference (which is always nonnegative) between the algebraic multiplicity and 
geometric multiplicity is called the defect of the eigenvalue X. 


Definition 7.10: For a square matrix A, let \ be an eigenvalue of defect 1. If there exist two column vectors 
€ and 7 such that 
AE=2XE and (AI — A) = €, 


then the vector 7 is called the generalized eigenvector corresponding to the eigenvalue A. In other words, 
for the vector 7 we have (AI— A)? n = 0, but (AI— A) 7 4 O. If the eigenvalue \ has defect 2, the vectors 
n and ¢ that satisfy the equations A€ = A€, (AT— A) = €, and (AI— A) ¢ = 7 are called the generalized 
eigenvectors. Therefore, for these vectors, we have (AI — A) € = 0, (AI— A)’ n = 0, and (AI— A)® ¢=0. In 
general, a vector 7 is a generalized eigenvector of order m associated with the eigenvalue \ if (AI— A)" 7 = 
0 and (AI—A)”™ ‘7 ZO. 


The set of generalized eigenvectors of the n x n square matrix A corresponding to an eigenvalue \ is a subspace of 
the n-th dimensional vector space. Problem 8 on page 427 asks you to prove that the set of generalized eigenvectors 
is mapped to 0 by (AI— A)”. 

The eigenspace for every eigenvalue A is the kernel of the matrix \I— A: N) = ker(AI— A), where I is the 
identity matrix. The dimension of this space N) is the geometric multiplicity of A. If the matrix A is not defective, 
then the dimensions of all the kernels of (AI — A)* remain the same for any positive integer k = 1,2,.... However, 
if a matrix is defective, then ker (AI— A) C ker (AI— A)’. If \ has defect 1, then ker (AI— A)? = ker (AI— A)’. 
For an eigenvalue A with defect 3, we have ker (AI — A) C ker (AI— A)? C ker (AI — A)* = ker (AI— A)’. 

Let Ay, A2,---,Ar be the distinct eigenvalues of a square matrix A with algebraic multiplicities m1,m2,...,mr, 
respectively; then 


detA=][AM, trA=S > mArj;, (7.3.3) 
j=l 


gai 
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where the expression tr A is called the trace of A. Recall (Definition 6.8 on page 360) that the trace of a square 
matrix is the sum of its diagonal elements. 
The characteristic polynomial for any n x n matrix A can be written as 


(A) =X” — Op") + Gq A? — + + (-1)" ec, 


where c,—-1 = tr(A) and co = det A. Actually all coefficients c; can be expressed via eigenvalues, for example, 
Cn-2 = Dy, <a, AiAz (for real eigenvalues). 


Example 7.3.1: Let us consider a three-dimensional space and a linear operator T acting on this space. Suppose 
this linear transformation is determined by the matrix 


10 3 
T=|2 12 (7.3.4) 
002 


for a given basis. The spectrum of T consists of all roots of the characteristic equation .(A) = det(AI— T) = 0. A 
short computation gives x(A) = (A — 1)?(A — 2) = \3 — 4\2 + 5\ — 2. Hence, the spectrum o(T) of the operator 
T consists of two real numbers A = 1 and A = 2. The geometric multiplicity and the algebraic multiplicity of the 
simple eigenvalue \ = 2 are the same. 

The algebraic multiplicity of the number = 1 equals two, but the geometric multiplicity of this number equals 
one, so it has defect 1. Indeed, solving the system of equations Tx = x, where x = (x1, 22,23)", we see that the 
relevant eigen subspace N; consists of column vectors of the form (0, 2,0)", where x is any nonzero number and the 
index “T” denotes the transpose. To find the generalized eigenvector, we need to solve two systems of equations 


1 0 3] [& & 1 0 3) |m ™m 1 
2 1 2] |é] = lél], 2 1 2) [m2] = |n2| + |é2 
0 0 2] |é3 & 0 O 2} [ns 73 &3 


with respect to € = (&, &2,€3)" and 7 = (m,72,n3)7?. Since € is the eigenvector, we get €&; = €; = 0. Then the 
latter system of algebraic equations becomes 


m +373 =m, 
2m +12 + 2n3 = Ne + €2, 
3 = 13- 


From the first equation, it follows that 73 = 0. Since the last equation does not impose any condition, we obtain 


21 = £9. 


By choosing some numerical value for £2, say 2 = 2, we obtain two (linearly independent) eigenvectors corresponding 
toX=1: € = (0,2,0)7 and 7 = (1,0,0,)". Together with the eigenvector x = (3,8,1)7 corresponding to the 
eigenvalue A = 2 they form a basis for the three-dimensional space. 

To analyze the kernels of the powers of (I— T)”, we introduce the matrix 


0 0 -3 0 0 3 0 0 -3 
K=I-T=]-2 0 -2 — K?=|0 0 8|, K®={|0 0 —8| =—K?. 
0 6.41 001 0 0 =1 


While the kernel of the matrix K is one dimensional and spanned on the vector (0, 1,0)", the kernel of the matrix 
K? is two dimensional and spanned on two vectors, (0,1,0)7 and (1,0,0)7. Since (I— T)” = (—1)"K, n = 2,3,..., 
we have ker (AI — T) C ker (AI — T)? = ker (AI — T)® = ker (AI— T)* = -- 


Example 7.3.2: (Example 7.2.4 revisited) The matrix (7.2.3) on page 383 has the characteristic polynomial 
x (A) = (A — 2)(A +1)”. Consequently, the spectrum o(A) of the operator A includes two numbers \ = 2 (of 
multiplicity 1, with eigenvector (1,1,1)7) and \ = —1 (of multiplicity 2). Moreover, the geometric multiplicity 
of A = —1 coincides with its algebraic multiplicity. Indeed, the eigen subspace N_; consists of column vectors 
(a1, 22, —@1—a2)", where x1, £2 are arbitrary numbers. For example, the two eigenvectors (1,0, —1)7 and (0,1, —1)" 
are linearly independent. 
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Example 7.3.3: Let us consider linear operators defined by the following matrices: 


2 0 0 21 0 2 1 
T.=|0 20], T2=|0 2 0], T3=]0 2 (7.3.5) 
002 0 2 0 0 


Nora 


where a is any number. The polynomial y(A) = (A — 2)? is the characteristic polynomial of all three matrices 
(7.3.5). Therefore, the spectrum, o(T;) (7 = 1,2,3), consists of one number, \ = 2, with algebraic multiplicity 3. 
However, the geometric multiplicities of 1 = 2 for each of these operators differ. The eigen subspace N 2 for the 
operator T; coincides with R°. The eigen subspace Nz for the operator Ty» consists of all the column vectors of the 
form (x,0,23)", and the eigen subspace N» for the operator T3 coincides with all the column vectors of the form 
(x1,0,0)7. Here, x1, 23 are any (real) numbers. 


Theorem 7.8: The eigenvalues and eigenvectors of any n x n self-adjoint matrix (A = A” = A*, that is, 
ai; = G;;), possess the following properties. 


1. All eigenvalues are real numbers. 
2. There exists a basis of n linearly independent eigenvectors. 


3. Eigenvectors that correspond to different eigenvalues are orthogonal, that is, (x,y) =*-y? =0. 


A more general result is proved in Theorem 10.1, page 555. 


Corollary 7.1: For a square matrix A, the products of A with its adjoint, AA* and A*A, have the same real 
eigenvalues. 


Example 7.3.4: Find the eigenvalues and eigenvectors of each of the following matrices. 


2 -2 3 
1. The matrix A= | 10 -—4 5 | has the following eigenvalues and eigenvectors: 
5 —-4 6 
1 4/5 
Ay = Ag =1, x1 = 5 ; and A3 = 2, X, = 3 
3 2 
Therefore, the eigenvalue \ = 1 is defective. 
-2 2 -8 
2. The matrix A = 2 1 —6 | has eigenvalues A; = 5, Ag = A3 = —3 because its characteristic polynomial 
-1 -2 0O 
is x(A) = A3 + A? — 21\ — 45 = (A— 5)(A + 3)?. An eigenvector corresponding to Ay = 5 is (1,2,—1)7, and 
the two linearly independent eigenvectors that correspond to \2 = A3 = —3 are (—2,1,0)7 and (3,0,1)7. 
Therefore, the matrix is not defective. 
3.2 -3 
3. The matrix A=}; 0 2 1 has the characteristic polynomial (A) = (A — 3)(A — 2)(A — 1) with simple 
00 1 


eigenvalues Ay = 1, Az = 2, and A3 = 3. An eigenvector corresponding to \y = 1 is (5, —2,2)", an eigenvector 
corresponding to Az = 2 is (—2,1,0)", and an eigenvector corresponding A3 = 3 is (1,0,0)?. 


Example 7.3.5: In MATLAB, eigenvalues of a matrix A can be found by entering eig(A). The output of this 
operation contains a column vector of eigenvalues. To determine eigenvalues and corresponding eigenvectors one 
should enter [V,D] = eig( A ). This produces a diagonal matrix D of eigenvalues and a full matrix V with 
columns of eigenvectors so that A « V = V x D (the asterisk « represents multiplication of matrices). 
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Maple has two packages—linalg and LinearAlgebra—to deal with matrices. The command eigenvectors(A) 
provides eigenvalues, their multiplicities, and eigenvectors within deprecated linalg package (where all commands 
are typed in lower case letters); when the command eigenvalues (A) is typed, it provides eigenvalues of the matrix 
A. On the other hand, LinearAlgebra packages calls for Eigenvalues(A) and Eigenvectors (A). Eigenvalues also 
can be found from the resolvent. First, we find the matrix AI — A within linalg package: 

RR:= matadd(array(identity, 1..3, 1..3), -A, lambda, 1)); 
and then factor it: factor (RR) or calculate the resolvent: R:=inverse(RR). 

Mathematica has two dedicated commands: Eigenvectors[A] and Eigenvalues[A] These two commands 
can be united into one: Eigensystem[A], which gives a list of the eigenvalues and eigenvectors of the square matrix 
A. Sage utilizes the similar commands. 

When using Maxima, its package eigen contains several functions devoted to the symbolic computation of eigen- 
values and eigenvectors: eigenvalues(A) and eigenvectors(A) that can be united into one: spectral_rep(A) ; 
which displays eigenvectors and eigenvalues. The command charpoly(A,lambda) returns the determinant of 
det(A — AT) = (—1)" x(A). An alternative to Mazima’s charpoly is the function ncharpoly, which requires the load 
of this package: load ("nchrp1l"). SymPy uses special commands: M.eigenvals() and M.eigenvects(), while 
Sage utilizes similar ones: M.eigenvalues() and M.eigenvectors(). 


1. For each of the following 2 x 2 matrices, find its eigenvalues and corresponding eigenvectors. 


2 -1 —-4 -6 1 2 1 1 3 «2 
|e ol [es cle @ le ale @ le le Ole sh 
2. For each of the following 2 x 2 matrices, find its eigenvalues and corresponding eigenvectors, including generalized 
eigenvectors. 
0 -1 2 i 1 -l 1 -1 0 -3 
1) Se ® (Sane ae @) le Ge LE ak 


jl5 —9 -4 3 10 3 . [8 —4]. .. [9 —4 
© [eo @ lan ie @ 1 de OE ah we i el 
3. For each of the following 2 x 2 matrices, find its complex eigenvalues and corresponding eigenvectors. 


ei; ae Ea ey Sew |S ke @ ly sa 


Oiler 7 |e GP ao ke Oe ie OP Se Oe 


= 2 3 a meno 3 5 8 
(a) o 2 8 Is ib) | 2 2 @ | (c) sk 29s 
0 0 -2 a a | 
i =f = 1. = 3 4 =i =4 
(d) fe ale (e) | 12 -4 10 |; (ft) | a sg 42 
= | 5 i> 23 Si. 4 


5. For each of the following 3 x 3 not defective matrices, find its eigenvalues and corresponding eigenvectors. 


-3 2 2 3.2 4 2 —-2 #1 
(a) 2 -3 2 : (b) 20 2]; (c) -1 3 -1]; 
2 2 -3 4 2 3 2 -4 3 
3 -3 —-4 1 2 -4 3 1 -1 
(a) | 6 -8 -12 |; (ec) |1 0 4 |; (f) | 1 3 -1 
—-4 6 9 1 -2 6 3.3 -1 
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7.4  Diagonalization 


This section shows the reduction of matrices into their diagonal form. This approach, called diagonalization, allows us 
to define a function of a square matrix. We present the diagonalization procedure only for matrices with nondefective 
eigenvalues, that is, when the algebraic and geometric multiplicities are equal for every eigenvalue. The algorithm 
based on diagonalization can be extended for an arbitrary matrix; however, all calculations become more tedious as 
a result and require more work. 


Definition 7.11: Two matrices A and B are said to be similar if there exists a nonsingular matrix S such 


that S-'AS = B. We denote this relation as A ~ B. 


Note that the matrix S is not unique. It is obvious that if A ~ B, then B ~ A because SBS~! = A. Also if 
A~BandB~C, then A ~ C. Indeed, suppose that there exist two matrices S and T such that 


S"'AS=B, T'BT=C. 

We multiply both sides of the former equation by T~! from the left and by T from the right. This yields 
T's 'AST=T'BT=C 

since T-!BT = C. If we denote R = ST, then R~! = T~“!S~!, and we have 


R-'AR=C. 


Theorem 7.9: For similar square matrices A and B, we have 
det(A) =det(B) and tr(A) =tr(B). 

Example 7.4.1: Let us consider two nonsingular matrices: 
O11 1 
S=/;1 01 and A=] 0 
1 1 0 0 -1l 


We use the matrix S to build a similar matrix to A, which we denote by B: 


i (=i 2 4 1 3 40 011 
B=S'AS=5 1 -1 1 0 0 2 101 
1 1 -1 0 -1 5 110 
3 6 —5 
=5|5 4 3 
1-2 5 


These two matrices have the same determinants and traces: det(A) = det(B) = 2 and tr(A) = 6 = tr(B). 
Similarly, we can use the matrix A to determine T, a similar matrix to S: 


1|2 —15 6 011 1 3 0 
T=A-!ISA=_|0 5 -2 101 0 0 2 
0 1 0 1 1 0 0 -1 5 
, | —9 —14 —49 
= 3 4 21 |, and trT =trS=0. 


1 2 5 


Definition 7.12: An n x n matrix is called a diagonal matrix if its entries are 0 everywhere except possibly 


along the main diagonal (which will therefore contain all eigenvalues). 
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A diagonal matrix is completely determined by its main diagonal elements. The product and sum of any two 
diagonal matrices is again a diagonal matrix. Since a diagonal matrix is uniquely identified by a vector of its diagonal 
elements, there exists a one-to-one correspondence between diagonal n x n matrices and n-vectors. 

Mathematica and Maple (needs the LinearAlgebra subroutine) share a dedicated command to define a diagonal 
matrix: 

DiagonalMatrix[{1,2,3}] // MatrixForm 
Mazima and MATLAB also have a similar command: A : diag([a,b]); for a diagonal 2 x 2 matrix. 


Definition 7.13: An nx n matrix A is said to be diagonalizable if it is similar to a diagonal matrix, that is, 
there exists a diagonal matrix A and a nonsingular matrix S such that A = S~'AS. 


Definition 7.14: A square matrix is called positive definite, or positive semidefinite, if its eigenvalues are 
all positive numbers or nonnegative numbers, respectively. 


A matrix is positive definite if and only if (u, Au) > 0 for any nonzero vector u, where (, ) is the inner product 
(Definition 6.1, page 356). 


Theorem 7.10: A square matrix A is diagonalizable if and only if it is nondefective, that is, its eigenvalues 
have the same geometric and algebraic multiplicities. In other words, if a square n x n matrix A has n linearly 
independent eigenvectors, then A is diagonalizable. Hw 


We present some sufficient conditions that guarantee this property. 


Theorem 7.11: A square matrix A is diagonalizable if it 
1. is a self-adjoint matrix, that is, A= A* (A* = A’ is the adjoint matrix); 
2. is normal, namely, AA* = A*A; 


3. has distinct eigenvalues. a 


Other than Theorem 7.10 and Theorem 7.13 on page 400, necessary and sufficient conditions for a matrix to 
be diagonalizable are still unknown. When the entries of an n x n matrix A are integers, then the matrix A has 
the Smith’? canonical form UAV = A, where U and V are unimodular matrices (which means that they have the 
determinant equal to +1), and A is a diagonal matrix with integer values, called the invariant factors of A. However, 
this canonical form is rarely used in practical calculations. 

We illustrate the product of matrices by considering two 2 x 2 matrices: 


A= Q11 412 and BS bi bye 
a21 422 ba bee 


Their product is 


AB-— a41b41 + @y2b21 41612 + ay2b22 
a21b11 + G22b21  d21b12 + az2b22 


It is convenient to rewrite the matrix B in column form as 


B=[bi,b2], where be | b= | 5 |. 
21 


Then 
AB = [Ab,, Aby]. 


Therefore, in general, 
AB = [Ab,, Aby,...,Ab,], 


72The British mathematician Henry John Stephen Smith (1826-1883) discovered this property in 1861. 
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where b; (j = 1,2,...,n) is the j-th column vector of the n x n matrix B = [bj, be,..., by]. 
Suppose that a square n x n matrix A has n linearly independent eigenvectors 


X1, X2,---,Xn, 


where x; = (@1;,%2i,-.-,%ni)’, i =1,2,...n. Let Ay, Az, ..., An be the corresponding eigenvalues. We construct a 
nonsingular matrix S from the eigenvectors, that is, 


| T11 12 Zin ] 
21 22 soe Ln 
S= ; mS det(S) 4 0, 
In1 Un2 s+. Inn 


where every column is an eigenvector. Recall that the condition det(S) 4 0 is necessary and sufficient for vectors 
xX;, 7 =1,2,...n, to be linearly independent. Since Ax; = \;x;, we have 


Mit11 Ag®ig— -.-  AnLin 
Ai%21 Ag®o2«.. Anon 
AS = [Ax, Axo, ee , Ax, | = 
M12n1 A202 te Anfnn 
On the other hand, let A be the diagonal matrix having 41, A2, ..., An on its diagonal: 
4 (Os OO 
0 A. «~O 
A= . . . . (7.4.1) 
0 O +++ An 
Then 
tA, f12A2. «©.» LinAn 
taiA1  La2AQ. «-.  LanAn 
SA= 
Lni Ar Ln2A2 te LnnAn 


From the above equation, it follows that SA = AS so A~ A. Therefore, a matrix S can be constructed from the 
eigenvectors of the given matrix A. 


Example 7.4.2: Let j be the imaginary unit vector in the positive vertical direction on the complex plane C 
(j? = —1), and let 
1 j 
a-[5 3] 


be a symmetric matrix, but nonself-adjoint. It has one eigenvalue 4 = 2 of algebraic multiplicity two. To this 
eigenvalue corresponds a one-dimensional space of eigenvectors that is spanned by the complex vector 


J 
11: 
Hence, the geometric multiplicity of X = 2 is 1 and, as a result, this symmetric matrix with complex entries is 


not diagonalizable. Note that a real symmetric matrix is always diagonalizable (Theorem 7.11, page 392). The 
self-adjoint matrix 


has two distinct real positive eigenvalues 


M4 =24+V2~3.41421 and Ay =2—V2 0.585786 
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with complex eigenvectors 
= X1 
1 > 


he al oad m= [ SEHD _ 


respectively. Hence, the matrix Ag is diagonalizable; all its eigenvalues are positive real numbers, so Ag is a positive 
matrix. The matrix S of these column eigenvectors is 


S= | i(v2—1) sae) | and det(S) = 2jV2. 


1 
, cs 1 j(V24+1) 
1__1 
Since S~* = Vi | f, Gje— 1) , we have 
-1 _f 2+v2 0 
S-*A2S =| 0 2 V3 |" 
j 0 -j 
Example 7.4.3: (Normal matrix) The matrix |0 j 0} is normal, but not self-adjoint. It has three distinct 
to 3 
(complex) eigenvalues j and +1 + j; therefore, the matrix is diagonalizable. 
4 0 2 
Another nonsymmetric matrix M= |2 4 0] is normal, which means that 
0 2 4 
4 0 2] |4 2 0 20 8 8 
MM*=]2 4 O| |0 4 2} =M*M=1]8 20 8 
0 2 4] |2 0 4 8 8 20 


The matrix M has three simple eigenvalues 3 and 3 + jV3, so it is diagonalizable. 


Example 7.4.4: Let us consider a 3 x 2-matrix 


1 2 
B=|o0 1 |, with Br= [3 4 el 
21 -1 
1 -l 
We can construct two (diagonalizable) matrices: 
5 2 —-1l al 
B,=BB’=]/] 2 1 -1] and B,=B’B= 
-1 -1 2 pe 


The singular matrix B, has three real eigenvalues 
A, = 44+ V5 & 6.23607, Az =4—V5 1.76393, Az =0. 


The matrix Bo has two eigenvalues A; = 4+ V5 and \y = 4— V5. It is not a coincidence that matrices B; and Bo 
have common eigenvalues. 


Now we are in the position to define a function of a diagonal matrix (7.4.1) as 


f(A) = | . ; (7.4.2) 
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The powers of diagonalizable matrices can be obtained as follows. Let A = S~1AS be a diagonal matrix similar 
to A. Then 
A? = AA =SAS7'SAS™ = SA?S™?. 


Similarly, for any positive integer m, we have 
A™ =SAS~!...-SAS~!=SA™S7!. 
That is why for an arbitrary polynomial f(A) = agA™ + @m—1A"~! + +++ + am, we have 
f(A) =S/(A)S-*. 
With this in hand, we define a function of an arbitrary diagonalizable matrix A: 


f(A) =SS "f(A)SS"* =S/ (S""AS)S* =Sf(A)s™. 


Thus, 
f(A1) 0 0 
0 f(A2) + 0 
f(A)=Sf(A)ST=S]| Sn se hee (7.4.3) 
6 O. a0 Fog 


Remark. We cannot use this formula (7.4.3) if the function f(A) is undefined for some eigenvalue of a matrix A. 
The function f(A) must be bounded on the spectrum of the matrix A, otherwise f(A) does not exist. For example, 
the exponential function e“* exists for an arbitrary square matrix A because e* is defined for every \ (complex or 
real). However, the function f(A) = \~! can be applied only to nonsigular matrices. 


9 8 
7 8 
A2 = 16. The corresponding eigenvectors are as follows: x; = (1,—1)7 and xg = (8,7)7. Taking the diagonal 
matrix of its eigenvalues A, and building the matrix of eigenvectors S, we obtain 


s=| : ar git E ok aud A=s-tas =|) ve 


Example 7.4.5: Let us consider the 2 x 2 matrix A = , which has two simple eigenvalues A; = 1 and 


1% a5. 11° i 0 16 


According to Eq. (7.4.3), for the function f(A) = 444, we have that 


A+ 
A-I_.[ f(a) 0 ca ee We. a 
m= 8| 0° fie) |F Slo asya7 J® 
8 8) 1f9 -8] 1s 8]. 1 
-[; | alos ol alt *| =a (AD. 
However, the fractions 
A+I Fi A+I 
A-I “  A~-i6I 


do not exist because the corresponding functions (A + 1)(A — 1)! and (A + 1)(A — 16)~! are undefined for A = 1 
and \ = 16 (the spectrum of the matrix A). We can determine f(A) for functions that are defined on the spectrum 
of A, for instance, for f(A) = \~! or f(A) = e*, as the following calculations show. 


1 0 1 8 —8 
= SM 
A =s(5 cae -3|5, al; 
we | 1, Se Oe Se) | eerie fe ee 
Tey 0 eM |°35]1 1 | 15 | 7el*—7et Tel +8et 
Now we turn our attention to the definition of a root of a matrix. Since the square root, f(A) = VW, is not a function 
(that assigns a unique output to every input), but rather the analytic function of a complex variable: every input 
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has two output values depending on the branch chosen. Using the square roots of \ = 1,16, we may define four 
possible roots of the diagonal matrix: 


1 O -1 O -1 0 1 O 
malo a} a= [oP Sf) as=[ob af A= fo 
According to Eq. (7.4.3), the corresponding roots of the matrix A are as follows: 


: 1/13 8 
def a 
| » Re=SA28 = E 12 


R, = SA,S"! = 
det - 1 
; | , Ry=SA,S t= ae E | =—R3. 


©, 12 


e _ 1 
R3 = SA3S~' = = 
3 
Each of the matrices Ry, k = 1,2,3,4, is a root of the square matrix A because RZ? = A. These matrices have 
distinct eigenvalues, but they share the same eigenvectors, as the matrix A. For each root, we define four exponential 
functions according to the formula e®*! = SeA«'S~! to obtain 


7 4 


eri Set 4+7e! 8et*—B8et eRot _ 1 Se-*+7e* 8e-*#— 8e-* 
~ 15 |7et#— Tet Tet# 4+ 8et|’ ~ 15 |7e-* —Te-*  Te~*# 4+ Be’ 
eo Set +7e* 8ett—8e* na | [Se +78 Be =se’ 
~ 15 |\7e%—Te-t Tet +8e-*|’ ee 8 (FeV Fem fee | 


Now we consider two functions that involve the root 


sin(VAt) ihre - ye 
me _ 2 Gera cos(V Mt) = OW a 


However, as their Maclaurin series show, these functions are actually entire functions because their series converge 
for all of the parameter values. Therefore, we expect that Eq. (7.4.3) will give the same result for each root Rx, 
k =1,2,3,4. Indeed, 


sin(VAt) _ sin(Ret) _ ¢ sin(Axt) g-1_[1 8] fsnt 0] 1 [7 -8 
VA Re Ak “lb oe | 


1 in4t — 8 si 
evens 2 sin 4t a (k = 1,2,3,4), 


~ 15 [2 sin4dt—7sint 1 sindt+8 sint 
_ _ -1_ | 1 8| Jcost 7] 2 Po 8 
cos(VAt) = cos(Rxt) = S cos(A,t) 87! = [4 ail o --eeea| ge lt 4 


1 ; cos4t+7 cost 8 cos4t—8 cost 


~ 15 |7 cos4t — 7 cost eae (k = 1,2,3,4). 


Maple can help you to find, say, cos(Rit), with the following commands: 
with(linalg): s:=matrix(2,2,[1,8,-1,7]); 
si:= inverse(s); cc:=matrix(2,2,[cos(t),0,0,cos(4*t)]); 
cosR1:=evalm(s&*cc&*si) ; 
The trigonometric functions, cos(VAt) and sin(vAt), can also be obtained from the exponential function eV Ait 


by extracting the real part and imaginary part, respectively. The definition of the matrix function cos (v At) does 


not depend on what root VA is chosen, but the function sin (v At) is sensitive to such a choice because it depends 


on Rx, k = 1,2,3,4. Choosing, for instance, the root Ri, we ask Maple to do this job: 
di:=matrix(2,2,[1,0,0,4]); R1:=evalm(s&#*d1i&+*si); exponential (R1,I*t) ; 
Maple has a dedicated command to determine the function of a matrix (see Example 6.2.7, page 360). 


Example 7.4.6: Let 


-3 2 2 1 0 0 
A=j]|-6 5 2 and A=j|0 2 0 
—-7 4 4 0 0 3 
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be the 3 x 3 matrix and corresponding diagonal matrix of its eigenvalues. Then A = S~!AS, where 


1 2 3 2 1 -2 
Sa] i 2a and S =) 1 —2 7 |, 
1 3 = 5 -1 1 O 
with det(S) = —1. Recall that each column vector of the matrix S is an eigenvector of the matrix A. For the function 
f”A)= sin(at) ) we have 
sin(At) sin(At) | 4 
A)= =S s-t= 
i 23 sin(t) 0 0 2 1 -2 
=|124]-)} 9 S@2® o |./ 1 -2 1 
1 3°5 0 0 ania) -1 1 0 
3 +2 cost—4 cos’ i —4 cost + 4 cos? t —1+2 cost 
=sint |2+2cost— 2 cos?t —4—4 cost + 2 cos?t -—1+2 cost 
a = a 3 _ ee ee ee 
+3 cost cos? t 6 cost + = cos*t 1+3 cost 


Similarly, we have 


g (A) = cos(At) = S cos(At) S~! = 


1 2 3 sin(t) 0 
=|12 4]. 0 cos(2t) 
1 3°55 0 0 cos( 36 


2cost + 2. cos(2t) — 3cos(3t) cost — 4cos(2t) + 3 os 2 cos(2t) — cost 
= |2cost + 2cos(2t) — 4cos(3t) cost — 4cos(2t) + 4cos(3t) 2cos(2t) — cost 
2cost + 3cos(2t) — 5cos(3t) cost — 6 cos(2t) + 5cos(3t) 3cos(2t) — cost 


Both f (A) and g(A) are solutions of the following matrix differential equation & + A?®(t) = 0, namely, 


ae (ae) 4 A? (ae) =0, (cos (Ad) + A? cos(At) = 


dt? A 


1. Are the following matrices diagonalizable? 
2 5/2 |. 2 j/2 |. 2 -1 
Olas sie Wile. i Ola ek 
2. For the following pairs of matrices (a, b, c, and d are real constants), 


waeP J meal J mac ym e-[4 J 


(a) prove that they are similar; 


(b) show that matrices A and B have distinct eigenvectors. 


2 
3. Using the diagonalization procedure, find the functions f(A) and g(A), where f(A) = e** and g(A) = —. of the 


following 2 x 2 matrices: 


wfc wy eff of ap 


398 


7. 


8. 


10. 


11. 
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Using the diagonalization procedure, find the exponential matrix, e“*, for each of the 2 x 2 matrices in Problem 3 on 
page 390, Section 7.3. 


Using the diagonalization procedure, find the exponential matrix, e“*, for each of the following 2 x 2 matrices: 


4 13 14 9 9 —15 4 13 
@ [Fo]: wy [Mah |e ar) @ |e a) 
4 1 7 -2 17 —10 17 -7 
7-2 
. Using the diagonalization procedure, find the functions f(A) and g(A), where f(A) = e** and g(A) = ya? of the 
following 3 x 3 matrices: 
12-2 4 t —2 2 2 2 1 
(a) | 8 -1 4/4]; (b) | 25 -1 4]; (c) |-1 3 -1|; 
-13 4 -4 —25 4 -1 2 -4 3 
1 12. -8 4 -—50 50 3.2 4 
(d) |-1 9 -4]; (e) |10 -2 3]; (f) [12 5 12]; 
-18 7 -18 1 0 1 4 2 3 
1 2 1 1 12 -8 3.2 4 
(g) |-2 3 5]; (h) |-1 9 —4|; (i) |2 4 2 
2 1 -i1 -4 8 -4 4 3 3 


Using the diagonalization procedure, find the exponential matrix, e“*, for each of the 3 x 3 matrices in Problem 4 on 
page 390, §7.3. 


For each of the following 2 x 2 matrices, find all square roots. 


re ed ee 
e) i a (f) i ali (g) ee ah (i) 2 Fal 
0 [sol @ GS) wf vol: o [% 3h 
m) |i gl @) ls si ©) |} i): ) [i a). 


For each of the matrices in the previous Problem, determine ausA and cos (VAt), and show that the result does 
not depend on the choice of the root. 

For each of the matrices in Problem 8, find R(t) = eVAit (j? = —1) for each root of the matrix A; then extract its real 
part (denote it by cos (Vat) and imaginary part (denote it by sin (vA) ), and show that they satisfy the matrix 


equations: 


cos” (vat) + sin? (Vat) =. - [cos (var)| + A cos (Vaz) = 


Determine which of the following 3 x 3 matrices is diagonalizable. 


1 0 1 7 2 -6 3 0 7 
(a) [1 1 Of; (b) |3 8 : ; (c) J15 1 8]; 
0 0 1 6 0 0 38 
25 -8 30 7 7 5 -1 0 
(d) 24 -4 30]; : (f) |-1 1 2]; 
-12 4 -14 5 1 1 
91 72 —-18 Be -—99 99 3-2 2 
(g) |-135 -107 27]; 151 —153] ; (i) J]O -1 Of]; 
—105 —84 22 117-119 4 —2 -3 
-11 25) «#13 5 1 6 1 -1l 
(j) 6 -9 -6]; —-3 -2)/; (qd) j1 0 -1 
—32 61 34 17 8 17 #4 
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12. For each of the following nonnegative matrices, determine sea) and cos (vA i) and show that the result does not 


13. 


14. 


15. 


16. 


17. 


depend on the choice of the root. 


24. 94 A Tk. 6-26 1 2 6 
(a) |6 -8 12]; (b) |3 10 -6]; (c) }1 10 2]; 
= a 1 6 i, 6, B 
1 10 6 th a ee ee 
(ad) [5 10 —10]; (ey. et Be ls (ft) |o 1 1; 
hie ee" 9 i 2 3 20 tay 
a ee = a ee” 40 4 
(g) |5 2 —3]; (iy theo Sa8° B13 CONE my ame 
pO: 3 a a 0 4 0 
de SEG re ae 21 10 -2 
Gy 16 -3> 62 (ey, Ee? 3B as (Ds 288 tt BOS 
2 a ae! = a ee 110 50 —ll 
39 5 -16 can ae 6 2 1 
(m) |-36 -5 16]; Get Se Shs (0) |-2 6 11]; 
#29" 229 i h. -Z =i “%, <I 
ms aa OF Teaihe °325 15 #17 1 
(p) |o 4 Ol; (q) |-5 19 5 |; (r) | 8 -15 -8 
6s 135 °% i> 45? 14 -10 65 26 


For each of the following nonnegative matrices with complex entries, determine aus and cos (VAt), and show 


that the result does not depend on the choice of the root. 


34h, Ol > en ae 9+2j 2-5j]. 
(@) ie Hae ) Cae te a ie sae 

Deb Ft = Fi Gp. Sy 10+j 6-3]. 
(4) Foes aa (e) fee vee i ®) fo4g 7-3) 

9+j 1-3]. 8437 -=7 44). go Pe OF, 2G 
(g) hae re (h) eee 5-33]? (i) |o_33 6-95] 


Show that if a square matrix A is nilpotent, then all its eigenvalues are zeroes. In particular, a nilpotent matrix is 
always singular (det A = 0) and its trace is zero. Recall that a matrix A is called nilpotent if A? = 0 for some positive 
integer p. 


For each of the following 3 x 3 matrices, find all square roots. 


=35 Tiss «Bi —51 20 30 Gee ee 
(a) | 234 -86 —126] ; (b) |—-30 19 15]; (c) |-24 9 13]; 
234 90 130 —90 30 54 13. 25 SY. 
OY 80k: <=d8 WT As - 126 16. <3. 9 
(d) |}98 -15 —24| ; (e) |-88 25 64 |; (f) |-84 25 63 
144-48 —71 264 —72 —-191 48 -12 32 


For each of the following 3 x 3 positive matrices, determine sea and cos (Vat), and show that the result does 


not depend on the choice of the root. 


a | 6 -2 0 i. 23 
(a) 0 6 -5 |; (b) |-5 5 —-2]; (c) (290 24 
Be: a= Sth 0. 2, 5 Lr =2¢ 8 


ely 2 O -1l -2 2 Be cd. 
(a) |5 4 —-T7]; (b) ne a (c) |4 5 —-2]; 
5 0 6 ee ee 1 29 —-7 O 
-3 7 —-6 2 UG i) -15 -7 4 
(ay |e “Sb, oe | Se, el eae (f) | 34 16 —4 
1 -1 4 -6 7 9 17 7 5 
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7.5 Sylvester’s Formula 


This section presents probably the most effective algorithm to define a function of a square matrix—the Sylvester 
method. We discuss only a simple version of this algorithm that can be applied to diagonalizable matrices. When 
Sylvester’s method is extended to the general case, it loses its simplicity and beauty. 
Because a square matrix A can be viewed as a linear operator in n-dimensional vector space, there is a smallest 
positive integer m such that 
PeA A, And A“ 


are not linearly independent. Thus, there exist (generally speaking complex) numbers qo, q1, ---; Gnm—1 Such that 


q(A) =qol+qmAt qA? ++-++¢m1A™ 14+ A™ =0. 


Definition 7.15: A scalar polynomial q(A) is called an annulled polynomial (or annihilating polynomial) of 
the square matrix A, if ¢/A) = 0, with the understanding that A° = I replaces \° = 1 in the substitution. 


The annihilating polynomial, ~(A), of least degree with leading coefficient 1 is called the minimal polyno- 
mial of A. 


The degree of the minimal polynomial of an n x n matrix A is less than or equal to n; moreover, we can rewrite 
the product of its factors in compact form using the notation [], which is similar to © for summation, as follows: 


Ss 


WA) = (A= AL)J™ (A= Av)" (A= As)™ = TTO= AV), (7.5.1) 
j=l 
where each number m;, j7 = 1,2,...,8, is less than or equal to the algebraic multiplicity of the eigenvalue \;. So 


my, +m2+---+m, <n. The minimal polynomials can be determined from the resolvent of the matrix. From 
Eq. (7.2.2), page 382, it follows that the resolvent of a square matrix A has the form 


Ry(A) = (A= A)" = PCa), 


where x(A) = det(AI — A) and P(A) is the n x n matrix of cofactors, which are polynomials in \ of degree less than 
n. If every entry of the matrix P(A) has a common multiple (A — ,;), where A; is an eigenvalue of A, then it can be 
canceled out with a corresponding factor of y(A). After removing all common factors, the resolvent can be written 
as 


R)(A) = (AI— A)7! = —~ 
where the minimal polynomial (A) of the square matrix A and the polynomial n x n matrix Q(A) have no common 
factors of positive degree. The following celebrated theorem, named after the British mathematician and lawyer 
Arthur Cayley (1821-1895) and an Irish physicist, astronomer, and mathematician William Rowan Hamilton (1805- 
1865) is crucial for understanding the material. 


Theorem 7.12: [Cayley-Hamilton | Every matrix A is annulled by its characteristic polynomial, that is, 
x(A) =0, where x(A) = det(AI — A). 


Theorem 7.13: A square matrix with distinct eigenvalues p11, 2, .-., [4s, is diagonalizable if and only if its 
minimal polynomial is w(A) = (A — f41)(A — fe) ++: (A= ps). 


Example 7.5.1: Let us consider two matrices 
1 1 0 

Ay = 0 1 0 and Ag = —1 5 —2 
0 0 1 


Their characteristic polynomials are yi(A) = (A — 1)? and y2(A) = (A — 2)?(A — 4), respectively, but the minimal 
polynomials for the matrices A; and Ag are of the second degree: (A) = (A — 1)? and 2(A) = (A — 2)(A — 4) = 
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\? —6\+ 8. Therefore, the second powers of these matrices can be expressed as a linear combination of I, the 
identity matrix, and the matrix itself: A?—2A,+I=0 and A} —6A2+8I=0. Thus, for non-diagonalizable 
matrix, 


a. 2 D dh, AG: i 2 0 
A?7=9A;-TI=|]0 2 0/-|]0 1 0/=]1 1 0], 
0 0 2 001 001 
and for the diagonalizable matrix Ay we have 
ig. —18 12 8 0 0 10 -—18 12 
A2?=6A,—8I=| -6 30 -12|/-—|0 8 0|/=|-6 22 -12 
-6 18 O 00 8 -—6 18 -8 


Let A be an n x n diagonalizable matrix and let y(A) be its minimal polynomial that is also the product of linear 


factors: 
s 


WA) = A= A)(A= 2) A= As) = TTA - rx), (7.5.2) 
k=1 
where Ay, k = 1,2,...8 <n, are distinct eigenvalues of the matrix A. If w(A) coincides with the characteristic 
polynomial y(A), then s equals n, the dimension of the matrix A. 
Suppose that a function f(A) is defined on the spectrum o(A) = { Aq, Az, ...,As } of the matrix A, that is, the 
values f (Ax), k = 1,2,...8, are finite for every eigenvalue A,. Then we define f(A) as 


f(A) = 2 fx) Ze(A), (7.5.3) 
k=1 


where 
(A — Ai) +++ (A = Ag-1)(A = Anti) ++ (A = As) 


(Ak — A1) ++ (Ak — An—1) (Az — Anta) ++ (Ak — As)’ 
Ts 


Zy(A) = K=1,2;.:.8. (7.5.4) 
The matrices Z;,(A) are usually referred to as Sylvester’s‘’ auxiliary matrices (one can recognize in Z,(A) the 
Lagrange interpolation polynomial). Actually, Z;,(A) is the projector operator on the eigenspace corresponding Ax. 
Note that the notation A — A means A — XI, and we will learn later (§7.6) that Z;,(A) = Res), R,y(A). 


Example 7.5.2: Let us consider a diagonalizable matrix 


i 2. 3 
A=| -3 -5 -3 
> 2 ot 


, 


with the characteristic polynomial (A) = det(AI — A) = (A—1)(A+ 2). Since the resolvent is 


i A-4 —6 —3(A + 2) 
R,(A) = ——___. 6 A+8 3(A + 2) 
7 (A—1)(A +2) eo) seo 2 


2 


its minimal polynomial is the polynomial in the denominator, (A) = (A + 2)(A — 1), which has degree 2. So 
the matrix A has one simple eigenvalue \ = 1 with eigenspace spanned by the eigenvector (1,—1,1)7 and one 
double eigenvalue \ = —2 with two-dimensional eigenspace spanned by vectors (1,—1,0)7 and (1,0,—1)7. The 
corresponding auxiliary matrices are 


1 1 1 0 -1 -l 
A+2I A-I 
a 1 1 1 aa -l1 -1 0 
For the function f(A) = e*’, we have 
0 -1 -l 1 1 1 
“Atae*Z yn tebZy=e"} 1 2 1 [+e] -1 -1 -1 
-1 -1 0O 1 1 1 


73 James Joseph Sylvester (1814-1897) was a man of many talents who took music lessons from Gounoud and was prouder of his “high 
C” than his matrix achievements. Florence Nightingale was one of his students. 
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Once the auxiliary matrices are known, other functions of the matrix A can be determined without a problem. For 
instance, cos(At) = cost Z(1) + cos 2t Z(_2). 


Example 7.5.3: Let A be the matrix from Example 7.4.6 on page 396, that is, 


Since the eigenvalues of the matrix A are distinct, the minimal polynomial is also its characteristic polynomial, that 
is, 


PA) = A-DA—2)(A—3) 


and Sylvester’s auxiliary polynomials become 


—-5 2 2 -6 2 2 

TARR ig aia es Oe -—6 3 2 a6: 2 2), 
oe ae ee eae 741 
-4 2 2 —-6 2 2 

Pity ees 6 4 2 =§ 22.1, 
oes) “7 & B 41 
—-4 2 2 —-5 2 2 
Z(3)(A) = (A D(A —~ 21) eee -—6 4 2 -—6 3 2 
(3-1B-2) 2) 7 4 3 742 


1 -5 2 2 -6 2 2 1 6 2 2 —-5 2 2 2 1 -2 
Za(A) = 5 —6 3 2]-|-6 2 2 =— —-6 2 2 -6 3 2|=]2 1 -2 
—-7 4 2 —-7 4 1 —7 4 1 —7 4 2 2 1 -2 
Similarly, 
2 -4 2 —-3 3 0 
Za(A)= |] 2 —-4 2], Z3)(A)= | —4 4 0 
3-6 3 —5 5 0 
For the following functions of one variable a e**, and sin) | we define the corresponding matrix functions: 
A+I 1+1 2+1 341 
= — Z,)(A) + —— Zp) (A) + —— Zi3) (A 
Ayad ~ Tp 2 2A) + 59 2a) + 35 Ze lA) 
2 1 -2 2 -4 2 -3 3 0 
= ; 2 1 -2 4) 4 2 -4 2) 4 a -4 4 0]; 
2 1 -2 3-6 3 5 5 0 
eAt =e Z1) (A) + et Z(2) (A) + @ Z(3) (A) 
2 1 -2 2 -4 2 -3 3 0 
=e |2 1 -2|}+e%}] 2 -4 2|/+e% |] -4 4 OO]; 
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The characteristic polynomial y(A) = \(\? +1) has one real null \; = 0 and two complex conjugate 
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nulls d2 = +j. 


Since A = 0 is an eigenvalue, A is a singular matrix. The minimal polynomial coincides with the characteristic one, 


namely, (A) = A(A — j)(A + j) and therefore the corresponding Sylvester auxiliary matrices are 


0 1 0 1 0 0 1 1 0 
Zo) =A? +I=]0 -1 0 |[+]/0 1 0]=]0 0 Of, 
0 0 -l 0 0 1 0 0 0 
‘ 0 0 1 j 0 1 0 -1 -j 
A(A+jlI 1 
(i) = Gael 3 00 -1]/-]0 j -1}/==]0 1 j jf, 
jG +3) 01 0 a ae 6 4 
A(A-jD = 1 : Be uy 
— = — eed —Jj 
(-J) = =3) G) ~ 5 0 j ‘ 
For the functions f(A) = (A+ 1)/(A+ 2) and g(A) = e*', we have 
A+I 0+1 ibm + 1 
= —Zo)(A Zj)(A Zi; (A 
A+2I 042 (0) ( ) 1-2 (i ( Va j =) (-3) ( ) 
1 +5 = 1/2 -—1/10 —1/5 
= 5 Z@(A) + =~ ZH(A) + | 2-p(A)=] 9 3/5 1/5 
0 -1/5 3/5 
and 
et = Zo(A) +e! Zi) +e Z_j) = Zo(A) + Rel" Z 
(0) Gi) (=) 0 
1 1 0 al 0 -1 -j 4 0 -1 j 
= 0 0 0 as 0 1 j se 0 1 -j 
0 0 0 0 -j 1 0 j 1 
We can break matrices Z(j) and its complex conjugate Z(_;) into sums 
Zj) =B+jC, Z_;)=B—jcC, 
where 
l 0 -1 O l 0 0 -l 
B=5 0 1 Of, C=5 0 O 1 
0 1 0 -1 O 
With this in hand, we rewrite e“! as 
e“t = Z(A) +e" (B+jC)+e% (B-jC) 
= Zo (A)+B (et + ew) +jC (et — e 4) 
= Zo)(A)+2B cost — 2C sint. 
Therefore, 
1 1 0 0 -1 O 0 O -l 1 1l—cost —sint 
e“t=|0 0 0] +cost/0 1 O]—sint}O O 1]=1|0 cost sint 
0 0 0 0 O 1 0 -1 O O -—sint cost 


1. For each of the given 2 x 2 matrices, find Sylvester’s auxiliary matrices. 


Calas ) |; 2): © [a al @ | 


© [5 6 © (i y): ela of @) |e ab 
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2. For each of the given 3 x 3 matrices, find Sylvester’s auxiliary matrices. 


aie 5; ~ se 3 2 2 i, 2oe0 

(a) | 34 16 —18]; (By 125 sa 594 (c) |1 2 1); 
17 7 5 & - Sh Ay 42 
A eB BR ond 5 a 23.1.6 

(a. ee Sa BS fe): |e = aes fi |e = 8 
ee = a ee: a) 


10 6 a a Oe ot 
a) |2 1 2/; (b) |-1 3 —1|; c) |1 2 1]; 
0 0 2 2 -4 8 a ae 
i a £0) a ae) 
Ay» |e or Bi (e) jo 1 1]; By! soe ds 2s 
3. 3-3 OO) -sat aD at 
98- 5.6. S16 =o JT - 3 HAG! 215.59 
g) |-11 4 81; (h) | 28 —5 -<21/5 Bi [adds SOR: 8 
cae eee -16 4 14 —126 —90 56 


4. For each of the given 3 x 3 matrices, find e**, ae and cos(At). 


f, oo mh Bo ag le OG “cal, Ne 
a): ae oh (by. |c6r 2882/3 (c) |-5 1 1]; 
=e -4 6 9 Oo 1. 38 
G2. M0 oe Ys cad 
(Cle sees Gee) (ae (ae ee (f) oa 54 
a a ee Oe. al a 02 
—12° 10 6 17 10 -6 8 5 -8 
(g) |-28 -18 12]; (h) |28 23 -12] ; (i) [14 11 -6 
—84 —60 38 84 60 —33 42 30 -17 


5. For each of the given 3 x 3 matrices, find square roots 


a ena 00 18). <9 89° 20: 412 
(a) |1 0 —-1]; (b) | -42 -29 18]; c) |56 44 —24]; 
rs Oe -126 —90 55 168 120 —68 
31. 25. 15 SP 100-778 121 96 —48 
(d) |-70 -46 30]; (e) | —336 —239 144]; f) te48° cos. oF 2 
—210 150 94 —1008 —720 433 144 144 —47 
265 —432 336 89 -144 112 64 -—100 80 
(g) |240 -407 336]; (h) }80 135 112]; i) [45-71 60]; 
120 —216 193 40 —72 65 15-25 24 
—191 —528 336 Gv 23 19 45 -—30 
(j) | 256 705 —448] ; (ky [2° Sr 8 I); |e 28-3 
288 792 —503 es 18 54 —32 
88 —693 441 16 12 72 94 —220 50 
(m) |21 —206 147]; (n) |3 7 18]; o) |45 106 25 
Oh O81 172 230 =a 24 45 —110 29 
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7.6 The Resolvent Method 


The goal of this section is to present another method for defining the function f(A) for any square matrix A and an 
“arbitrary” function f. Although the resolvent method was used for a while in operator theory, its application to 
functions of matrices is new and was developed by the author of this textbook. In this section, we consider functions 
that are defined in a neighborhood of every eigenvalue from the spectrum o(A) of the matrix A. This means that 
f(A) is mz — 1 times differentiable at every eigenvalue », of multiplicity m,. 

Recall from Definition 7.4 on page 383 that the resolvent of a square matrix A, 


R,(A) = (AL— A)7}, (7.6.1) 


is a matrix function depending on a parameter X. 


Example 7.6.1: Let us consider three 2 x 2 matrices: 
1 3 3-2 1 2 
mali A] mals 3] =| 4s]: 
The resolvents of these matrices along with their characteristic polynomials are 


i {ic 3 ; 
Ra(B,) = | ; 5 | with (A) =? —4: 


il A+3 —2 : 
1 A-—5 2 : 
R (Bz) = Dn3P | 29 \-1 with (A) = (A— 3)’. 


we have | 


a 1 
RA) =| : hi BA) = 


with corresponding characteristic polynomials y4(A) = (A—2)*, ye(A) = (A—1—j)(A—14-j) =(A-1)? +1 = 
\? —2\+2. As we know from Example 7.4.2, page 393, the matrix A is defective. 


A-1 j 


j A-1 j A-1 


Example 7.6.3: Let us reconsider matrices T, A, T;,T2, T3 from Examples 7.3.1 — 7.3.3, page 388. The minimal 
annulled polynomials of matrices (7.3.4) — (7.3.5) are as follows. 


1. H(A) = (A— 1)?(A — 2) for the matrix (7.3.4), page 388, because 


10 0] fi 0 3]\~ [x 0 TSO? 
_ pa 2(A+1 
R(T)=[{A]O 1 0/— 72 1 2 =|oep eT wiry 
00 t). “\@ @ 2 0 ie 


2. W(A) = (A+ 1)(\ — 2) for the matrix (7.2.3), page 383, because 
too) Jo a ay a 

R\(A)={r{o 1 of —]1 0 1 
0 0 1 1 1 0 -1 -1 4X 

l A-1 1 


= 1 A-1 1 
QO=DVA+H | | en” eed 
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3. W(A) = A —- 2 for the matrix (7.3.5)(a) because 


A 0 0 2 0 0 1 1 0 O 1 
R(T,)=(]o A of —lo 2 0 =< |0 1 0f=5 51 
0 0 X 0 0 2 7 0 0 1 a 
4. (A) = (A — 2)? for the matrix (7.3.5)(b) because 
4 0 0] f2 1 O]\— 1, pr-2 1 0 
R,(T2)=[]o0 » 0] —]o 2 0 =| 0 4-2 0 
0 0 X 0 0 2 (A — 2) 0 0 rA-— 2 


5. W(A) = (A — 2)? for the matrix (7.3.5)(c) because 


A 0 0 2 1a 
Rts (10 AO) = }0 B41 
0 0 xX 0 0 2 
i (A-2)2) \-2 14+a—2a 


0 0 (A — 2)? 


If the characteristic polynomial (A) of a real symmetric matrix A has a multiple eigenvalue, then its minimal 
polynomial is of lesser degree than y()). 


Example 7.6.4: Let A be a symmetric diagonalizable matrix: 


1 1 1 1 A 1 1 

A=]1 1 -1 => R)(A) = (AI— A)7? = ———~ ]1 A -1 
2-2 

1-1 1 1 -1 xX 


So its characteristic polynomial is y(A) = (A — 2)?(A+ 1), but the minimal polynomial, W(A) = (A — 2)(A +1) = 
\? — \ — 2, is of the second degree. Therefore, A? — A — 2I = 0 and A? = A+2I. Thus, 


1 2 0 0 3 
A?=]1 1 -1/4+]02 0/=/]1 3 =-1 
1 00 2 1 


Definition 7.16: Let f(A) = P(A)/Q(A) be the ratio of two polynomials (or in general, of two entire functions) 
without common multiples. The point A = Ao is said to be a singular point of f(A) if the function is not 
defined at this point. We say that this singular point is of multiplicity m if the nonzero limit 


Definition 7.17: The residue of the function f(A) = P(A)/Q(A) at the pole Ao of multiplicity m is defined by 


i J POWA—Ao)™ 
= oD FT OG) hh.’ (7.6.2) 


In particular, for m = 1 we have 
P(Ao) 


= Bo.” (7.6.3) 
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d PO)A= A? 
QQ) Irery’ 


and for m = 3: 
—1 @ PA)A= 0) 


2 di? Q(A) ity! 


If a real-valued function f(A) = P(A)/Q(A) has a pair of complex conjugate poles a + bj, then 
Res f(A) + R A) = 2 Res f(A) =2¥ R 
Ret) hey) Rei) Ree f(A), 
where §t stands for the real part of a complex number (so R(A + Bj) = A). 


Definition 7.18: Let f(A) be a function defined on the spectrum o(A) of a square matrix A. Then 


S- Res f(A) Ra(A). 


Aneo(A) 


Example 7.6.5: (Example 7.4.5 revisited) The matrix A = ; ° 


7 has the resolvent 


A-9 -8 


Raia) =| | sak 


Using Definition 7.18, we define the exponential of the matrix A as 


et — Rese™’ Ry(A) + Res e*’ Ry (A). 
A=1 A=16 


Since the eigenvalues of the matrix A are distinct and simple, we use the formula (7.6.3) to obtain 


Mt : 
A-8 8 a ee Ge, 
. At A 2 = TTF 
Rese“ Ry(A) (A—16)| 7 A-49]||,_,  -15|,7 —8]’ 
MM 16t 
7 — e* [\-8 8 al ee 
Rese Ra(A) = a | t rd vais 15 ; an 


Using these matrices, we get the exponential matrix from Eq. (7.6.7): 


e€ 


a. 2 8 cl6t 4 7 et 8 el6t _ get 
~ 15 |7et —7et Tel + Bell: 


Example 7.6.6: For the function f(A) = e* and matrices from Example 7.6.2, page 405, 


A=|j iy and B=| 5 r| (i? =-1), 


407 


(7.6.6) 


we can determine the exponential matrices using the residue method in the following way. In Example 7.6.2, we 


found the resolvents of the matrices A and B. Therefore, 


d A-2 j/2 
At _ dt _ @ art 
ev = pis e™ R)(A) = e | 12 Aq | 


=u [VD ue +l | 
il | ays 01 


A=3/2 


j/2 A= 


_ to sy2|-1 j 3t/2 |L 0 
5° j i| Te 0 1]? 


A=3/2 
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and 
eB = Res e’R)(B) + Res e’ R)(B) 
+j ij 
ert Xr =I) j ert Xr —1 j 
= Base a ota |Brsey (97 ao] 
— et Pu -l j et [do-l j 
~ x!(A1) j Ai -1 x! (A2) J Ag =|” 


Since Ay = 1+j and Ag = 1 —j are simple nulls of the characteristic polynomial (A) = A? — 2A + 2, we can use 
Eq. (7.6.3). Calculations show that 


x’ (Ar) = 2(r1 = 1) = 2j and x (Az) = 2(A2 = 1) = —2j. 


Hence, 
He oe | jj | oe | -j j | 
e = ; aw SE 7 P ; 
2j J J —2j JI —J 
ease | 1/2 1/2 | ape | 1/2 1/2 
1/2 1/2 -1/2 1/2 
_ zllet+e# et-e*] 4 [ cost jsint 
~ © 9 | test eit pet | ~ jsint cost 
because sint = 7 edt — oF e# and cost = $e¢ + 5e%. 


Example 7.6.7: (Example 7.6.1 revisited) For the matrix B,, we have 


eBit Res ert R)(Bi) + Res ert R)(Bi1) 


1/3 3 i oe 
= $204 a a 
- “af; i © a 2: 


Similarly, 
eB2t _— Res ert R,(B2) 4 Res ert R)(Bz) 
j -j 
_ wl ti -2)_ pnt fe-3 -2 
23| 5 j-3 2j| 95 —j-—3 

_  jcost+3sint —2sint 

7 5sint cost — 3sint| * 
Actually, we don’t need to evaluate the residue at 4 = —j because we know the answer: it is a complex conjugate of 


the residue at A = j. Therefore, we can find the residue at one of the complex singular points, say at A = j, extract 
the real part (denoted by #), and double the result: 


est — OF Res eR) (Bo). 
=j 
For the matrix B3, we have 


M 
A-5 2 
Bst os . pAt = 
e - Res e’ R)(B3) = Res Dap | | 


_., Ah. [xe Ae oe <2 
a a | 
1 O 
+o | | 
= 0 1 a 


=9 1 0 1-2 2% 
= 3t 3t 3t 
= = | te E i|=e = a 
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Although the function f(A) = cos At can be expressed as the sum of two exponential functions, cos At = $e! + 


, it is instructive to follow the procedure. Therefore, we have 
cos (B,t) = Res cos(At) R (Bi) + Res cos(At) R(B1) 


7 cos(At) [A+1 3 cos(At) [A +1 3 
= Re Ot | i i] +85 Lael 


-2 2-4 
cos2t |3 3 cos2t |—1 3 1 0 
3 = cos 2t 0 . 


i pit 
5) e 


1 1 —4 1 


For the matrix Bz, we calculate only one residue, say at the point \ = j: 


= cos(At) [A+3 -2 ]  cos(At) [A+3 —-2 
ips eon) BAS) NS a | 5 — ~ 3x 5-3 


_ cos(jt) [3+j —-2 
2j 5 j-3]" 


r\=j 


Since cos(jt) = cosht = $e’ + $e7', we get 


—3/2 | cosh t F ie 


Hes cos(At) Ry (Bz) = j cosht ie 3/2 > 10 1 
Extracting the real part and multiplying it by 2, we obtain 
cos (Bat) = (cosh t) I, 
with I being the identity matrix. For matrix B3, we use formula (7.6.4): 
A cos At Ea 
=3 


cos (Bst) = Res cos(At) R)(B3) = Da 


1 O . —2 2 
= cos 3t F | —t sin 3t ze il 


Example 7.6.8: Let us consider the non-diagonalizable matrix 
5 -1 
a-[5 3]. 


Since the resolvent of the given matrix is 
1 A-3 -1 
R= Goa ans) 


we find a root of A by calculating the residue at = 4: 
VA fA-3  -1 d A-3 -1 
Jn ave [T* ac 


A = Res ~~" 

va Ree (7 A—5]~ a 
_ifi -1),,ft 0) 1 fo -1 
“|i -—i]"“|0 a) -alt 7) 


which is indeed the root of A. Actually, the given matrix has two square roots; one is just the negative of the other. 


Example 7.6.9: Let us consider the following 3 x 3 defective matrix: 


3 38 -4 
A=|-4 -5 8 |, ~— with y(A) =det(AI— A) = (A—1)*. 
—2 -3 5 
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Its minimal polynomial 7(A) = (A — 1)? is not equal to the characteristic polynomial y(A). The eigenvectors 
corresponding to the eigenvalue A = 1 are as follows: 


x = (2,0,15*, X_ = (—3, 2,0)’. 


The matrix A is neither diagonalizable nor normal, that is, AA* 4 A*A since 


34 —59 —35 29 35 —54 
AA* =| —59 105 63 , AA= 35 43. —67 
—35 63 38 —54 —67 105 
The resolvent of A is 
1 A+1 3 —4 
R)(A) = 5 4 Y-—7 8 
(A) S -=3 Res 


q A+1 38 —4 
e*! = Rese™ Ra( =e =f jae <8 
2 B. APS s 4 
1+2t  3¢ —At 
=e! —4t 1-6t 8 , 
—2t —-3f 1+4¢ 
A+1 3 —4 
A-I d A-1 
—— = Res R)(A) = A\-7 8 
AS] "a Abd d\ A+1 Se Kal 
1 3/2 -2 
-1 -3/2 2 


1. Use the resolvent method to compute A?, A?, A‘, and A® if A is one of the following 2 x 2 matrices: 


@ [3 oo): OM Para Oa el © |S 
6: ed the vesolveuk OF theeachwt tie following 9°¢ 2 matrices and caleuia a. 

@ | 3]: w) |} 3]: ©, Sh @ |* i) 

ee | oe <a @ |. 5) a) |, 
j Re Oc aie Miyano Deere mala Hahaha 

ea ah wo) |f o3l: ®: [eo qa) [| 

of] of 2 bd of gf 

of Gy als Caer Ole esl: 


4. For each of the matrices in the previous exercise, determine sunt aas and cos (vA). 
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5. For each of the following 3 x 3 defective matrices, find a square root. 


Bh Bs OT SSB. 0 
(a) {15 1 4]; (6b). | TN (ey (=a 7 <9 4 
OO 58 a ae | 3-6 -8 
6 0 4 7 2 26 6 0 -4 
(d) |3 4 5]; (e) Jo 1 Of; (Oe Be Be Dy 
1 0 6 6 -5 —5 — 0 14 
hi “Bg 9 LoS 51 a Gt 
(g) |1 3. 2]; (bh) |-1 1 2]; (i) 3 -1 
2 ba = hae ae a a | 


6. For each of the matrices in the previous Problem, determine so and cos (Vat). 


7. Use the resolvent method to compute A?, A®, A*, and A°® if A is one of the following 3 x 3 matrices: 


3 -l -l 3.0 4 4 2 -2 
(a) }-2 3 258 (b) Ie Ale 1 y| 3 (c) |-5 3 2]; 
4 -l -2 -5 0 -5 —2 4 #1 
3.2 4 4 2 -6 -3 7 -8 
(d) j2 0 2]; (e) |-6 -3 8]; (f) 4 -1 4 
4 2 3 2 1 -83 2 -2 5 
8. Find the resolvent of the each of the following 3 x 3 matrices and calculate e**. 
2 0 3 5 2 -l 1 15 —-15 
a) |O 3 1); b) |-3 2 2]; (c) |-6 18 —22]; 
0 -1 1 de 35. 2 -3 ill -15 
-15 -7 4 1 8 3 3.5 -83 
d) 34 16 —18]; e) |-4 -1 2]; (f) eae Ca 
17 7 5 4 5 2 -1 2 7 
—27 32 —-4 1 2 -1 1 2 -1 
g) |-14 17 -42]; h) {1 1 -2]; (ij) {2 1 -1 
70 -—80 11 3-4 -1 8 -3 -l 
9. Find a cube root of the matrices 
11 -9 29 21 24 -3 61 9 
Orla a Ole. ab Cul legeral: @| rage | 
0 il -1 3 33 16 15 14 
(e) -1 2 |; (f) —6 10 ie (8) 62 32 [F mw 1213 is 
10. Find a cube root with real entries for each of the following matrices 
9 -1 -l —2 1 1 5 1 -1 
(a) {1 1 -I1]; (b) |}-1 0 1]; (c) |-20 8 19 
1 -l1 7 -1 1 8 —4 11 16 
11. Show that the following 3 x 3 matrices have no square root. Nevertheless, find the matrix functions Y(t) = cos (Vat) 
and ®(t) = A7'? sin (av ae and show that they satisfy the second order matrix differential equation X + AX = 0. 
-1 11 -1 2 -1 -1 -1 6 -1 1 -2 
(a) |-1 0 1]; (b) 1 -—-2 11]; (c) }-1 -1 —-2]; (d) |-3 -3 3 
-1 11 1 -8 8 -1 -1 2 2 -2 4 
12. The following pairs of matrices have the same eigenvalues. Use this information to construct e* for each of the matrices. 
-1 3 -!1 -1 3 -!1 1 -3 6 59 —100 80 
(a) 2 -2 1], 1 -2 2]; (b) j}2 3 -1), 45 —76 60); 
1 -9 1 -3 8 2 4 -2 15 -25 19 
1 3 -1 1 3 -l 1 2 -1 -15 8 4 
(c) }3 1 —-1], 3 1 -I); (d) ]2 1 -1], —24 13 6 
8 -—2 -1 9 -3 -l 7 -3 -l -8 4 8 
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7.7 The Spectral Decomposition Method 


The matrix spectral decomposition method is a particular case of a more general method that is used in functional 
analysis and quantum mechanics, namely, spectral operator decomposition [42]. In this section, we will discuss in 
detail computation of the matrix exponential for an arbitrary square matrix, as well as some trigonometric functions 
of a matrix variable. Such functions play a central role in constructing solutions of vector linear differential equations. 

The exponential function e* of the complex number \ may be defined by means of the corresponding Maclaurin’s 
series 

y A HP Xr” 
e€ eae (Ri ee te 

This gives us the key step to define the exponential matrix e“ of an n x n matrix A as the n x n matrix defined 
by the series 


A _ 40 eee th ae, oe 
ae ec ar ae cee 
where A° =I (the n x n identity matrix). Actually, we are interested in the general exponential function 
A? A3 A” 
a SAL et ae Pee ts (7.7.1) 


for some parameter ¢ (usually associated with time). To emphasize that e“* is a fundamental matrix (see §8.1), we 
also denote it by ®(t). The meaning of the infinite series on the right-hand side in Eq. (7.7.1) is given by 


def Ad . k A™m tM 
®(t) eA = lim | > ; (7.7.2) 


m=0 


where the limit is taken with respect to a matrix’? norm. Actually, the matrix exponential (7.7.2) is the solution of 
the following matrix initial value problem, which is equivalent to the companion integral equation: 


6=-AS, O(0)=-I &(t) = (0) + [ A®(r) dr. 
0 


To solve the above integral matrix equation, we apply the Picard method (see §2.3) starting with the initial approx- 
imation ®) = I, where I is the identity matrix. This leads to the sequence of matrix-valued functions 


t 
Bult) =1+a | ®,,(7) dr, n=0,1,2,.... 
0 
In particular, 


t 
®,(t) = +a / dr =1+ At, 
0 


t 
1 
®,(t) 1+A / @i(r) dr =14+ At + 5 A707, 


and so on, which leads to Eq. (7.7.1) because ®,(t) = I+ At + - e+ a Bpeeet A ae 
Theorem 7.14: For square matrices A and B of the same dimensions, the following relations hold: 


e a oat =AcAt—eAt A, 
dt 


‘ ey 22 


? 


ee 7" —e ce ee ifandonly if AB— BA; 


? 


e dete“ 4 0 for any t. 


“4There are many definitions for the norm, denoted by ||A||, of a square matrix A. The norm of a square matrix is a generalization of 
the length of a vector. 
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The proof of these results requires a precise definition of convergence of an infinite series of matrices and it 
involves the properties of the norm of a matrix. This would take us far astray from the main goal of this book. & 


Example 7.7.1: Consider a defective 3 x 3 matrix 


1 1 0 0 0 
A=/]0 1 0] =I+E, where E= |0 0 
0 0 1 0 0 


oo 


is a nilpotent matrix (so E? = 0). The matrix A has the triple eigenvalue \ = 1 with only two linearly independent 
eigenvectors (1,0,0)7 and (0,0,1)7. Therefore, this matrix is not diagonalizable. Raising A to the second power, 
we get 

A? = (I+E)? =1+2E+E’=1+2E 


since E? = 0. Repetition of this process leads to 
A”°=T+4+nE, n=1,2.,.... 


This formula allows us to find the exponential matrix: 


1 1 
Fe se I+(+E)t+ 5 (+2E)P +5 (I+3E) e+. 
Pe SE ca 
= iS as 
+yitgit+gh6 


ri 38 4t* 
+Et+ = E+ E+—E+.- 


3! 4! 
t #2 # # to 
= I Man ai tata +-.-|+Et]1l+t+— Tha es 


= Te'+Ete’=Ie'(1-t)+Ate’. 


At 


Hence, the exponential matrix e“! is a linear combination of two powers of the given matrix I = A° and A. 


Eq. (7.7.2) forces us to compute a power A™, where m is an arbitrary positive integer and A is a square n x n 
matrix. We define the powers of a square matrix A inductively, 


A™+1 — AA™ A° =I, m=0,1,.... 


It is known that sums and products of square matrices are again square matrices of the same dimensions. We can 
also multiply a matrix by a constant. Therefore, we can compute linear combinations of nonnegative integral powers 
of the matrix. For any polynomial q(A) = go + GA + qd? +-+++GmA™, we can unambiguously define the function 
q(A) for a matrix A as 

g(A) = gol + mA + QA? +--+ +qmA™. 


Having defined polynomial functions, we may consider other functions of an arbitrary matrix A. If a function f(A) 
can be expressed as a convergent power series 


iC ea, 
m=0 
then we can define a matrix function f(A) by 
N 
f(A) = vim, Do dm A™ 


for those matrices for which the indicated limit exists. The spectral decomposition method allows us to define a 
function of the square matrix without actually applying the limit. 

If v is an eigenvector for the matrix operator A that corresponds to the eigenvalue \, then A?v = A(Av) = 
A(\v) = \Av = A?v. 

In the general case, we have q(A)v = q(A)v for an arbitrary polynomial or analytical function q(A). Hence, every 
eigenvector of A belonging to the eigenvalue 2 is an eigenvector of g(A) corresponding to the eigenvalue q()). 
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Example 7.7.2: Let A = and q(\) = (A+1)? = \?+2\+4+1. This matrix has two real eigenvalues, \1 = 1 


1 3 
0 2 
and A2 = 2, with eigenvectors u, = a and uz = fie respectively. Indeed, Au, = uy and Aug = 2ug. 


def 


We can define q(A) in two ways. For the first method, we calculate B= A+I= F | and raise B to the 


4 15 
0 9 


ay=ar+2ast= [0 f+2[5 +o il=([0 os]: 


second power: B? = | 04 


| The second method consists of calculating the second power of A: A? = 0 4 
and using q(A): 

0 4 0 2 0 1 0 9 
The matrix g(A) has eigenvalues ju; = 4 and p2 = 9, corresponding to the same eigenvectors, u; = (1,0)? and 


ug = (3, i. 


Example 7.7.3: We reconsider Example 7.6.4 on page 406. Since the minimal polynomial for the given symmetric 
matrix is #(A) = (A—2)(A +1) = A? — A— 2, we have the relation between the first two powers of A: A? = A + 2I. 
The next powers of A are 


A? = AA*=A(2T+A)=2A+4+ A? =2A+4+(21+A) =214+3A, 
A* = 2A+3A?=2A+3(21+A)=61+4+5A, 

A® = 6A+5A?=6A+5(21+A)=10I1+11A, 

A® = 10A+11A?=10A+11 (21+ A) = 221+21A, 


and so on. So all powers of A are expressed as a linear combination of the matrix A and the identity matrix I. 


Let w(A) be the m-th degree minimal polynomial of an n x n matrix A, that is, 
POVH Sy be tee tice, grt ee 


Then 
wh (A) = col +c, A + cg A? +++ + emi A™ 1 +A™=0, 


where I is the identity matrix, and 0 is the n x n matrix of zeroes. We can express A™ as a linear combination of 
lower powers of the matrix A: 


A™ = —[coA° +c, A + cA? 4-0) +6m1A™}]. (7.7.3) 


Multiplication of this equality by A yields 


AA™ =Amtl = [coA t c, A? t co AB free pak . 


We can substitute expression (7.7.3) for A™ to obtain a polynomial of degree m—1. Hence, the m-th and (m+ 1)-th 
powers of the matrix A are again linear combinations of powers A’, 7 = 0,1,2,...m-—1. In a similar manner, we 
can show that any power A”, k > m, is a linear combination of powers A’, j = 0,1,...m—1, that is, 


A® = ao(k)I + a1(k)A + ao(kK)A? +++» +Qmi(K)A™ |, k=m,m+1,.... (7.7.4) 
We can extend this equation (7.7.4) for all positive k by setting 


jf 1, ifg=k<m, 
aj(k) ={ 0, iff Ak<m. 


For example, 
a;(m)=—-c;, jg =0,1,2,...m—1. 


The Cayley—Hamilton theorem (page 400) assures us that any power of a square n x n matrix A can be expressed as 
a linear combination of the first n powers of the matrix: A° =I, A, A?,..., A"~!. Therefore, any series in A can 
be expressed as a polynomial of degree at most n — 1 in A, regardless of the value of n. If the minimum polynomial 
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is known to be of degree m (m < n), only m powers of the matrix are needed to define all other powers of A. This 
allows us to make an important observation about any analytical function of a square matrix: it can be expressed 
as a linear combination of a finite number of the first m powers of the given matrix. 

Now we substitute Eq. (7.7.4) into the expansion (7.7.1) for e**: 


Therefore, 


es b;(t) AY, (7.7.5) 


where 


Thus, the infinite sum (7.7.1) that defines e“* reduces to a finite sum. Of course, its coefficients b;(t) are represented 
as infinite series. Fortunately, coefficients b;(t), 7 = 0,1,...,m— 1, in expansion (7.7.5) can be determined without 
the tedious computations of series. This simplification is stated in the following theorem. 


Theorem 7.15: Let (A) be the minimal polynomial of degree m for a square matrix A. The coefficient 


functions b;(t), 7 =0,1,...,m—1, in the exponential representation (7.7.5) satisfy the following equations: 
et = bolt) + biG Ae + + bm Ar >, k= 1,2,...48, (7.7.6) 
where Ax, k = 1,2,...,5, are distinct eigenvalues of the square matrix A. 


If the expansion (7.5.1), page 400, of the minimal polynomial w)(X) contains the multiple (A— A,x)"*, mz > 1, 
we include in the system to be solved m, — 1 additional equations 


m,—1 “1 


Pet = > —4 _oi(t)e?, p=1,2,...,me—1. (7.7.7) 
a Ue) 
j=p 
Remark 1. Equation (7.7.7) is equivalent to the following equation (p = 1,2,...,m,z — 1): 
d? et dP 
. a (bO+hOAt-+b, 40a" (7.7.8) 


dAP |y_y, dr? N=Ab 


Remark 2. Theorem 7.15 holds for any annulled polynomial, including the characteristic polynomial, instead of 
the minimal polynomial. The following example shows that using a characteristic polynomial to construct a function 
of a matrix is not optimal when the minimal polynomial has a lesser degree than the characteristic polynomial. 


Example 7.7.4: (Example 7.5.2 revisited) Let 


i 6. 3 
A=|-3 -5 -3 
, & 4 


Its characteristic polynomial, y(A) = (A+ 2)?(A — 1), is not equal to the minimal polynomial w(A) = (A + 2)(A— 1). 
So we use 7)(A) to define e@*. From Eq. (7.7.6), it follows that 


e! = bo(t) + bi(t), 
e 7 = bo (t) = 2b; (t). 
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We subtract the first equation from the last one to obtain 


1 1 
—3b,(t) =e **-—e! or b(t) = =e’—-=e*. 
3 3 
Then 
bo(t) = e' — bi (t) = 2 + 1 ae 
3 3 
Plugging these coefficient functions bo(t) and 6;(¢) into Eq. (7.7.5) yields 
2 1 1 1 
eAt = € e+ 3 ) I+ € ef 3 - A 
a et — e-2t et — e-2t 
= —ef+e 7% eb ide ei + e—% 
ef — e-2t ef — eW2t a 


Now suppose that we decide to use the characteristic polynomial y(A) = (A + 2)?(A — 1) instead of (A). Then 
eAt = bo(t)1 + bi (t)A + ba(t)A?, 
where the coefficient functions bo(t), b1(t), and be(t) satisfy the following system of equations: 


et = bo(t) + by (t) + bo (t), 
e ** = b(t) — 2b: (t) + 4b2(t), 
t e 7 = by (t) = Abo (t). 


We rewrite the above system of algebraic equations in vector form X b = v(t), where 


1 1 1 bo(t) et 
X=]1 -2 4], b=|bi(t)|, v(t) = | e~” 
0 1 -4 bo(t) ie ** 


Since the determinant of the matrix X is not zero, det X = 9, it is invertible, and we find the vector b directly: 


4 5 2 ef 1 de? +5e-7* 4 6te~? 
4 -4 -1 et | = ; 4e! —4e-7t — 3te—*! 
1 -1 -3] |te-%* et — e—7# — 3te—# 


ee 
9 


With this in hand, we calculate the exponential function 


4 5 10 0 
eAt = € et + 5 ee -) 0 10 
00 1 
1 3 3 
4 4 1 
+ (; et — 9 e 7 te) —3 -5 -8 
3 3 1 
1 -3 -3 
1 I 1 
+ ( ef = 9 et 3 te) 3 7 3 
-3 -3 1 
a ef — e-2t ef — e-2¢ 
= —e tte eb 4+ Qe-% -eb+e-% 
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Spectral decomposition method for “arbitrary” functions 


We can extend the definition of the exponential of a square matrix to an “arbitrary” function. Let f(A) be an 
analytic function in a neighborhood of the origin. In this case, it has a Maclaurin series representation: 


FO) = fot fAAF fo? toot fre te = SO far®, 
k=0 
Then f(A) can be defined via the following series: 
F(A) = fol+ fiA+ fod? te fad += D0 fed", 
k=0 


subject that this series converges. From Eq. (7.7.4), it follows that f(A) is actually the sum of the m powers of A, 
namely, 


love) love) m1 m—1 love) 
FA)=>> frA® = >> fe | D> aj (e)A®] = So aA* ( i) 
k=0 k=0 j=0 k=0 


j=0 
Therefore, 


f(A) = S* b AY, (7.7.9) 


where the coefficients 


should satisfy the following equations: 
f (Ak) = bo + b1AK +++ + Oma (7.7.10) 


for each distinct eigenvalue rz. If the eigenvalue Ax is of multiplicity m,z, then we need to add mz — 1 auxiliary 
equations similar to Eq. (7.7.8): 


df) d =f 
ma bo + yA + +++ + b5—-1A° , p=1,2,...,m_,—1. (7.7.11) 
dNP fy, dP ae 


The above formula shows that the function f(A) must have m;, — 1 derivatives at each eigenvalue Ax. 
If the matrix A is invertible (det A 4 0), the above definition can be extended for functions containing negative 
powers. Since a unique inverse of A exists, the relation 


A~” = (A“*) », n=1,2,--- 


returns us to positive powers of A~!. | 

Remark 3. The coefficients bo, b1, ..., bm—1 in the expansion (7.7.9) are completely determined by the eigenvalues 
of the given square matrix A. Therefore two different matrices with the same eigenvalues have the same coefficient 
functions b; in Eq. (7.7.9). 


Remark 4. Generally speaking, we are allowed only to use this approach for holomorphic (analytic) functions 
(which are sums of convergent power series) in a neighborhood of a spectrum. For example, we cannot determine 
VA for a singular square matrix A with the aid of the spectral decomposition method because the corresponding 
function f(A) = A/? does not have a Maclaurin representation at \ = 0. However, if f(A) is a smooth function in a 
domain including the spectrum of A, spectral decomposition is applicable to define the corresponding square matrix 
f (A). | 

Let us show that for 2 x 2 matrices (as well as for any matrix having a minimal polynomial of the first degree), 
a square root can be obtained with the aid of the spectral decomposition method only when the matrix coefficients 
satisfy a special condition. Suppose we try to find a root of a square 2 x 2 matrix A in the form 


VA = bol +b, A. (7.7.12) 
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Squaring both sides, we obtain 
2 
A= (vA) = (bol + by A)? = B21 + 2bpb, A + B2A2. 
Suppose we know the minimal polynomial 7(A) = A? + cA + co, where cy = —tr(A) and co = det A. Then 
A? = —coI — c, A. Using this equation, we find 
A = boI + 2bobi A — bf [col + cr A] = (06 — cobt) T+ (2bob1 — cib7) A. 

From the latter, we obtain two nonlinear equations to be solved for bo and by: 

be = b7 det A, 1 = 2bob; + b7 tr (A). (7.7.13) 
If this system of equations has a solution (not necessarily unique), the spectral decomposition method is applicable 
to find a square root. When det A = 0 and tr A = 0, Eq. (7.7.13) has no solution. 


1 
1 
tr A = 0 and det A = 0, so the conditions (7.7.13) are violated. Suppose we want to find a square root of A. Let us 
see what happens if we ignore this warning. Then we seek a root as a linear combination of matrices I = A° and A: 


VA = bol +0, A, 


Example 7.7.5: The nilpotent matrix A = =| has a double eigenvalue \ = 0 and A? = 0. For this matrix, 


where coefficients bp and b; should satisfy the equations 
v0 = bo, and 1= by 2/0. 


Since this system of equations with respect to bo, b; has no solution, the given matrix has no square root. But 
2 3 
matrix-functions cos (A1/?#) =I- 4 A and A7!/?sin (A1/?t) =tI1— 4A exist. 


4 2 0 AP Zea) G 21 «2 
Let us look at the 3 x 3 matrices B= |0 0 O|, B*= | 0 0 O},andR=]0 0 —2z2}. If we try 
0 0 0 0 0 0 0 0 O 


to apply the spectral decomposition method to find a square root of B, we would assume that 
VB = bol +b, B + b2B?. 


Squaring both sides yields 
2 © © 
(vB) = B=dj1 + b7B? + 16b5B? + 2b9b:B + 2b9b2B? + 8b1b2B? 


because B? = 4B? and B* = 16B?. Equating coefficients of I, B, and B’, we obtain the system of nonlinear 
equations 

6 =0, 1=2bobi, 0 =b7 + 16d} + Bobs + 8bibo, 
which has no solution. Therefore, the given matrix B has no square root expressed through powers of B. On the 
other hand, this matrix has infinite many roots because R? = B for any x. The matrix-function cos (vB) exists: 


1 1 1 t? 1 
cos 2t q cos2t— 4 ig COs2t+ g — 36 
t 


cos (vB) = 1 40 1 -= , which is the solution of 6+B & = 0, (0) = I, (0) = 0. 
0 0 1 


Example 7.7.6: It is not hard to verify that the following diagonal matrix A has two square roots +R, where 


4 0 0 1 2 0 0 
A=]0 0 0 and =, 5 = 0 0 0 
0 0 0 0 0 0 


Its minimal polynomial is (A) = A(A— 4) = A? — 4A, while its characteristic polynomial is y(A) = A?(A— 4). Hence, 
A? =4A. We use (A) to construct a root of A: 


VA = bol +b, A 
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because it is assumed that any function of the matrix A is expressed through two powers of A: I = A® and A. 
Squaring both sides of Eq. (7.7.12) and substituting A? = 4A, we obtain 


A = (bol +b, A)? = b21 + (2bob; + 402)A. 


Equating the same powers of A, we see that coefficients bp and b; should have the following values: bo = 0 and 
4b} = 1. This leads to two roots: +R. 

Now suppose we would like to use the characteristic polynomial instead of the minimal polynomial. According 
to the Cayley—Hamilton theorem, page 400, we have 


x(A) =0 or A? = 4A”. 


Hence, any power of A greater than 3 is expressed through its first three powers: I = A°, A, and A?. Now we 
assume that the square root of A is also represented as 


VA = bol +b) A + bo A?. 
Squaring both sides, we get 
A = BAI + 2b9b1 A + bobo A? + 2b: bo A? + DIA? + D2 A4, 
Substituting A? = 4A? and A+ = 16A? and comparing like powers of A, we obtain the system of nonlinear equations 
66 =0, 2bob1 = 1, b] + 2bobe + 8b1b2 + 1605 = 0, 
which has no solution. Therefore, the characteristic polynomial cannot be used for the calculation of VA. This 
conclusion is true not only for a square root function, but for any function having a derivative which is undefined at 


A = 0. However, if the function has continuous derivatives in the neighborhoods of eigenvalues, it does not matter 
whether a minimal or characteristic polynomial is utilized. 


Example 7.7.7: Consider two 2 x 2 matrices 


30 —-4 9 
ne ek Bele 


having the same eigenvalues: A; = —1 and Ag = 2. To find the exponential functions of these matrices, we need to 
determine the function bo(t) and 6; (t) from two simultaneous algebraic equations: 


e = bo(t) _ by (t), e* = bo(t) + 2b1(t), 


1 2 1 1 
which yields bo(t) = = e7* + = e7*, by (t) = 3 et 3 e'. We then construct both exponential functions using the 


same functions bo(t) and 6; (t): 


1 4 —4 1 _,/-1 4 
Aswotro@asge[t c]etet{) f], 


eBt — bo (t)1-+b:()B =e ee | et Ee = 


If, for a square matrix A, the minimal polynomial is w(A) = (A — Ao), then any (holomorphic) function of this 
matrix is 


f (A) = bol + bi A = (f(Ao) — Aof’o)) 1+ f/Ao)A 
because bp = f(Ao) — Aof’(Ao) and by = f’(Ao). In particular, 


eAt = (1— Apt) Te + ter A. (7.7.14) 
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2 4 3 
Example 7.7.8: The matrix B = | —4 -—6 —-3 | has the characteristic polynomial y(A) = (A + 2)?(A — 1), 
3 3 1 


which coincides with the characteristic polynomial of the matrix A from Example 7.7.4. Since the determination 
of coefficients in the expansion (7.7.5) depends only on the characteristic/minimal polynomial, we can use them for 
any matrix having the same characteristic polynomial. Therefore, 


eB = bo(t)I + bi (t)B + bo(t)B? 
io 0 
= (Fee Fets cre) 0 1 0 
001 
2 4 8 
+ (Fe-3 a ste) -4 -6 -3 
s. 3 1 
-3 -7 -3 
+ € e? — se _ ste) 7 11 3 
-3 -3 1 
Oo -1 -1 1 1 0 
= e] -1 -1 -1|/4+e 7] 1 2 1 |+te*] -1 -1 0 
1 it 3 -1 -1 0 0 0 0 


We can use Maple to perform all of the operations and check our answer. First, invoke the linalg package and 
define the matrix of coefficients needed for determination of bo, bi, and bo: 
B:=matrix([[1,1,1],[1,-2,4],[0,1,-4]]) 

Next we find its inverse to calculate the values of coefficients b; (j = 1, 2,3): 

Binv := inverse(B) 

y:=matrix([[exp(t)], [exp(2, t))], [exp(3, t))]]) 

BB := multiply(Binv, y); II := Matrix(3, shape = identity); 

expAt:=evalm(II*(BB[1, 1] + BB[2, 1]*A + BB[3, 1]*A*A) 


If the coefficients b; (j = 1, 2,3) are known, we can calculate e' by executing one command: 
expAt :=evalm(scalarmul (array (identity,1..3,1..3),b0) + scalarmul(A,b1) 
+ scalarmul (multiply(A,A) ,b2))) 


1 4 
2 -1 
3—;. The given matrix has the characteristic polynomial y(\) = A? — 9 = (A — 3)(A+ 3) with simple eigenvalues 
A = +3. We seek the ratio in the form 


Example 7.7.9: Let us define some functions for the matrix B = . We start with the function f(A) = 


B-I rA-— 2 

Bol boI+6,B and Ved o + byA 
Setting A = 3 and \ = —3 in the latter equation gives 

3-1 —3-1 

341 bo +3 Ls and —341 0 351 


Solving this system of algebraic equations with respect to by and b;, we obtain 


Hence, 
B-I 5 


Bat5, 1,.5/20)].1/7% 4].[/ © 
B+I 4 4 4{0 1 4|2 -1]° | -1/2 3/2 |° 
To define cos(Bt), we need to find the function bo(t) and b;(t) such that 


cos(Bt) = bo(t)I + bi (t)B. 
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For these functions, we have two equations: 


cos 3t = bo(t) + 3b; (t), 


from which 6; = 0 and bo = cos3t, and we get cos(Bt) = cos3tI = ew a . Since B? = 9I, the matrix- 


function cos(Bt) satisfies the matrix differential equation 


d? cos(Bt) . d? cos(Bt) 
a + B* cos(Bt) = 0 or a +9 cos(Bt) = 0. 


1. Let B be a square matrix similar to A, so that B = S~'AS, with detS 4 0. For an entire function f(), show 
that f (B) = S-'f(A)S. In particular, show that their resolvents are similar matrices: R)(B) = (AI—- B)7' = 
S7'R, (A)S. 

2. (a) Compute bo(t) and 61(t) in the expansion (7.7.5) for matrices 


1 O 2 0 
a=| 3 S| and B-| % ak 


(b) Compute the exponential matrices e“* and e®*. 


At and eB commute with each other? 


(c) Do the matrices e 


(a) Compute bo(t), b1(t), and b2(t) in the expansion (7.7.5) for the defective matrices 


3. 
-l 1 4 1 1 1 
A= 3 1 -4 and B=|]2 1 -1 
-1 0 3 0 -1 1 
(b) Compute the exponential matrices e*! and eP*. 
(c) Do the matrices e*! and e®’ commute? 
sin(At) 


4. For each of the 2 x 2 matrices from Problems 1 — 3 in 87.3, page 390, compute eft , and cos(At). 


5. Show that the following matrices have no square roots: 


@ [ol ®) [ty Ap © [4 4} @) [7% 3): 

@ [73 a: (f : @ [73 3) (h) 
é: feb tin tye Dannitoms coiled” onan tel a ns 

@ [af 17 Syl: © ff ash @) [73 3): 

@ [gf © | 3): @® ft 7): [io |) 


ofa: 9 Bap ES oR 
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7. For the given 3 x 3 matrices, compute ere cos(VAt) =F 
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AVE ang SWAT) 
VA 

1 3 —A —-1 -1 2 
3 1 44; (c) |-1 9 1]; 
1 -2 6 —2 -1 3 
—-4 -—2 6 6 0 —-4 
6. eBe. =8l (Cogn oer See 
—2 -1 3 1 0O 2 
3 3 —4 3 —8 —10 
Armee (Bis Glan oF Bs 
—2 -3 5 2 —-6 —7 
1 1 0 3 —8 -—10 
ae ee) (hy [+8 7%: <9 
0 —1 1 3 —6 —-—8 


8. For each of the following 3 x 3 matrices, compute a determine sa and cos (VAt), and then show that the 
result does not depend on the choice of the root. 


(a) 


(d) 


(g) 


2 -2 3 2 #4 
(a -4 4 3], 1 5 
2 1 O -4 4 
3 1 O -9 -3 
(c —-8 -6 2], 2 1 
-9 -9 4 ll 3 
1 2 -1 39 —46 
(e 2 1 -tI1], 38 —45 
9 -5 -l 38 —46 
10. 
in terms of the golden ratio @ = (1 + v5) /2: 
11. Show that the matrix A = 0 
12. Repeat the previous question for the matrix A = 
13. 
some positive integer p. 
14. 


-5 1 0 

i. eos 
ain a al 

2 -2 -1 

Di 20. aa 
3-3 -4 

-5 1 -l 

1 -9 3 ; 
2 —-2 -4 


(b) 


(e) 


(h) 


—7 
—5 
—1 


s) 


7 4 -4 
tO) 1 
10 5 —6 
5 1 -tI11 
7 -1 —13 
4 0 —8 
1 12 -8 
-1 9 -4 
al 66 1 


=) 


’ 


’ 


oCoWO OF 


-4 8 4 

(c) —-3 4 -3 |; 
-1 4 9 
-3 1 1 

(f) 5 ers ee a 
4 -8 2 
1 1 0 

(ON ee ee ee 
2: 232: 2 


4 an O. 4 
ee. wos ea. “Els 
ma re ree 
10 re ae 
Phe HhEO6 8 303 
2 = a ee 
=3 yo. 25 
Sap ee LE 
=i 1 a yet 


; : Sued & ; F F 1. 
Calculate the powers A” (n is an arbitrary positive integer) of the Fibonacci matrix A = 1 4 . Express your answer 


4 , has no square root. Find two matrix-functions cos (VAt) and A~!/? sin (art). 


Show that a nilpotent matrix has no square root. Recall that a square matrix A is said to be nilpotent if A? = O for 


For each of the following 3 x 3 positive matrices, determine the matrix functions W(t) = cos (VAt) and @®(t) = 


AW'/? sin (Av ae), and show that they satisfy the second order matrix differential equation X+AX=0. 


—4 5 
—4 
-1 


(b) 


—53 


(c) 


—22 
—9 


127 31 
12 


53 
20 


—25 -130 64 
(4) |}-1 4 4 
=I. 45: -20 
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Summary for Chapter 7 


I. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 
19. 


A square n x n matrix A is called an unitary (or isometric) matrix if || Ax|| = ||x|| for all 
n-vectors x. A square n x n matrix A is called normal if AA* = A*A. Selfadjoint matrices and unitary matrices 
are normal. 


. A matrix A(t) is said to be continuous on an interval (a, 3) if each element of A is a continuous function on the given 


interval. With matrix functions we can operate in a similar way as with functions. 


. The determinant of a square n x n matrix A = [aij] is the sum of n! terms. Each term is the product of n matrix 


entries, one element from each row and one element from each column. Furthermore, each product is assigned a plus 
or a minus sign: 


det(A) = os (=1)° ani, G2ig +++ Onin, 


where the summation is over all permutations (i1,72,...,%n) of the integers 1,2,...,n and o is the integer that is 
determined by the evenness of the permutation. Therefore, half of all products in this sum comes with a plus sign and 
the other half comes with a minus sign. 


. A matrix A is called singular (or degenerate) if det A = 0 and nonsingular (or invertible) if its determinant is not 


zero, that is, det A 4 0. 


. Let A be an n x n matrix. The minor of the kj*” entry of A is the determinant of the (n — 1) x (n — 1) submatrix 


obtained from A by deleting row k and column j. The cofactor of the entry az; is Cof (a,;) = (—1)**% Minor (ax;). 


1 
: $ . Hite, i 
. If A is a nonsingular n x n matrix, then AT = a [Cof(a;i)],;- 
. The resolvent of a square matrix A is the matrix R)(A) defined as Ry(A) = (AI— A)7’, where I is the identity 
matrix. 
. A set of vectors x1, X2, ..., Xn is said to span the vector space V if every element V can be expressed as a linear 


combination c,X1 + cox2 +:-:+CnXn of these vectors. 


. A set of vectors x1, X2,..., Xn in V is said to be linearly dependent if one of these vectors is a linear combination 


of the others. 


If these vectors are not linearly dependent then they are said to be linearly independent. 


The dimension of a vector space V, denoted by dim V, is the fewest number of linearly independent vectors that span 
V. A vector space is said to be a finite dimensional space if its dimension is finite. Otherwise, we say that V is an 
infinite dimensional space if no set of finitely many elements spans V. 


Let V be a vector space. A subset X is said to be a basis for V if it has the following two properties: 


(a) any finite subset of X is linearly independent; 


(b) every vector in V is a linear combination of finitely many elements of X. 


The characteristic polynomial (A) is the determinant of the matrix AI— A, that is, x(A) = det(AI— A). Obviously 
x(A) has leading term \”. Any solution of the characteristic equation (A) = 0 is said to be an eigenvalue. The set 
of all eigenvalues is called the spectrum of the matrix A, denoted o(A). 


A nonzero vector x satisfying Ax = Ax is called an eigenvector of a square matrix A corresponding to the eigenvalue 
X. 


Let N) be the collection of all vectors x € X such that Ax = Ax. Since, by our definition, 0 is not an eigenvector, 
N) does not contain 0. If, however, we enlarge N) by adjoining the origin to it, then N) becomes a subspace, usually 
called the eigenspace or proper space. We define the geometric multiplicity of the eigenvalue as the dimension of 
the subspace N). If the eigenvalue has multiplicity 1, it is said to be a simple eigenvalue. 


Let » be an eigenvalue of a matrix A. The algebraic multiplicity of X is called the multiplicity of A as a root of the 
characteristic equation of A, i.e., a root of x(A) = 0. 


The eigenvalue \ of a square matrix is called defective if its algebraic multiplicity is greater than the geometrical one. 
The difference (which is always nonnegative) between the algebraic multiplicity and geometrical multiplicity is called 
the defect of the eigenvalue X. 


Ann X n matrix A is said to be diagonalizable if it is similar to a diagonal matrix, that is, there exists a diagonal 
matrix D and nonsingular matrix S such that D = S~'AS. 


An nxn matrix A is diagonalizable if and only if its eigenvalues have the same geometrical and algebraic multiplicities. 


If A is a diagonalizable matrix, namely, A ~ D, where D is a diagonal matrix of eigenvalues, then A = SDS7'. A 
function of the matrix A is defined as f(A) =Sf(D)S~°. 
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20. To implement the diagonalization procedure, do the following steps: 


(a) Determine the geometric multiplicities of eigenvalues. This is equivalent to constructing eigenvectors x1, X2, ---, 
Xm. If m =n, the dimension of the matrix A, then these eigenvectors span the n-dimensional vector space, and 
the matrix is diagonalizable. 

(b) Define a nonsingular matrix S that reduces the given matrix to a diagonal matrix. You may build the matrix S 
from eigenvectors, writing them in sequence as row vectors, namely, S = [x1 x2 --- Xn]. The determinant of the 
matrix S is not zero because S consists of column vectors of eigenvectors, which are linearly independent. 


(c) Calculate S~*. 


(d) Define the function of the matrix according to the formula 


fr) 0 0 
0 fO2) 0 

f(A)=Sf(D)S*=S]_. a os s Vss*s (7.4.3) 
0 0 FOn) 


21. A scalar polynomial q(A) is called an annulled polynomial (or annihilating polynomial) of the square matrix A, if 
q(A) = 0, with the understanding that A° = I replaces \° = 1 in the substitution. 


22. The annihilating polynomial, 7(A), of least degree with leading coefficient 1 is called the minimal polynomial of A. 
23. The Sylvester formula: If A is a square diagonalizable matrix and 


s 


WA) = (A= At)(A= Aa) + (A= As) = [TO Az), 


k=1 


is its minimal polynomial, then for a function f(A) we define f(A) as 
f(A) = yD f(Ak) Ze (A), (7.5.3) 
k=1 


nee COE eee Cee ees 
T.(A) Shes Ry(A) = ha 1 ae, 
K(A) Nk (A) (Ak — At) ++: (Ak — Ag—1)(An — Angi) +++ (Ak — As) 
are known as Sylvester’s auxiliary matrices. Note that Z,(A), also called the Lagrange interpolation polynomial, is the 
projection operator onto the eigenspace of A. If w(A) coincides with the characteristic polynomial x(A), then s equals 
n, the dimension of the matrix A. In the above formula, Ax, k = 1,2,...s, are distinct eigenvalues of the matrix A. 
24. Resolvent Method: For a given square n x n matrix A (defective or not), use the following procedure to define a 


function f(A). 
(a) Find the characteristic polynomial y(A) = det(AI — A). 
(b) By equating x(A) = 0, determine eigenvalues Ai, A2, ..., As of a matrix A and their algebraic multiplicities. 


(c) Find the resolvent of the given matrix A: 
R)(A) = (AI— A)7?. 
(d) Then for a function f(A) defined on the spectrum o(A) of square matrix A we set 


FA)= D7 Res FO) Ra(A). 


d\,Eo(A) 


P(A) 


OO)? is defined according to the multiplicity, 


(e) The residue of a ratio of two polynomials (or entire functions), Res, 
m, of the singular point Xo as follows: 


P(A) 1 d™* P(A)(A— o)™ 
Res = ——_ a : 7.6.2 
ro Q(A)  (m—1)! dant Q(A) er ( ) 
In particular, for m = 1, we have 
pe a ew) (7.6.3) 


Res G0) QQ)” 
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Spectral Decomposition Method: For a square matrix A, let 

W(A) = (A = Ar)” (A = Aa)? + (A = As)" (7.5.1) 
be its minimal polynomial of degree m = mi +---+ms, where Ax, k = 1,2,...,8, are distinct eigenvalues. The 
exponential matrix can be defined as 


e** — bo(t)I + b1(t) A + bo(t) A? +--+ bm—i(t) A” "," 


where the coefficient functions 6;(t), 7 = 0,1,...,m—1 satisfy the following equations: 
eet — bo(t) + bi (t)An +++ t bmi (tae *, k= 1,2,...,8. (7.7.6) 


If the expansion (7.5.1) of the minimal polynomial 7(A) contains the multiple (A — Ax)’”"*, mx > 1, we include in the 
system to be solved m, — 1 additional equations 


mp-1 1 ; 
P erkt — :s Ga by (that, p=1,2,...,m,_—1. (7.7.7) 


j=p 


A similar procedure is used to define f(A) = pare b; A’, where the coefficients b; can be found from the equations 


f(r) = bo + b1Ag + 2) tom,” | an eee 2 


If there is a multiple root in w(A) of multiplicity s, then you have to differentiate the corresponding equation s — 1 
times, so that the number of equations is equal to the number of unknown coefficients b;. 
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Section 7.1 


1. 


Suppose that P(t) is a differentiable square matrix, find a formula for the derivative of P"(t), where n is any positive 
integer. 
Find lim¢o0 A(t) or state why the limit does not exist. 


t-*(cost—1) tt esct tte? . sek 


(a) A(t) = (241) -2 (2 — | , (b) A(t) = I/vVI=t BP 


3. In each problem, determine A(t) where 


(a) A'(t) = ie | and A(0) = E Ar 


2 tant 
w a= [a 2] ame ao=[p gh 


(c) A’(t)= we neem and A(0) = F i 


Section 7.2 of Chapter 7 (Review) 


1. Find the determinants of the following matrices: 


-:| cost —sint |. t—1 2t+1 
(a) ¢ Ee an i (b) En red 


2. For each of the following 2 x 2 matrices, find its inverse. 


of 2. of of} of 


3. For each of the following 3 x 3 matrices, find its inverse. 


3 4 2 i 8 1) i <a 3 
fa) |2 2 —e)3 (b) |-1 6 5 |]; (c) | -4 3 2 |; 
“8 =f 1 i =e. =% oe 

51 2 A So i StF a 8 
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4. Find the element in the third row and second column of A7? if 


= 
2-1 
0 1 
1 2 


rFPNrR OO 


5. Show that A? = 0 for a 2 x 2 matrix A if tr A= det A = 0 (such matrix is called nilpotent). 
6. Suppose that 2 x 2 matrices A and B satisfy A? + B? = 2AB. 


(a) Prove that tr A = trB. 
(b) Prove that AB = BA. 


Section 7.3 of Chapter 7 (Review) 


1. For each of the following 2 x 2 matrices, find its eigenvalues and corresponding eigenvectors. 


@ alm [2 a: © [Ga @ & SB) © & S). 


2. For each of the following 2 x 2 matrices, find its eigenvalues and corresponding eigenvectors, including generalized 


eigenvectors. 
of so tof sof ch © ff a 


3. For each of the following 2 x 2 matrices, find its complex eigenvalues and corresponding eigenvectors. 


@ |p ajo 6 a) of sf @ & oh © & al, 


4. For each of the following 3 x 3 matrices, find its eigenvalues and corresponding eigenvectors. 


a ae 04 1 6 -3 -8 
(a) | 1 014; (b) | 2 2 1]; (ey |} 2 i =o 1; 
011 40 4 8. ..8 <5 
1 2 1 5 0 -6 3 2 2» 
(d) 6 at i. |e er te 2 als |) 5 et 2 
i =o -24 a | 5 5 38 


a | tO: 3 t 0. <1 
(a) 0 2 0}; (by |) 2 =e 2/5 (ey 4 =2 0. 2 
i mie a a i 1 2 ie 8 


6. For each of the following 3 x 3 matrices, find its eigenvalues and corresponding eigenvectors, including generalized 


eigenvectors. 

1 2 0 4 0 1 0 1 0 

(a) =] =1 1/3 b) 2 3 271; c) 0 oO 14,; 
0 1 1 -1 0 2 2 -5 4 
=2. 0. 2 5 -1 1 1 1 0 

(d) —-§ -—1 1 |; e) 1 3 ; f) 1 O 14]; 
1 3.038 -3 2 1 1 -2 3 
1 1 O 2 -1 3 2 -9 3 

(g) 1 4 14]; h) 1 -1 1 : i) 1 -1 1 : 
1 -2 2 —-5 5 —5 -9 9 -9 
4 -2 -2 3 4 —10 1 1 0 

(j) —2 3 -14;; k) 2 1 -2 |; 1) |-2 1 -2 
2 -1 3 2 2 -5 0 -1 1 
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7. For each of the following 3 x 3 matrices, find its complex eigenvalues and corresponding eigenvectors. 


1 1 2 3 -4 -2 6 0 -3 
(a) 1 0 -1 |; (b) —-5 7 -8 }; (c) -3 3 3 ; 
—-1 -2 -l -—10 13 -8 1 -2 6 
4 -4 4 4 4 0 1 -1l -2 
(d) -10 3 15]; (e) 8 10 —20 |; (f) 1 3 2 
2 -3 1 2 3 -2 1 -1l 2 
8. Prove that the set of generalized eigenvectors of a square n X n matrix T corresponding to an eigenvalue » equals 
ker (AI — T)”. 
222? 


9. For each of the following 3 x 3 matrices, find its eigenvalues and corresponding eigenvectors, including generalized 


eigenvectors. 
2 12 —-10 1 it 1 1 2 3 
(a) —2 24 —-11 |; (b) 1 3 -1 /3 (c) 1 -2 -3 |; 
—2 24 -8 0 2 2 1 -1l -2 
1 1 -3 6 -5 3 -1 1 -1 
(d) -4 -4 8 ; (e) 2 —1 3:15 (f) —2 0 2 
—2 1 0 2 1 1 =1. 3 1 


10. For each of the following 3 x 3 matrices, find its complex eigenvalues and corresponding eigenvectors. 


—-5 5 4 t= =2 -3 1 -3 
(ay. |} er Ns (b) |1 3 2 |; (c) |4 -1 2]; 
1 0 0 1 -1l 2 4 -2 3 
3-3 1 4 2 4 0 2 2 
(4d) | 0 2 2]; (ec) | 6 0 12 |; (f) | -5 2 5 
5 1 1 -8 0 -6 2 1 O 
Section 7.4 of Chapter 7 (Review) 
2 _— 
1. Using the diagonalization procedure, find the functions f(A) and g(A), where f(A) = e** and g(A) = — of the 
following 2 x 2 matrices: 
-—ll 7 -—12 7 18 —-11 8 -1l 
@ lee als (| ais a) © [P y'): as al 
14 -3 11 3 20 3 15 7 
© [St yh; © Ey ai @ [7% i): ay | a: 
. . oh - MH 1 
2. Using the diagonalization procedure, find the functions f(A) and g(A), where f(A) = e*’ and g(A) = aT? of the 
following 3 x 3 matrices: 
3. -2 O —8 -ll -2 2 -3 1 
(a) |-1 3 —2]; (b) |6 9 2]; (c) |4 5 —2I; 
0 -1 3 -6 —-6 1 29 —7 O 
2 1 1 0 1 2 2 9 3 
(ay | -@ at; (ec) |-5 4 7|; (a 
—2 3 -1 1 -3 11 1 3 O 
3. For each of the following 2 x 2 matrices, find all square roots. 
300 21 19 18 41 32 15 -6 
(a) ie (b) E | (c) i A (4) ee | 
25 —6 7 6 16 —-6 15 -1l 
(e) ie | (f) k ra (8) ler | (h) Ee 14 
. [16 —9]. » [20 19]. ot ar]. 8-4]. 
@) fee Al (i) B A (ie) ke 19 | @) a i 
7 12 12 —4 8 —8 40 -—-12 
co k | : (n) ie | ke) ieee ail (p) Le 25 
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Section 7.5 of Chapter 7 (Review) 


1. For each of the given 3 x 3 matrices with multiple eigenvalues, find Sylvester’s auxiliary matrices. 


25 —-8 30 7 —2 2 3 1 1 
(a) 24 -—7 30]; (b) 8 -1l 41]; (c) 2 4 21; 
-12 4 -14 -8 4 -1 -1 -1 1 
-19 12 84 3.2 4 3 2 4 
(d) 0 5 0]; (e) |6 2 6); (ff) |10 4 10 
-8 4. 33 4 2 3 4 2 3 
2. For each of the given 2 x 2 matrices, find e**, Ah and cos(At). 
1 10 —2 6 1 4 1 5 
@ [FMI Om rae © [i i}: fa |e 
4 9 1 3 4 3 1 3 
@ [t 2: © [5 3): @ lh m [i 2 
3. For each of the given 2 x 2 positive matrices, find all roots VA and determine evat 
8 4 7 3 13. 3 15 3 
a) [P 5) o | Omran @) [> io): 
0 2 3.3 6 2 2 4 
Sa Ol @) |} 7): a) | 3]. 
4. For each of the given 3 x 3 matrices with multiple eigenvalues, find e** using Sylvester’s method. 
3.2 4 3 1 -l 3 2 -2 
a) 2 0 24; b) 3.6 LT]; (c) —2 7 -21; 
4 2 8 -6 2 4 —10 10 —-5 
3.2 4 3 1 -1 —2 -2 4 
d) |14 6 14|; e) j1 3 -1]; (f) |-2 1 2]; 
4 2 83 3.3 -l —-4 —2 6 
—10 20 —16 13 —-18 14 14 -—20 16 
g) -9 17 -12); h) |10 -15 14]; (i) 9 -13 12 
-3 5 -2 5 —-9 10 3. -5 6 
sin (1 VA) 
5. Find ve and cos (1 VA) for each of the following 3 x 3 matrices. Then show that these two functions are 
solutions to the matrix differential equation ® + A ®(t) = 0. 
9 —-40 25 —116 -—330 210 —28 16 —-8 
(a) |-25 204 —125] ; b) 160 444 —280] ; (c) |}-48 28 12); 
—40 320 —196 180 495 —311 -16 8 8 
—47 -—48 192 —41 -—45 180 —-8 -12 48 
(d) |—96 —95 384] ; e) |—90 -—86 360]; f) |-—24 -20 96] ; 
—48 —48 193 —45 —45 184 —12 -12 52 
2 9 -4 17 8 —32 —7 -21 74 
(g) |-14 43 —40]; h) |16 25 —64] ; i) |-32 -—38 158] ; 
—7 9 5 8 8 —23 —-16 —-21 88 
—55 32 16 —20 12 6 44 —20 —-10 
(j) |-96 57 24]; k) |-36 22 9]; 1) |60 —26 —15] ; 
—32 16 17 -12 6 7 20 —-10 -1 
289 -96 —144 16 —-96 60 1 24 —-15 
(m) 144 -47 —-72 |; n) —60 484 —300 |; (o) 15 -116 75 
432 —144 —215 —96 768 —476 24 -192 124 
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Section 7.6 of Chapter 7 (Review) 


1. For each of the following 3 x 3 defective matrices, find e“' using the resolvent method. 
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11 1 5 1 -ll esa 
(a) | 1 3 -1 4; b) |7 -1 —13] ; fe) | =i. @ = l/s 

¢ 2 2 4 Or 8 —5 & a4 

=9 iy a 5, oe Sie: 9- “a 
(dy) |-1 6 43 e 1a =f 3 | (f): |.34 46° 4a) 

i 8 -8 10 if % 

4 -1 1 f ot 4 a sk ail 
(g) |10 -2 3]; h) |14 -3 6]; Gy (a> % 8 

1 0 1 5 -2 5 er: ee 


2. For each of the following 3 x 3 matrices, express e“' through real-valued functions using the resolvent method. 


1 2 -2 2 2 9 3 1 -2 
a) 2 5 -2 |; (b) 1 -1 8 (c) 3 2 #1 ; 
4 12 -5 -1 -1 -4 —2 3 1 
3.2 -1 3 1 -1 5 -—2 1 
d) |-2 4 -3]; (e) }2 3 —-l1]; (f) |4 -2 O|; 
—-5 2 7 4 4 -1 8 4 1 
8 12 —-4 4 -3 3 5 2 -1 
g) |-9 12 4]; (h) J|1 4 -3)]; (i) |-3 2 2]; 
37 12 20 2 5 -1 1 3 2 
3. -9 -l 1 -2 -1 1 -3 -1 
j) |2 -1 34]; (k) |1 3 -1]; (qd) |2 6 -1 
1 2 = 10 2 4 -1 2 4 -2 
3. For each of the following 3 x 3 defective matrices, find a square root and determine ovat, 
10 6 —3 -3 -4 12 12 8 —-I17 
a) ;-12 -11 12]; b) 4 5 —12]; (c) 1 2 4 |; 
-6 -6 7 0 0 1 1 -2 8 
-6 -14 -8 17 8 -8 12 6 —-15 
d) 10 18 8 |; e) |-2 -1 2 (f) 1 2 Ay | 
—38 —46 24 14 #6 -5 1 -2 8 
6 0 -4 —2 -14 -8 1 16 4-12 
g) |}-2 4 51]; h) 6 18 & 5 (i) |6 -3 24]; 
1 O 2 —54 -30 28 5 -10 24 
-4 8 4 2 -1 -1 5 -1 -1 
j) |-3 4 -83]; k) |1 1 -1]; qd) jl 1 -1 
-1 4 9 1 -1 0 1 -1 3 


. Show that the following 3 x 3 matrices have no square root. Nevertheless, find the matrix functions Y(t) = cos (Vat) 


and ®(t) = A~'/? sin (a¥?2), and show that they satisfy the second order matrix differential equation K + A X = 0. 


—2 1 2 -l1 2 -7 -1 -1 1 -l1 1 -1 
(a) }-1 0 1]; (b) 1 -2 1); (c) }-1 -1 2]; (d) 1 -3 2 
—2 1 2 1 -2 3 -1 -1 2 2 -6 4 
. For each of the following 3 x 3 positive matrices, determine mA) and cos (vA t), and show that the result does 
not depend on the choice of the root. 
—34 13 #19 2 8 -1 —-17 46 13 2 —40 —-8 
(a) |-63 25 34]; (b) |-1 7 Of]; (c) }|-10 26 6]; (d) |-1 4 a 
—20 7 12 -1 5 3 3. -7 3 —2 —-15 6 
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Section 7.7 of Chapter 7 (Review) 


1. Some of the following 3 x 3 matrices have the same spectrum. Therefore, the coefficients bo(t), bi(t), and be(t) in 
expansion (7.7.5), page 415, are the same for these matrices. Find these coefficients and construct the propagator 
matrix e“* for each given matrix. 


1 10 —-12 1 12 -8 0 1 
a) |2 2 3. hs b) |-1 9 —4|; c) 1 3 1 +4; 
2 -1 6 -1 6 -1 -1 0 2 
2 1 0 —-3 5 -5 —2 17 4 
d) |-3 2 -83]; e) 3 -1 3 : f) -1 6 14]; 
0 -1l 2 8 -8 10 0 1 2 
1 8 -12 4 0 5 1 12 -8 
g) j2 2 3 13 h) 5 3 1 13 i) -1 9 -4 |; 
2 -1 6 -1 0 2 -1 7 -1 
6 —5 10 6 -1 10 6 -9 18 
j) -1 2 -2]; k) -1 2 -2 |]; 1) -1 2 -2 
-1 3 -1 -1 1 -1 -1 1 -!1 
2. For each of the following 3 x 3 defective matrices, find eMt using the spectral decomposition method. 
-1 -1 2 —2 -1 3 6 —5 10 
a) -1 3 14; b) —2 1 14; c) -1 6 -6j;; 
—2 -1 3 —2 -1 3 -1 1 -1 
7 -5 12 2 1 -2 4 -1 1 
d) -1 3 -6]; e) 1 2 -2 |; f) 10 -2 3]; 
-1 1 -1 -1 1 1 0 1 
4 -5 6 6 —-5 10 1 12 -8 
g) -1 0 -3 1]; h) -1 2 -6 i) -1 9 -4 
-1 1 -1 -1 1 -1 -1 6 -1 
5 -l1 1 -15 -7 4 3 3. -4 
j) 1 3 0]; k) 34. «160 «S11 1) -4 -5 8 
-3 2 #1 17 7 5 —2 -3 5 


3. For each of the following 2 x 2 positive matrices, determine sna and cos (vat), and show that the result does 
not depend on the choice of the root. 


oPa of of oP 
© [ft yh: a par ela: ab om) [Poel 
og off wo BA of 4 


4. For each of the following 3 x 3 positive matrices, determine sna and cos (VA t), and show that the result does 


not depend on the choice of the root. 


3 0 4 —t 0 4 5 -5 8 
(a) £4. ale (b) |o -1 14; (ce) | —1 2 =2 |; 
th. ty eal = ee ee 
5 2 5 a aes it 2 4 
(d) 2 8 2 Is (e) | 1 5 -1 1]; (f) 0 9 O |; 
=h. = <5 Peet at) 28: <i 
=f 21° 9 2: Ge. a = a 
(ge). | Sl: 28; at |e (h) | -1 6 1]; (i) | -5 1 5 
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Systems of First Order Linear 
Differential Equations 


As is well known [14], many practical problems lead to systems of differential equations. In the present chapter, 
we concentrate our attention only on systems of linear first order differential equations in normal form when the 
number of dependent variables (unknown functions) is the same as the number of equations. The theory of such 
systems parallels very closely the theory of higher order linear differential equations (see Chapter 4); however, there 
are some features that are not observed in the theory of single differential equations. 

The opening sections present general properties of linear vector differential equations with variable and constant 
coefficients. In the next sections, we discuss solutions of the initial value problems for nonhomogeneous systems 
using the variation of parameters, method of undetermined coefficients, and the Laplace transformation. While a 
second order system of equations can be reduced to one of first order, we present a direct solution of such systems 
in the concluding section. 


8.1 Systems of Linear Differential Equations 


In this section, we discuss the properties of the solutions for systems of linear nonhomogeneous differential equations 
with variable coefficients in the normal form: 


dx(t ‘ 

x ) = P(t) x(t) + f(t) or x=P(t)x(t)+f(0), (8.1.1) 
where the dot represents the derivative with respect to t: x = dx/dt, P(t) = [pi;(t)] is a square n x n matrix 
with continuous entries p;;(t) on some interval, called the coefficient matrix, f(t) = (fi(t), fo(t),...,fn(t))? isa 
given continuous column vector function on the same interval, and x(t) = (x1(t), r2(t),...,@n(t))7 is an n-vector of 
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unknown functions. A column vector x(t), which is to be determined, is usually called the state of the system at 
time f. 

A system of linear differential equations in normal form (8.1.1) is called a vector differential equation. If the 
column vector f(t), called the nonhomogeneous term, the forcing or driving function, or the input vector, is 
not identically zero, then Eq. (8.1.1) is called nonhomogeneous or inhomogeneous or driven. Otherwise we have a 
homogeneous vector equation 


a =P(t)x(t) or x*x=P(t)x(2). (8.1.2) 


Equation (8.1.2) is called the complementary equation to nonhomogeneous equation (8.1.1), and its general 
solution is called the complementary function, which contains n arbitrary constants. The homogeneous equation 
(8.1.2) obviously has identically zero solution x(t) = 0, which is referred to as the trivial solution. The first 
observation about solutions of the homogeneous vector equation (8.1.2) follows from the linearity of the problem. 


Theorem 8.1: [Superposition Principle for Homogeneous Equations] Let x; (t), x2(t), ..., Xm(t) be a set of 
solution vectors of the homogeneous system of differential equations (8.1.2) on an interval |a,b|. Then their 
linear combination, 

x(t) = c x1(t) + co Xo(t) +--+ + Cm Xm (8), 


where c;, 7 = 1,2,...,m, are arbitrary constants, is also a solution to Eq. (8.1.2) on the same interval. 


Definition 8.1: A set of n vector functions x(t), x2(t), ..., Xn(t) is said to be linearly dependent on an 
interval |a, b| if there exists a set of numbers cy, C2, ..., Cn, with at least one nonzero, such that 


C1 X1(t) + co X2(t) +--+ +n Xn(t) =0 for all t € |a, d]. 


Otherwise, these vector functions are called linearly independent. 


Two vectors are linearly dependent if and only if each of them is a constant multiple of another. The following 
example shows that the functions can be linearly independent on some interval, but also can be linearly dependent 
vectors at every point on this interval. As we will see shortly (Theorem 8.2), this situation is never observed when 
vector functions are solutions of a linear homogeneous system of equations (8.1.2). 


Example 8.1.1: The vector functions 


o-[e)-e[t] = [1] 


are linearly independent on (—o0o, 00) since x(t) = e’ y(t) and there is no constant C such that x(t) = C y(t). 


Theorem 8.2: Let x(t), ..., Xm(t) be solutions of the homogeneous vector equation (8.1.2) on an interval 
|a, b|, and let to be any point in this interval. Then the set of vector functions {x,(t), ..., Xm(t)} is linearly 
dependent if and only if the set of vectors {x1(to), ..., Xm(to)} is linearly dependent. 


It is obvious that if the set of vector functions is linearly dependent, then at any point the corresponding 
vectors are linearly dependent. Now, we suppose the opposite, and assume that constant vectors {x1(to), ..-, Xm(to)} 
are linearly dependent. Then there exist constants c,, ..., Cm, which are not all zeroes, such that 


c1X1(to) + c2X2(to) feeet CmXm (to) = 0. 


Since each vector function x,(t), 7 = 1,2,...,m, is a solution of the homogeneous vector equation x = P(t) x, the 
vector 
x(t) = cyx1(t) + cgXo(t) +--+ + CmXm/(t) 


is a solution of the initial value problem 


8.1. Systems of Linear Equations 433 


From Theorem 6.6, page 372, it follows that this initial value problem has only trivial solution x(t) = 0 for all ¢ in 
the given interval. Hence, the set of functions {x1(t), ..., Xm(t)} is linearly dependent. oO 

Next, we are going to show that the dimension of the solution space of the vector equation (8.1.2) is exactly n. 
To prove this, we need the following corollary from the previous theorem. 


Corollary 8.1: Let x;(t), k =1,2,...,n, be solutions to the initial value problems 


d 
= =P(t)x,(t),  xe(to)=ex, #=1,2,...,n, 
where 
1 0 0 
0 1 0 
ei, = pe Se || » Cn = 
0 0 il 
Then x;,(t), k = 1,2,...,m, are linearly independent solutions of the system x = P(t)x. | 


Corollary 8.1 affirms that there exist at least n linearly independent solutions of the homogeneous system of 
differential equations (8.1.2). Now suppose that the dimension of its solution set exceeds n and there are n + 1 
linearly independent solutions uj(t), ..., un(t), Un+i(t). Theorem 8.2 would imply that the vectors uj(to), ..., 
U,+1(to) are linearly independent in n dimensional space R”, which is impossible. Therefore, we proved the following 
statement: 


Theorem 8.3: The dimension of the solution space of the n x n system of differential equations x(t) = P(t) x(t) 
is n. 


Definition 8.2: Let P(t) be an nxn matrix that is continuous on an interval |a, b| (a < b). Any set of n solutions 
x(t), X2(t), ..., Xn(t) of the homogeneous vector equation x(t) = P(t) x(t) that is linearly independent on the 
interval |a, b| is called a fundamental set of solutions (or fundamental solution set). In this case, the n x n 
nonsingular matrix, written in column form 


X(t) = [xi(t), xo(t), ... 


where each column vector is a solution of the homogeneous vector equation x(t) = P(t) x(t), is called a funda- 
mental matrix for the system of differential equations. 


In other words, a square matrix whose columns form a linearly independent set of solutions of the homogeneous 
system of differential equations (8.1.2) is called a fundamental matrix for the system. A product of a fundamental 
matrix and a nonsingular constant matrix is a fundamental matrix. This leads to the conclusion that a fundamental 
matrix is not unique. Because the column vectors of the fundamental matrix X(t) satisfy the vector differential 
equation x’(t) = P(t) x(t), the matrix-function itself is a solution of the matrix differential equation: 

dX(t) 


FO = PO RO oor X(t) = P(t) X(t) (8.1.3) 


that contains n? differential equations for each entry of the n x n matrix X = [2;;(t)]. For example, consider a 


two-dimensional case: (i) () () () 
ry1(t) ry2(t Piult) piolt 
X(t) = d P(t) = . 
O- [RG cael mt POE Bi 
Then the matrix differential equation (8.1.3) can be written as 
u(t) £12(t) _ pii(t) pie(t)| |eiu(t) v12(t) 
toi(t) £20(t) pai(t) po2(t)| |wai(t) re2(t) 
_— Pi1%11 1 P12%21) P11%12 T P1222 
p2i%11 + po2%21 = p21%12 + p22%22 | * 
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This matrix equation is equivalent to two separate vector equations (for each column): 


fe = pit + P1221, ena i = pi1%12 + P1222, 


£21 = p21 211 + p22%a1, £92 = po1®11 + p22%01- 


The established relation between the matrix differential equation (8.1.3) and the vector equation (8.1.2) leads to the 
following sequence of statements. 


Theorem 8.4: If X(t) is a solution of the n x n matrix differential equation (8.1.3), then for any constant 
column vector ¢ = (c1,¢2,-..,¢n)", the n-vector u = X(t) c is a solution of the vector equation (8.1.2). 


Theorem 8.5: If an n x n matrix P(t) has continuous entries on an open interval, then the vector differential 
equation x = P(t) x(t) has an n x n fundamental matrix X(t) = [x1(t), xo(t), ..., Xn(t)] on the same interval. 
Every solution x(t) to this system can be written as a linear combination of the column vectors of the fundamental 
matrix in a unique way: 


x(t) = c1x1(t) + coxe(t) +--+: +c,Xn(t) or in vector form x(t) = X(t)c (8.1.4) 


for appropriate constants C1, C2, ..., Cn, Where ¢ = (c1,¢2,...,€n)? is a column vector of these constants. a 


Throughout the text, we will refer to (8.1.4) as the general solution to the homogeneous vector differential 
equation (8.1.2). 


Theorem 8.6: The general solution of a nonhomogeneous linear vector equation (8.1.1) is the sum of the 
general solution of the complement homogeneous equation (8.1.2) and a particular solution of the inhomogeneous 
equation (8.1.1). That is, every solution to Eq. (8.1.1) is of the form 


20) Teles) ame Gl) eo elt) ee) (8.1.5) 


for some constants C1, C2, ..., Cn, where x;,(£) = c1:x1(t) + c2Xo(t) +---+cnXp(t) is the general solution of the 
homogeneous linear equation (8.1.2) and x,(t) is a particular solution of the nonhomogeneous equation (8.1.1). 


Theorem 8.7: [Superposition Principle for Inhomogeneous Equations] Let P(t) be an n x n matrix function 
that is continuous on an interval |a, b|, and let x;(t) and x(t) be two vector solutions of the nonhomogeneous 
equations 

x1 (t) = P(t) 24 SP f,(t), X(t) = P(t) Xo + fo(t), te la, b], 


respectively. Then, for arbitrary constants a and (, their linear combination x(t) = ax,(t) + (x2(t) is a solution 
of the following nonhomogeneous equation 


x(t) = P(t)x+ af) (t) + Bfo(t), t € |a, d|. 


Corollary 8.2: The difference between any two solutions of the nonhomogeneous vector equation x = P(t)x+f 
is a solution of the complementary homogeneous equation x = P(t) x. 


Example 8.1.2: It is not hard to verify that the vector functions 


wo-[7]: s0-[8] 


are two linearly independent solutions to the following homogeneous vector differential equation: 


X(t) =P(t)x(), P(e) = | : or | (t £0). 
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Therefore, the corresponding fundamental matrix is 


xf = | ie | det X(t) = —te’. 


Definition 8.3: The determinant, W(t) = det X(t), of a square matrix X(t) = [xi(t), xa(t), ..., Xn(t)] formed 


from the set of n vector functions x;(t), x2(t), ..., Xn(t) is called the Wronskian of these column vectors 


{xi(t),...,Xn(t)}. 


Theorem 8.8: [N. Abel] Let P(t) be an n x n matrix with entries p;;(t) (¢,7 =1,2,...,n) that are continuous 
functions on some interval. Let x;,(t), k = 1,2,...,n, be n solutions to the homogeneous vector differential 
equation x = P(t) x(t). Then the Wronskian of the set of vector solutions is 


WO =Wenlen ( | * PW) ar) (8.1.6) 


to 


with to being a point within an interval where the trace tr P(t) = pi1(t) +p22(t)+---+pnn(t) is continuous. Here 
W(t) = det X(t), where X(t) = [xi(t), xo(t), ..., Xn(t)] is the matrix formed from the set of column vectors 


{ xi(t), Xo(t),...,Xn/(t) }. 


Differentiating the Wronskian W(t) = det X(t) using the product rule, we get 


d 
dt Wit) = Ix (t), X(t), sees Xn F x1 (t), x9(t), seg Xn a eel [xi (¢), X(t), een x;,(t)| : 
Looking at a typical term in this sum, we use the vector differential equation (8.1.2) to obtain 

Jen (t), 2 4(t), «5 Xn] = b(t), ---, PUG (8), «5 xalf)]- 


Now we write out the product P(t) x;(t) in its k-th entry form: 


Pas |=) Pelle: 
s=1 


Since each k-th column of the determinant is a linear combination of the other columns except the k-th one, we sum 


all these determinants to obtain 4 
er W(t) = (trP) W(8), (8.1.7) 


which is a first order separable equation. Integration yields Abel’s formula displayed in Eq. (8.1.6). Problem 1 on 
page 438 asks you to show all the details for a two-dimensional case. 


Corollary 8.3: Let x:(t), xo2(t), ..., Xn(t) be column solutions of the homogeneous vector equation x = 
P(t)x on some interval |a,b|, where n x n matrix P(t) is continuous. Then the corresponding matrix X(t) = 
[x1(t), X2(t), ..., Xn(t)] of these column vectors is either a singular matrix for all ¢ € |a,b| or else nonsingular. 


In other words, det X(t) is either identically zero or it never vanishes on the interval |a, }]. 


Corollary 8.4: Let P(t) be an nm x n matrix function that is continuous on an interval |a,b|. If 
{ xi(t), x(t), ..., Xn(t)} is a linearly independent set of solutions to the homogeneous differential equation 
x = P(t)x on |a, b|, then the Wronskian 


W (t) = det[x1(t), xo(t), ..., Xn(t)] 


is not zero at every point ¢ in |a, b]. 
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Example 8.1.3: (Example 8.1.2 revisited) The matrix 


Foe | ; “te 


has the trace trP = 1+ 1/t. From Abel’s theorem, it follows that the Wronskian 


(0 < t) 


W(t) = Cel trP(t) at 
Ces U+1/t) dt = Cettint =Ctet. 


On the other hand, direct calculations show that the Wronskian of the functions x;(t¢) and x2(t) is 


t? et 
Ww =aet | ; 5 |=-te #0 for t £0. 


Now let us consider the initial value problem 


* =P(t)x(t),  x(to) =x. (8.1.8) 


From Theorem 8.5, page 434, it follows that the general solution of the vector differential equation (8.1.2) is 


where c = (ci,€2,.-.,€n)? is the column vector of arbitrary constants. To satisfy the initial condition, we set 
X(to) c = Xo or c = X71 (to) xo. 
Therefore, the solution to the initial value problem (8.1.8) becomes 
x(t) = B(t, to)xo = X(t)X 1 (to) xo. (8.1.9) 


The square matrix ®(t,s) = X(t)X~1(s) is usually referred to as a propagator matrix. Thus, the following 
statement is proved: 


Theorem 8.9: Let X(t) be a fundamental matrix for the homogeneous linear system x’ = P(t)x, meaning that 
X(t) is a solution of the matrix differential equation (8.1.3) and det X(t) 4 0. Then the unique solution of the 
vector initial value problem (8.1.8) is given by Eq. (8.1.9). 


Corollary 8.5: For a fundamental matrix X(t), the propagator matrix-function ®(t, to) = X(t)X~1(to) is the 
unique solution of the following matrix initial value problem: 


SP(E10) _ p(y B(t,t0), B(tosto) =1, 


where I is the identity matrix. Hence, ®(t, to) is a fundamental matrix of Eq. (8.1.2). 


Corollary 8.6: Let X(t) and Y(t) be two fundamental matrices of the homogeneous vector equation (8.1.2). 
Then there exists a nonsingular constant square matrix C such that X(t) = Y(t)C, det C 4 0. This means that 
the solution space of the matrix differential equation (8.1.3) is 1. 
Example 8.1.4: Solve the initial value problem 
dx 1 3t —t? 1 
Ak =P(t)x, x(1)=xo, where P(t) = oe | a | (t#0), x= | 9 | ; 


given that vectors 
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constitute the fundamental set of solutions. 
Solution. A fundamental matrix is 


X(t) =[ x(8), xl) ]=[ 4 ral oo = = ae | 4 «| 
Then 7 cra 4 7 ij 3 
z a=5| 4 | = x axa =5| 5 | 


Therefore, the solution of the given initial value problem becomes 


x(t) = &(t, 1)x9 = X(t)X71(1) xo = | ' a | * | ; 7 ; | | 
A linear inhomogeneous equation of order n 
J) + ana(thyY +++ +ar(t)y! +ao(t)y = ft) (8.1.10) 
can be replaced with an equivalent first order system of dimension n. To do this, we introduce new variables 
=i) Bay ay BE (8.1.11) 


It turns out that these new variables satisfy the system of first order equations 


ry = 2, 
(8.1.12) 
a = <n, 
vy = —An-1(t)tn — +++ — ay (t)x2 — ag(t)a1 + f(t). 


The system (8.1.12) is a system of n first order linear differential equations involving the n unknown functions 2; (¢), 
ro(t), ..., @n(t). By introducing n-column vector functions x(t) = (x1 (t),...,@n(t))7 and f(t) = (0,0,...,0, f(#))*, 
we can rewrite the system of equations (8.1.12) in vector form (8.1.1), page 431, where 


[9 1 0 -. 0 

0 0 i 2 % 

Pit)=| . 7 ; (8.1.13) 
ee) a as). 4 HG, aD 


Therefore, every result for a single differential equation of order larger than 1 can be translated into a similar result 
for a first order system of differential equations. 


8.1.1 The Euler Vector Equations 


The Euler system of equations 


tx=Ax(b), t > 0, (8.1.14) 

with a constant square matrix A, is a vector version of the scalar n-th order Euler equation (see §4.6.2) 
Bint ty, at yh 4 -aty’ +04 = 0, (8.1.15) 
where ax, (k = 0,1,...,”) are constant coefficients. While Eq. (8.1.14) can be reduced to a constant coefficient 


vector equation by substitution 2 = Int, we seek its solution in the form x(t) = t4€, where € is a column vector 
independent of t. Since x = \t*~€, we get from Eq. (8.1.14) that \ must be a root of the algebraic equation 


AE = AE. 


This means that 4 must be an eigenvalue of the matrix A, and € must be its eigenvector. If the given matrix 
A is diagonalizable, then we get n eigenvalues 1, ..., An (not necessarily different) with corresponding linearly 
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independent eigenvectors €1, ..., €n. The general solution of Eq. (8.1.14) is a linear combination of these n column 
vectors 

x = ¢, Ee, +p tg +--- +O, bE, (8.1.16) 
where ci, ..., Cn are arbitrary constants. When the matrix A is defective, we do not enjoy having n linearly 


independent eigenvectors, but we get only m of them, where m < n. We do not pursue this case because it requires 
the use of more advanced material—generalized eigenvectors and functions of a complex variable. 


Example 8.1.5: Consider the system of equations with a singular matrix: 
: 2 2 
tx(t) = Ax(t), where A= 1 -1/- 


Since the eigenvalues and corresponding eigenvectors of the matrix A are \ = 0, €) = (—1,1)7 and \ = 1, 
€, = (2,—1)", the general solution of the given equation becomes 


—1 2 
x() =a | l |+ar| “1 |. 
Example 8.1.6: Consider the system of equations with a diagonalizable matrix: 
: 5 —6 
tx(t) = Ax(t), where A= 3 4 |° 
Using eigenvalues and corresponding eigenvectors \ = 2, €2 = (2,1)7 and \ = —1, €_; = (1,1)7 of the matrix A, 


we construct the general solution 
2 1] 1 
x@)=at|T]+art| 4 |. 


Example 8.1.7: Consider the vector differential equation with a defective matrix: 


tx(t)=Ax(é), where A= | > “ar (8.1.17) 
The matrix A has a double eigenvalue 1 = —1 of geometric multiplicity 1. This means that the eigenspace is spanned 


on the vector € = (2,2)7. Thus, one solution of the system (8.1.17) is x:(t) = t~1€, but another linearly independent 
solution has a different form. 


Let 7 = (0,1)? be a generalized eigenvector, so (A +1) = €. Based on the reduction of order procedure 
presented in §4.6.1, we attempt to find a second linearly independent solution of the system (8.1.17) in the form 


x2(t) =t7'In|t]/€+¢7'n. 
Upon differentiation, we obtain 
ko = —t? In|él€ +t 76 — 02 = 0 Pn el + 4-7 (E— 0). 


Since A€ = —€ and An = €—7, the vector x2(t) is indeed a solution of the system (8.1.17) and linearly independent 
from x;(t). 


1. Prove Abel’s theorem 8.8 on page 435 for a 2 x 2 matrix P(t) using the identity 


dW _| 22 y(t) | w(t) 2B | ai(t) w(t) 
—=| £% + dt |, where W(t) = det . 
a | 4 w(t) |T| moe = at) aalt) walt) 
2. Prove Corollary 8.6, page 436. 
3. Let us consider the homogeneous system of equations 
d 3-t 
*_p@x, P(t)=|(2%, | #0). (8.1.18) 
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(a) Compute the Wronskian of two solutions of Eq. (8.1.18): x1 = (#°,t?)7 and x2 = (t?,f-1)7. 
(b) On what intervals are x;(t) and x2(t) linearly independent? 
(c) Construct a fundamental matrix X(t) and compute the propagator matrix X(t)X~1(2). 
4. In each exercise, two vector functions vi(t) and v2(t), defined on the interval —oo < t < oo, are given. 


a) Evaluate the determinant of the 2 x 2 matrix formed by these column vectors V(t) = [vi(t), v2(é)]. 


(a) 
(b) At t = 0, the determinant, det V(0), calculated in part (a) is zero. Therefore, there exists a nonzero number k such 
that vi(0) = kv2(0). Does this fact prove that the given vector functions are linearly dependent on —oo < t < co? 


(c) At t = 2, the determinant det V(2) 4 0. Does this fact prove that the given vector functions are linearly 
independent on —oo < t < co? 


© wo-[} so-[] © 0-0 [4] 


5. In each exercise, verify that two given matrix functions, X(t) and Y(t), are fundamental matrices for the vector 
differential equation x = P(t)x, with specified matrix P(t). Find a constant nonsingular matrix C such that X(t) = 
Y(t)C. 


= 1+ sinh 2¢ -1 _fe/2 e*/2 
(a) P@) = sech2t 2 sinh 2¢ — | : — kee cosh t| ’ 
- bic Van oa earned 


6cosht—4sinht 4sinht—2cosht 


’ 


—3t 6t At t at t 1 p2t t 
_ @e€ 2e°+1 —e _ ile e _ | ete se]. 
) P= | 5 ett se] X= |S fa) ¥= [SS aE 
5 4 2 2 2 
= -s 2t At 1 |4t¢+ 4t° 4t*° — 2t 
—| 3 3 = at 
@ra=[|% i]. x-[T S]. vai Aci]. 


6. Consider the system of equations and two of its solutions 


& = Pix, Pw=| seru za. Ca WAR x)= [7]. 


2-1) tI 


a) Show that the column vectors x;(t) and x2(t) are solutions to the given vector equation x = P(t)x. 


(a) 
(b) Compute the Wronskian of x; and x2. 
(c) 


(d) Construct a fundamental matrix X(t) and compute X(t)X~'(2). 


On what intervals are x;(t) and x2(t) linearly independent? 


7. In each exercise (a) through (h), a nonsingular matrix X(¢) is given. Find P(t) so that the matrix X(t) is a solution of 
the matrix equation X = P(t)X. On what interval would X(t) be the fundamental matrix? Hint: P(t) = X(t)X71(t). 


sint 1 t t? tant 1 ee 
i) cost t (b) he | i ee 1 (4) E t 
sinht t Int 1 tanht t t tt 
2) ee | (f) ie | (8) ee i (h) E ‘| 
8.2 Constant Coefficient Homogeneous Systems 
This section provides a qualitative analysis of autonomous vector linear differential equations of the form 


y(t) = Ay(t), (8.2.1) 


where A is an n x n constant matrix and y(t) is an n-column vector (which is n x 1 matrix) of n unknown functions. 
Here we use a dot to represent the derivative with respect to t: y(t) = dy/dt. A solution of Eq. (8.2.1) is a curve 
in n-dimensional space. It is called an integral curve, a trajectory, a streamline, or an orbit of the system. 
When the variable t is associated with time, we can call a solution y(t) the state of the system at time t. Since a 
constant matrix A is continuous on any interval, Theorem 6.3 (page 371) assures us that all solutions of Eq. (8.2.1) 
are determined on (—oo,0o). Therefore, when we speak of solutions to the vector equation y = Ay, we consider 
solutions on the whole real axis. 
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We refer to a constant solution y(t) = y* of a system as an equilibrium if dy(t)/dt = 0. Such a constant 
solution is also called a critical or stationary point of the system. An equilibrium solution is isolated if there 
is a neighborhood to the critical point that does not contain any other critical point. If matrix A is not singular 
(det A # 0), then 0 is the only critical point of the system (8.2.1). Otherwise, the system (8.2.1) has a subspace of 
equilibrium solutions. 


Example 8.2.1: Consider a linear system of differential equations with a singular matrix: 


aay, here aalecae a=|> wale 


The characteristic polynomial x(A) = A(A + 3) of the given matrix A has two distinct real nulls \ = 0 and A = —3. 
Since the eigenspace corresponding to the eigenvalue \ = 0 is spanned on the vector (2, 1)7, every element from this 
vector space is a critical point of the given system of differential equations. Indeed, the general solution of the given 
vector differential equation is 

At | 4—e% ae! 


_ At _i 
y(t) =e*"c, where e = 312-2678 de] 


and ¢ = (ci,¢2)" is a column vector of arbitrary constants. As t approaches infinity, the general solution tends to 


1 | 4c;-2c. | ca] 4 c2 | 2 
vo s[ oe [-Fl2[+F[i] » = 


To plot a direction field and then some solutions, we type the following script in Mathematica: 


VectorPlot[{x - 2*y, 2*x - 4 y}, {x, -3, 3}, fy, -3, 3}, 

AxesLabel -> {x, y}, Axes -> True, VectorPoints -> 20, 

VectorScale -> {Tiny, Automatic, None}] 

StreamPlot[{x - 2*y, 2*x - 4 y}, {x, -3, 3}, fy, -3, 3}, AxesLabel -> {x, y}, Axes -> True, 
StreamScale -> {Tiny, Automatic, None}, 

StreamPoints -> {{.1, 2}, {0, O}, {1, 2}, {-1, -1}, {1, 13}, StreamStyle -> {Black, "Line"}] 
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Figure 8.1: Example 8.2.1, direction field, plotted Figure 8.2: Example 8.2.1, some solutions, plotted 
with VectorPlot (Mathematica). with Mathematica. 


As we can see, every solution is the straight line y = 2a + c, for some constant c (see Fig. 8.2), because 


dy — dy/dt | 2a—4y _ 


= = 2. 
dz da/dt «x«—2y 


-3 0 
0 O 
polynomial x(A) = A(A + 3). Since the corresponding system of equations « = —32, y = 0, is equivalent to one 


Let us consider a similar vector equation y = By with the matrix B = | that has the same characteristic 
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uo Ag 0 
equation — = — = — 
: dt «& —32 
toward the stationary line (see Fig. 8.3). The general solution of the equation y = By is 


ro-eren[F A 2]-ae[]ralt 


where cj, C2 are arbitrary constants. Therefore, y(t) > c2(0,1)7 as t > oo. 


= 0, the abscissa y = 0 is a stationary line. The orbits will be horizontal lines, pointed 


Example 8.2.2: Consider another vector differential equation 


= ay, where y= | 30 |, a=| 73 5 | 


The nilpotent matrix A has a double (deficient) eigenvalue 4 = 0 with the eigenspace spanning on the vector 
(1,2)? because its characteristic polynomial is y(A) = det(AI — A) = \?. Solutions of the corresponding linear 
vector differential equation y = Ay are also straight lines, similar to Example 8.2.1. They are pointed in different 
directions and are separated by the stationary line y = 2a (see Fig. 8.4). 
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Figure 8.3: Example 8.2.1, direction field for the sys- Figure 8.4: Example 8.2.2, direction field, plotted 
tem with matrix B. with Mathematica. 

From Theorem 7.14, page 412, we know that the fundamental matrix ®(t, tg) = e4(—*) is a solution to the matrix 
equation Eq. (8.1.3), page 433, for any to. Since for a constant coefficient vector equation (8.2.1) the propagator 
matrix depends always on the difference t — to, it will be denoted as ®(¢ — tg). From the definition of the exponential 
matrix (7.7.1), page 412, it follows that (0) =I, the identity matrix. Multiplication of the propagator ®(t) = e* 
by a constant nonsingular matrix C leads to another fundamental matrix X(t) = e4'C that satisfies the matrix 
equation (8.1.3) subject to the initial condition X(0) = C. 

The following statements give us the main properties of the exponential matrix. 


Theorem 8.10: Let A be an n x n matrix with constant real entries. The propagator ®(t) = e4* is a 


fundamental matrix for the system of differential equations (8.2.1). In other words, the column vectors of the 
exponential matrix e“* are linearly independent solutions of the vector equation y(t) = A y(t). 


Corollary 8.7: Let A be an n x n matrix with constant entries. Then the exponential matrix ®(t) = e4 is 


the unique solution of the matrix differential equation subject to the initial condition 


d® 
a =AP, P(0) =1 (8.2.2) 


where I is the n x n identity matrix. 
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Theorem 8.11: Let Y(t) = [yi(t), yo(t), .-., yn(t)] be a fundamental matrix for the vector differential equa- 
tion (8.2.1) with a constant square matrix A. Then 


eAlt-to) — ¥(t)¥—1 (to) = B(t — to). 


Corollary 8.8: The inverse matrix to ®(t) = e4! is B-!(t) = e~ A". 


Theorem 8.12: For any constant n x n matrix A, the column vector function 
y(t) =e" c= a yilt) + ca ya(t) + --- + cr yn(t) (8.2.3) 


is the general solution of the linear vector differential equation y = A y(t). Here y z(t) is k-th column (k = 


1,2,...,n) of the exponential matrix @(t) = e4* and c = (ci,c2,...,¢n)" is a column vector of arbitrary 
constants. Moreover, the column vector 
y(t) = B(t — to) y(to) = e*"— yo (8.2.4) 
is the unique solution of the initial value problem: 
y(t)=Ay(t), y(to) =yo- (8.2.5) 


If y(t) is a solution of a constant coefficient system y(t) = A y(t) and if to is a fixed value of t, then y(t + to) 
is also a solution. However, these solutions determine the same trajectory because the corresponding initial value 
problem (8.2.5) has a unique solution expressed explicitly through the propagator matrix y(t) = ®(t)yo = e“‘yo. 

So if two solutions of the same linear system of equations (8.2.1) with constant coefficients coincide at one point, 
then they are identical at all points. Thus, an integral curve of the vector differential equation y = Ay is a trajectory 
of infinitely many solutions. Therefore, distinct integral curves of Eq. (8.2.1) do not touch each other, which means 
that the vector equation y(t) = A y(t) has no singular solution. 


Example 8.2.3: If A is a square constant matrix, then software packages can be used to calculate the fundamental 
matrix ®(t) = e* for the system y = Ay. After the matrix 


13 =U 
An| 21 16 | 


has been entered, either the Marima command 
load(linearalgebra) $ 

matrixexp(A,t); 

the Maple” command 

with(LinearAlgebra): MatrixExponential(A,t) 
or MatrixFunction(A,F, lambda) with F(A) = e*, or 
with(linalg): exponential(A,t) 

the Mathematica® command 

MatrixExp[A t] 

or the MATLAB® command 

syms t, expm(A*t) 

yields the matrix exponential (propagator) 


at __ | 15e'—14e* 10e’—10e% 
~ | 216e7*-Q1e! 15e%*—14e! |° 


If we seek a solution of Eq. (8.2.1) in the form y(t) = ve, then by substitution for y into Eq. (8.2.1), we obtain 
(AI- A)v =0. 


Therefore, \ is an eigenvalue and v is a corresponding eigenvector of the coefficient matrix A. So if an n x n matrix 
A has m (m < n) distinct eigenvalues A,, k = 1,2,...,m, then the vector differential equation y(t) = A y(t) has 
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at least m linearly independent exponential solutions v; e**' because eigenvectors vz corresponding to different 
eigenvalues A; are linearly independent (Theorem 7.7, page 387). 

If the matrix A is diagonalizable, then we have exactly n linearly independent solutions of the form v; e**'. The 
following theorem states that we have at least m linearly independent exponential solutions to the vector differential 
equation y = Ay, where m is the number of distinct eigenvalues of the constant matrix A. 


Theorem 8.13: Suppose that an n x n constant matrix A has m (m < n) distinct eigenvalues \1, Az, .--; Am 
with corresponding m eigenvectors v1, V2, ---; Vm. Then the column functions 
yi(t) =vie*®, yo(t)=voe™*, ..., Ym(t)=vmerm 


are linearly independent solutions of the vector equation (8.2.1). 


Theorem 8.14: Suppose that an n x n constant diagonalizable matrix A has n real or complex (not necessarily 
distinct) eigenvalues \1, A2, ..., An with corresponding n linearly independent eigenvectors v1, va, .--, Vn. Then 
the general solution of the homogeneous system of differential equations y(t) = A y(t) is 


G ib Ant 
y(t) =avi Co Pejvie +e teva ™ (8.2.6) 
where Cj, C2, .--, Cn are arbitrary constants. | 


It should be noted that the general solution in the form (8.2.6) is not convenient for solving the initial value prob- 
lem since it leads to determination of arbitrary constants from an algebraic system of equations. For a nondefective 
square matrix A, let us form a fundamental matrix from the column vectors specified in Theorem 8.14: 


Y(t) = [e**vi, acl ae aval ; 
Therefore, there exists a fundamental matrix Y(t) for a system of homogeneous equations with a nondefective 


matrix A that can be expressed through the exponential matrix and the matrix generated by its eigenvectors vz 
(K 21, 225557): 


Y(t) = levine Vane Vn) = e*t Ivi, va,---,Vn]- (8.2.7) 
Example 8.2.4: Let us consider A = | : : | , then the propagator 
et est — et 
Cad ie 


is an example of a fundamental matrix because its columns are linearly independent solutions of dx/dt = Ax. 
Substituting x = e*v into the latter equation, we obtain two linearly independent solutions x; = e*!*v, and 
x2 = e%?! vy, where v; = (1,0)? and v2 = (1,1)" are eigenvectors corresponding to two eigenvalues A; = 1 and 
A2 = 3, respectively. Therefore, another fundamental matrix for the same system will be 


vo-[e 2 ]ebemeel=(ef] fl) 


and 
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Example 8.2.5: Suppose that two square nxn matrices A and B satisfy the anticommutative relation AB = —BA. 
Then A*B = (—1)* BA* for any nonnegative integer k. By expanding the exponential matrix e“* into the power 
series, it then follows that Be“* = e~4'B. Let y(t) be the solution to the initial value problem 


d 
Y=BeAty, — (0) =yo. (8.2.8) 


Calculations show that 


Since et y = yo when t = 0, and e4* y = e(4+B)t yy, we obtain the desired solution of the IVP (8.2.8) to be 


y(t) = e~ At elAtB)t yy, 


8.2.1 Simple Real Eigenvalues 


For a nondefective (see Definition 7.9 on page 387) constant matrix A, every solution of the vector equation y = Ay 
can be rewritten as the sum of exponential terms (Theorem 8.14 on page 443) 


y(t) =e**e= fe + foe +--+ be", (8.2.9) 


where €, = Cy Vx and vx are eigenvectors corresponding to eigenvalues A,, and cz are arbitrary constants, k = 
1,2,...,n. In what follows, we consider only vector equations y = Ay with nonsingular matrices. In this case, the 
system has only one critical point—the origin. The objective of the following material in this section is to provide a 
qualitative description of solutions to the vector equation y = Ay in a neighborhood of the isolated critical point. 

If all eigenvalues are real and negative, then exponentials in Eq. (8.2.9) decrease very fast as t increases. This 
means that the general solution approaches the origin when ¢ is large. We say in this case that the origin is the 
attractor. Every solution approaches the origin as t > oo, hence it is asymptotically stable. If all eigenvalues 
are real and positive, then every solution moves away from the origin (except the origin itself to which corresponds 
c = 0), it would be natural to call the origin a repeller and to refer to this critical point as unstable. 

If some of the eigenvalues are real and positive and some are real and negative, then we cannot call the origin 
a repeller or attractor. Such a case deserves a special name—we call it a saddle point. For example, let A; > 0 
while all other eigenvalues are real and negative. Then, the solution y; = €; e*!’ approaches infinity as t > 00. As 
t increases, all other linearly independent solutions y;(t) = €; e***, k = 2,3,...,n, approach zero. This means that 
the solution (8.2.9) is asymptotic, as t + oo, to the line spanned on €; = civ (unless c; = 0). The presence of 
solutions near the origin that move away from it would lead us to call the origin unstable. 

Now we turn our attention to the two-dimensional case assuming that A in Eq. (8.2.1) is a 2 x 2 constant matrix 
and y(t) is a 2-column vector function. Then the system (8.2.1) is called a planar system, and we can exhibit 
qualitatively the behavior of solutions by sketching its phase portrait—trajectories with arrows indicating the 
direction in which the integral curve is traversed. Visualization of planar systems not only facilitates understanding 
of geometrical properties of solutions, but also helps in examination of higher-dimensional systems. 

Suppose that the characteristic polynomial .(A) = det (AI — A) of a 2 x 2 matrix A has two distinct real nulls, 
that is, x(A) = (A — A1)(A — Ag), Ar A Aa, and Aj, Az are real numbers. Then any solution of Eq. (8.2.1) is 


y(t) = crv e™* + cove e*?* = & e™* + Be, (8.2.10) 


where v1, 6) = civ and vo, €2 = c2Vv2 are linearly independent eigenvectors corresponding to eigenvalues A, and 
A2, respectively; here c,, c are arbitrary constants. 

Let Ly and Lz denote lines through the origin parallel to v; and v2, respectively. A half-line of LZ; (or L2) is 
the ray obtained by removing the original along with the remaining part of the line from L, (or Lz). Let Az be the 
largest eigenvalue of the matrix A. To emphasize it, we associate a double arrow with vector v2, see Fig. 8.5. 

Letting c; = 0 in Eq. (8.2.10) yields y(t) = cove e**. If co £0, the streamline defined by this formula is half-line 
of Lz. The direction of motion is away from the origin if A2 > 0, toward the origin if Ag < 0. Similarly, the trajectory 
of y(t) = civ; e*! with c, £0 is a half-line of Ly. 
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N cy > 0, co < 0 - ia 
N 


Pe C1 > 0, c2 > 0 


Cc. <0, @>O0 


Figure 8.5: Open sectors bounded by DL, and Lo. 


Henceforth we assume that c; and cg in (8.2.10) are both nonzero. In this case, the solution curve cannot touch 
or cross L, or Lz since every point on these lines belongs to the trajectory of a solution for which either c; = 0 or 
co = 0. Hence, the streamline of (8.2.10) must lie entirely in one of the four open sectors bounded by L, and Lz, 
but should not contain any point from these lines. The position of trajectory is totally determined by the initial 
point y(0) = civ1 + cave. Therefore, the signs of c1, cz determine which sector contains the solution curve. 

Assuming \2 > 1, we factor the exponential term e*2* to obtain 


y(t) = [a+ gr emer 


The term €, e~2)* is negligible compared to € for t sufficiently large since Ay — \2 < 0. Therefore, the trajectory 
is asymptotically parallel to Lz as t > oo. The shape and direction of traversal of the streamline depend upon the 
signs of eigenvalues. We are going to analyze these cases separately. 

Suppose that A, and 2 are both negative. Let, for example, 41 < Az < 0. The solution moves toward the origin 
tangent to £2 = ceVv2 as t — +00 and we have an asymptotically stable critical point x = 0. We say in this case 
that the critical point is a node or a nodal sink. 

If eigenvalues A, and Ag are both positive, say 0 < Ay < Ag, then the solution y(t) moves away from the origin 
and we call the critical point y = 0 a nodal source (unstable). 

Now assume that the given diagonalizable matrix has two real eigenvalues of different sign, Ay < 0 < A2. Then 
the general solution is comprised by a linear combination of the exponential terms 


» 


Ait t 
y(t) =ce*' v1 + ce?" Vo 


9 


where v, is the eigenvector corresponding to the negative eigenvalue A; and ve is the eigenvector corresponding to 


the positive eigenvalue \2. Since one of the solutions, c; e*!’, tends to zero as t > 00, while the other one, cz e%2*, 


grows boundlessly when cz ¥ 0, the origin is called a saddle point, and it is unstable. The lines through the origin 
along the eigenvectors separate the solution curves into distinct classes (see Fig. 8.8, page 447), and for this reason 
each line is referred to as a separatrix. 


Example 8.2.6: (Nodal source, repeller) The matrix 


ey 


has two positive eigenvalues Ay = 1 and A2g = 2 with corresponding eigenvectors 


m=tu=[7], w=2ve=[ 4 | 


Then the general solution of Eq. (8.2.1) with this 2 x 2 matrix A is 


y(t) =cqe'viter e7 v5 
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with two arbitrary constants cy and cp. The corresponding direction field is presented in Fig. 8.6, where the domi- 
nating vector is indicated with double arrows. 


x'=2y, y=—-x+3y , : 
mr f / Pig’ df of fit 1 dd ble 
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Figure 8.6: Example 8.2.6, repeller, plotted with Figure 8.7: Example 8.2.7, attractor, plotted with 


Mathematica. Maple. 


Example 8.2.7: (Nodal sink, attractor) The matrix 
1 -2 
a= [4 os) 


has two negative eigenvalues Aj = —1 and Ag = —3 with corresponding eigenvectors 


Using the exponential matrix 


we obtain the general solution of y = Ay: 
y(t) =ce-*vi + Qe * vo = e*™* [vi, vale 


with an arbitrary constant vector ¢ = (c1,¢2)". Here [v1,v2] is the square matrix of column eigenvectors v1 and 
vg. The phase portrait of the corresponding system is given in Fig. 8.7. 


Example 8.2.8: (Saddle point) The matrix 
0 2 
“(1 
has one negative eigenvalue 4; = —1 and one positive eigenvalue A2 = 2 with corresponding eigenvectors 


m=-1u=| 2], ro=2, ve=| 7 |. 


The general solution of Eq. (8.2.1) with this 2 x 2 matrix A contains two arbitrary constants c; and co: 


y(t) =c1e-' vj tee" vo = ia [vi, vole, 


ay Ll) 2e*+e* Qe*=2e" = 2 1 ee 
€ ~3 e2t — e-t 2e% 4 et , V1,V2| = 11/7 = . 


where 
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X=2y;, Yoxty 


Figure 8.8: Example 8.2.8, saddle point, plotted Figure 8.9: Example 8.2.9, spiral source, plotted 
with Mathematica. with Mathematica. 


8.2.2 Complex Eigenvalues 


Suppose that a real-valued square matrix A of the system of differential equations y(t) = A y(t) has a complex 
eigenvalue \ = a+ jG with w = u+ jv being its associated eigenvector. Here j is the unit vector in the positive 
vertical direction on the complex plane so that j? = —1. Since all entries of the matrix A are real numbers, \ = a—jG 
is also an eigenvalue of A with associated eigenvector W = u—jv. According to Theorem 8.14, page 443, the general 
solution contains the term 

y(t) =c.w elotsye 4 C9 Ww elo i8yt 


where c; and cg are arbitrary (complex) numbers. To make y(t) real, we have to assume that co = @| =a — jb is 
the complex conjugate of c, = a+ jb, in which a and 6 are some real constants. This means that (a +j3)(u+jv) = 
A (u+ jv), which leads (after separating real and imaginary parts) to two simultaneous vector equations: 


Au = au — Sv and Av =av-+ fu. 
Using Euler’s formula, e&? = cos @ + jsin@, we transform y(t) into a real-valued form: 


y(t) = ce (ut jv) ef)! + 69 (a — jv) elo Jt 

(a + jb) (u + jv) eH)! + (a — jb) (u— jv) ef -J9 

e** [(a + jb) (u + jv) (cos Bt + jsin Bt) + (a — jb) (u— jv) (cos Bt — jsin Bt)] 
= 2e™ [(au — bv) cos Bt — (bu + av) sin Bi}. 


If we denote €, = 2(au — bv) and 2 = —2(bu + av), then y(t) has the form 
y(t) =e™ [€1 cos Bt + €9 sin Bit], (8.2.11) 


where €; and & are real-valued vector solutions of the following system of algebraic equations: 


AE; = a€) + BEo, A€a = a€2 — BEy. 


The trigonometric functions cos Gt and sin Bt are both periodic with period 27/|6| and frequency |3|/(27), mea- 
sured in hertz (|| is called the angular frequency). Consequently, the vector-valued function e~™ y(t) exhibits an 
oscillating behavior. 

If a 2 x 2 matrix A of the system y = Ay has complex conjugate eigenvalues 1 = a +j§, we refer the origin as 
a spiral point. If its real part is negative, a < 0, the point is asymptotically stable because all solutions approach 
0, and the point is called an attractor. If a is positive, all solutions leave the origin, and the critical point O is an 
unstable spiral point (repeller). When the real part is zero, ReA = RA = a = 0, all solutions oscillate around the 
origin. We refer to this last case as a center (stable but not asymptotically stable). 
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Example 8.2.9: (Spiral source) Let us consider the initial value problem for the system of ordinary differential 
equations 


& = x(t) + 2y(t), 
y = —2a(t) + y(t), 
with the corresponding matrix 
1 2 
A= | 2 | 
Eigenvalues of the matrix A are \; = 1+2j and \z = A, = 1—2j because they annihilate its characteristic polynomial 


(A) = det (AT — A) = (A— A1)(A — Ag) = (A- 1)? +4. 
To find a function of the matrix A applying Sylvester’s method, we first have to determine the auxiliary matrices 


Ady A-14+2j 


Z A = Oo 
(n)(A) Wide. 145-0) = 142] 
313 3)-[4 #1 
4j | —2 2j 3 2 |’ 
Ay Ava 
Za.)(A) = — 


1 [ -2j 2 a 
= ns 4 — => Z A . 
5 | 9 oa | oo (a) (A) 


2 


Therefore, using Euler’s formula, e? = cos @ + j0, we get the fundamental exponential matrix 
; g gS 


cos2t sin 2t 
erage Z(y,)(A) ere Z(2)(A) =e | —sin2t cos 2t | , 


: ; : At. _ ot cos2¢t = sin 2t C1 
Since the general solution y(t) = e“'c = e Ee aiNBe coc oe | | = 
origin is a spiral source (unstable). Substituting c; = 1 and cg = 2, we obtain the required solution to the given 
initial value problem. 

Sometimes it is convenient to use polar coordinates x = r cos6, y = r sin@, where r = \/x?2+y? and @ = 
arctan(y/a). Then the given system of differential equations can be rewritten as 


contains the exponential multiple e’, the 


=2. 


Since these ordinary differential equations are decoupled, we solve them separately to obtain 
r(t) =vb5eF, 6(t) = —26 + arctan(2). 


Hence, trajectories spiral clockwise away from the origin as t increases (see Fig. 8.9 on page 447). 


Example 8.2.10: (Center) Consider another system of ordinary differential equations 


a| ao l=[-¢ al [2] we a-[ 7 4]. 


The matrix A has two pure imaginary conjugate eigenvalues \ = +2j; its fundamental matrix is 
At — [C08 2t + 4 sin 2t sin 2t 
7 —3 sin 2¢ cos 2t — 4 sin 2t| ° 


Therefore, the origin is the center (stable but not asymptotically stable), see Fig. 8.10 on page 449. 
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Figure 8.10: Example 8.2.10, center, plotted with Figure 8.11: Example 8.2.11, proper unstable node, 


Mathematica. plotted with Mathematica. 


8.2.3 Repeated Eigenvalues 


Let us look more closely at the structure of the exponential matrix function e*'. According to the resolvent. formula 
(see §7.6), the exponential matrix function is equal to the sum of residues over all of the eigenvalues of the square 
matrix A: 
eAt= SJ” Res e**R,(A), (8.2.12) 
Nk 
An €o(A) 


where o(A) is the set of all eigenvalues (also called the spectrum) of the n x n matrix A, and R)(A) = (AI— A)~1 
is the resolvent of the matrix. Since the evaluation of the residue of a function may contain, at most, an (n — 1)-th 
derivative with respect to \, such a derivative, when applied to the product g(\) e>*, gives 


de At w (n-l (n—k—1) dt At w (nol (n—k—1) yk ,At 
ee = De ee = Lg Jee 


k=0 k=0 


where (7) = n!/k!/(n — k)! is the binomial coefficient. Therefore, if the minimal polynomial (A) of an n x n 
matrix A contains a multiple (A — Ao), then the residue at the point A = \9 produces a polynomial in ¢ of degree 
m — 1 times the exponential term, e*°’. In general, the exponential matrix e“* is equal to the sum of exponential 
terms e**, times polynomials in t of degree one less than the corresponding multiplicity of its minimal polynomial 
expansion. 

Since the exponential function e*’* grows or decreases faster than any polynomial in t, the behavior of the terms 
from the exponential matrix e“' that correspond to the multiple eigenvalue A; completely depends on the sign of the 
real part of A;. If the real part RA; > 0, the critical point is unstable because solutions containing the exponential 
term e)' = e*s* [cos (GA;t) + j sin (SAjt)] approach infinity. If RA; < 0, the corresponding terms containing 
e*)* die out. When singular matrix has eigenvalue \ = 0 of multiplicity m, its residue at this point turns into a 
polynomial in t of order m — 1. 

Suppose that a matrix has an eigenvalue A* of multiplicity m > 1. If its geometric multiplicity is equal to m, we 
have m linearly independent eigenvectors corresponding to X*, and the origin is called a star or proper node. If the 
geometric multiplicity is less than m, we call it defective and the origin is referred to as a deficient or degenerate 
or improper node. Its stability is determined by the sign of #A*, the real part of A*. 


Example 8.2.11: (Proper node) Consider the differential equation 


: 2 0 

y=Ay, where A= | j a 
which actually consists of two uncoupled equations. The matrix A has double eigenvalue \ = 2 with two linearly 
independent eigenvectors: v1 = (1,0)7 and v2 = (0,1)7. Therefore, the origin is a proper node (or star), which is 
unstable (Fig. 8.11). 
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Example 8.2.12: (Degenerate node) The matrix 


is defective because its characteristic polynomial y(\) = (A — 3)? has one double root \ = 3 to which corresponds 
only one eigenvector (1,—1)". The exponential matrix is 


1-t -i 
At ,3t 
as | t mal 


so the general solution of the vector equation y = A y becomes 


y(t) = | ar | oa | ee 


where c = (ci, C2)? is a column vector of arbitrary constants. As t +> +00, the solution blows up, so the origin is an 


unstable deficient node. 
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Figure 8.12: Example 8.2.12, degenerate unstable Figure 8.13: Example 8.2.13, degenerate stable 
node, plotted with Maple. node, plotted with Mathematica. 
Example 8.2.13: (Degenerate node) The matrix 
0 -l 
a= [1 2 
is defective since its characteristic polynomial y(\) = (A + 1)? has one double root \ = —1 with an eigenvector 


(1,1)7. The exponential matrix is 


_,[ 1+¢ -t 
At -t 
= | t rane 


te, (1 = t)cg t 


roe Bl ee k +t)e. tee tt EB ‘ ss Fe | 


where c = (c1,¢2)" is a column vector of arbitrary constants. As t + +00, the solution approaches zero, so the 
origin is an asymptotically stable deficient node. 
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8.2.4 Qualitative Analysis of Linear Systems 


This subsection is an introduction to vector autonomous differential equations. We summarize the observations 
made previously and give a relatively simple classification of constant coefficient homogeneous vector equations with 
solutions of a specified kind. Treating the independent variable t as time, we discuss the behavior of the solutions 
as t — oo based on an analysis of the corresponding matrix. In particular, we consider the system of differential 
equations 

y = dy/dt = Ay, (8.2.1) 


where y = y(t) = (yi (t), yo(t),---, yn(t))” is an n-column vector of unknown functions (“T” stands for transposition), 
and A is an n xX n square nonsingular matrix that does not depend on time ¢. Its general solution is known to be 


y=eAte, (8.2.3) 
where ¢ = (ci(t), c2(t),---,¢n(t))” is an n-column vector of arbitrary constants. The exponential matrix e“* may 
have a rather complicated form. Nevertheless, we know that each of its entries contains a polynomial of a degree less 
than or equal to n— 1 times the exponential term: e*’, where \ is an eigenvalue of the matrix A. This observation 
gives us a clue to its long-term behavior. When the matrix A is not deficient, the general solution of Eq. (8.2.1) is 
the sum (Theorem 8.14, page 443) of exponential terms 


y(t) = c1 v1 e** + cp vg 0? + + en vn er, (8.2.6) 


where V1, V2, -.-, Vn are linearly independent eigenvectors corresponding to eigenvalues 41, A2, ..-, An (some of 
them can be equal). 

If all eigenvalues are real and negative, then exponentials in Eq. (8.2.6) decrease as t increases. This means that 
the general solution approaches the origin for a large t. We say in this case that the origin is the attractor or sink. 
Every solution approaches the origin as t + oo; hence, it is asymptotically stable. If all eigenvalues are real and 
positive, then every solution moves away from the origin (except the origin itself, which corresponds to c = 0); we 
can call the origin a repeller or source and describe the critical point as being unstable. 

If some of the eigenvalues are real and positive and some are real and negative, then we cannot call the origin 
a repeller or an attractor. Such cases deserve a special name—we call the origin a saddle point. For example, 
let A; > O and all the other eigenvalues be real and negative. Then the solution y;(t) = €;e™ (€; = civi) 
approaches infinity as t + oo unless €; = 0. As t increases, all other linearly independent solutions y;(t) = €, e>**, 
k = 2,3,...,n, approach zero. This means that the solution (8.2.6) is asymptotic to the line determined” by €; 
as t + oo. The presence of solutions that move away suggests that the origin is unstable. As shown in §8.2.3, the 
existence of multiple eigenvalues does not effect the stability of the critical point unless RA = 0. 

When the characteristic polynomial (A) = det(AI — A) has a complex null A = a+ jG, its complex conjugate is 
also an eigenvalue. Since we only consider systems of equations with real-valued matrices, all complex eigenvalues 
appear in pairs with their complex conjugates. To the pair of complex roots of x(A) = 0 of multiplicity m corresponds 
2m real-valued solutions of the system (8.2.1) that have the multiple e® times polynomial in ¢ of degree m— 1 times 
a trigonometric function sin St or cos Gt. Since an exponential function grows/decreases faster than any polynomial, 
the behavior of this solution is determined by the real part a of the eigenvalue \ = a+ jG. If a > 0, we have 
a repeller, if a < 0, we get an attractor. The presence of trigonometric functions leads to oscillating behavior of 
solutions. If this occurs, then the origin is called the spiral point (or focus). 

If pure imaginary eigenvalues \ = +j@ are simple roots of the characteristic equation y(A) = 0, the corresponding 
solution is stable, and the stationary point is called the center. However, if a pure imaginary eigenvalue is defective, 
then the solution is always unstable. 

These observations illustrate the following stability result. 


Theorem 8.15: Let A be a real invertible (det A # 0) square matrix. Then the linear vector equation 
y(t) = Ay(t) has the only one equilibrium point—the origin. This critical point is 


1. asymptotically stable if all eigenvalues of A have a negative real part; 


2. stable but not asymptotically stable if all eigenvalues of A are simple pure imaginary numbers; 


“The line, which is determined by a vector €, consists of all points x = c€, c is an arbitrary constant. So this line spans € and includes 
the origin. 
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3. neutrally stable if all eigenvalues have negative real parts but at least one is a pure imaginary of multiplicity 

1, 

4. unstable if all eigenvalues of defective matrix A are pure imaginary but at least one of them has multiplicity 
larger than 1; 


5. unstable if at least one eigenvalue of A has a positive real part. 


Now we turn our attention to the planar case, assuming that A is a 2 x 2 matrix and y is a 2-column vector in 
Eq. (8.2.1). The two-dimensional case facilitates the visualization of solution curves of the system. When n = 2, the 
system (8.2.1) is a plane system, and we can qualitatively observe the behavior of solutions by sketching phase 
portraits—trajectories with arrows indicating the direction in which the integral curve is traversed. Consider the 
following system of linear differential equations: 


r =aaz(t) + by(t), dy(t 
7 ig Cee or in vector form: ev) = Ay(t), 
y =ca(t)+dy(t), dt 


d 
This system can be solved by the elimination method (see §6.4): 


where the matrix A = |: ; of the system is assumed to be nonsingular (with det A 4 0) and y(t) = (x(t), y(t)”. 


dy yy  ca+dy  a+dy/x 


dx «¢ ar+by a+by/x 


The behavior of the solutions near the origin (which is the only critical point) depends on the nature of the eigenvalues 


A, and Az of the 2 x 2 matrix A: 


ered ; (tr A)? —4 det A, (8.2.13) 


where tr A = a+d is the trace and det A = ad — bc £ 0 is the determinant of the nonsingular matrix A. There are 
four kinds of stability for equilibrium solutions: 


Critical Point Stability 


Center stable trA = 0, detA > 0 
Sink asymptotically stable trA <0 and detA > 0 


Source unstable: all trajectories recede | trA > 0 and detA >0 
Saddle point unstable, but some solutions detA <0 
may approach the critical point 


Further classification of critical points depends on how trajectories approach or recede from them. An equilibrium 
solution y* is called a node if every trajectory approaches it or if every trajectory recedes from it, and these orbits 
do not reverse their directions in the neighborhood of y*. This means that every trajectory is tangent to a line 
through the critical point. 

A critical point is a proper node or star if solution curves approach it or recede from it in all directions. A 
critical point is an improper or degenerate node if all trajectories approach or emanate from it in at most two 
directions. A node is called deficient if the orbits only approach it or recede from it in one direction. 

An equilibrium solution is a spiral point if trajectories wind around the critical point as they approach it or 
recede from it. If solutions near an isolated critical point neither approach it nor recede from it, we call such an 
equilibrium solution a center. 

Therefore, there are five types of critical points: 


e a proper node (stable or unstable); 
e an improper or degenerate node (stable or unstable); 
e spiral point (stable or unstable); 


e center (always stable); 
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e saddle point (always unstable). 


The behavior of solutions near a critical point depends on eigenvalues of the corresponding matrix; therefore, we 
consider the following cases: 


e eigenvalues are real and distinct of the same sign; 

e real eigenvalues of opposite sign; 

e equal eigenvalues; 

e complex conjugate eigenvalues with a nonzero real part; 
e pure imaginary eigenvalues. 


For planar systems, all cases are summarized in the following table: 


Type of Critical Point 
Nodal source (node) 


Nodal sink (node) Aggmptoticaly stable 
Sade pom 


Proper node/star point Unstable 
Proper node/star point Asymptotically stable 


Al = A2 > 0, 
independent eigenvectors 
Ay = A2 < 0, 
independent eigenvectors 


Ai = A2 > 0, 
Improper/degenerate node 


mene eigenvector Unstable 


= = Ag x 0, 
Improper /degenerate node | Asymptotically stable 


eres a>0 
Asymptotically Wal 


Example 8.2.14: Let A be the singular matrix 


1 -7 3 
A=]-1 -1 1 
4 -4 0 


Its characteristic polynomial A(A) = det (AI— A) = A(A? — 16) has a multiple A, so to the eigenvalue \ = 
corresponds the eigenvector yo = (1, 1,2)". Therefore, the origin is not an isolated critical point for the differential 


equation 
y(t) = Ay(t), (yi, Y2, 43)" - 


This means that the equilibrium solutions are spanned on the eigenvector yo. 


where y = 


Example 8.2.15: Consider the vector differential equation 


ae 
g=Ay, with A=|-6 -3 8 |, 
a | 


is a 3D vector function to be determined. The matrix A of the equation has two 
—4. The fundamental matrix is 


where y = (yi(t), y(t), ys(t))? 


pure imaginary eigenvalues A1,2 = 


+j and one negative eigenvalue A3 = 


1 25 cost + 19 sint 3 cost +22 sint —19 cost+ 25 sint 
eA“! = — |-16 cost— 38 sint llcost—27sint 38 cost —16 sint 
ae 8 cost + 2 sint 3 cost +5 sint —2 cost+8 sint 

1 -8 -3 19 

+e] 16 6 —38 

Mv -§ -3 19 
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Because the exponential multiple e~** tends to zero as t +> +00, the general solution oscillates. Therefore, the origin 
is a neutrally stable critical point. 


1. In each exercise (a) through (d), verify that the matrix-function written in vector form Y(t) = [e*!*v1, e*?’va] is the 
fundamental matrix for the given vector differential equation y = Ay, with specified constant diagonalizable matrix A. 
Then find a constant nonsingular matrix C such that Y(t) = e*'C. 


At 


@aa[t gh vo= |e" be Al) 
wal of: vio= |e" lf eG 
© a=[3 vo=[e" [7] GI} 
@ a= 7) - cla [levers ieorral| 


2. Compute the propagator matrix e“' for each system y = Ay given in exercises (a) through (d). 


(a) «= 6x — Gy, y = 4a — 4y; (c) & = 9x + 2y, y = 2x 4+ by; 
(b) & = lla — 15y, y = 6x — 8y; (d) i= 92 — 8y, y = 6x — By. 


3. In each of exercises (a) through (h): 
e Find the general solution of the given system of homogeneous equations y = Ay and describe the behavior of its 
solutions as t + +00. 


e Draw a direction field and plot a few trajectories of the system. 
2 3 3. 2 -3 #4 5 6 
@ [th w) [% GO): © [P 4); @ |*, Sy: 
7 6 13. 4 6 -7 4 2 
© [Fgh @l2 Gk @ [to om) [} 2). 


4. For each of the following matrices in the homogeneous equation y = By, classify the critical point (0,0) as to type, 
and determine whether it is stable, asymptotically stable, or unstable. In the case of centers and spirals you are asked 


to determine the direction of rotation. Also sketch the phase portrait, which should show all special trajectories and a 
few generic trajectories. At each trajectory, the direction of motion should be indicated by an arrow. 


e In the case of centers, sketch a few closed trajectories with the right direction of rotation. For spirals, one generic 
trajectory is sufficient. 

e In the case of saddles or nodes, the sketch should include all half-line trajectories (corresponding to eigenvectors) 
and a generic trajectory in each of the four regions separated by the half-line trajectories. The half-line trajectories 
should be sketched correctly, that is, you have to compute eigenvalues as well as eigenvectors. 


e In the case of nodes you should also distinguish between fast (indicated by a double arrow, which corresponds to 
the largest eigenvalue) and slow (with a single arrow) motions. 


13 4 0.2 1 3 1 1 8 4 3 
al i , on ie al = k 4 cial E | 7 bi 
2 —5 2 =25 4 —10 1 4 > 4 
Bem [lc] Pre Si] mele ce]s ee [A af Be | A 
x 3 2 15 2 45 2 6 2 5 
Bu = i A ; Bi = Ee | >; Bizs= Ee 5 ; Bu= E 5 5 Bis = ie =| : 


5. Find the general solution of the homogeneous system of differential equations y = Ay for the given square matrix A 
and determine the stability or instability of the origin. 


@ [% Sy): w) [9]: c) [a =): q@) |°, 2; 
@ [4 3): eee ae @ila 2) Gy ah 
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8.3. Variation of Parameters 


Suppose we need to find a particular solution x,(t) of the inhomogeneous linear system of differential equations with 
variable or constant coefficients 

x(t) = P(t)x+ f(t), (8.3.1) 
where P(t) = [p;;(t)] is a given n x n matrix with continuous coefficients p,;(t) (i,j = 1,2,...,n), f(t) is a known n- 
column vector ((n x 1)-matrix) of integrable functions, and a dot denotes the derivative with respect to t: x = dx/dt. 
Here x(t) = (a1, 22,..-,2n)? is the column vector of n unknown functions that are to be determined. Suppose that 


we know the fundamental matrix X(t) for the associated homogeneous system x(t) = P(t)x. This matrix is 
convenient to write as a collection of linearly independent column vectors 


X(t) = [xi(t), xo(t),..., xn(t)], det X(¢) £0, 
where each n-column x,;(t), k = 1,2,...,n, is a solution of the homogeneous equation: 
x(t) = P(t) x(t). (8.3.2) 


for an arbitrary constant column vector c. The variation of parameters method (also sometimes referred to as 
Lagrange’s method) calls to replace the constant vector c with a variable vector u(t), and seek a particular solution 
of Eq. (8.3.1) in the form 


X(t)u(t)=f(t) or u(t) =X 7!(t)f(t). 
Thus, if u(t) is the solution of the latter equation, namely, 
u(t) = / X(t) £(t)dt+e, 
where c is an arbitrary constant vector of integration, then a particular solution becomes 
xp(t) = X(t)u(t) = X(t) / X71 (t) f(t) dt + X(t)c. 


The latter term X(t) c is the general solution of the complementary equation x(t) = P(t) x while the former gives a 
particular solution of the nonhomogeneous equation (8.3.1). Therefore, we have proved the following statement. 


Theorem 8.16: If X(¢) is a fundamental matrix for the homogeneous system x = P(t) x(t) on some interval 
where the square matrix-function P(t) is a continuous and column vector f(t) is integrable, then a particular 
solution of the inhomogeneous system of equations (8.3.1) is given by 


x,(t) = X(t) i X~1(t) £(t) dt = X(t) i X-1(r) f(r) dr. (8.3.3) 
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Corollary 8.9: If X(t) is a fundamental matrix for the homogeneous system x = P(t) x(t) on some interval 
|a, b|, where P(t) and f(t) are continuous, then the initial value problem 


x= P(t)x()+f@),  x(to.)=x0 (to €|a,0)), 


has the unique solution: 


x(t) = X(t)X~1(to) xo + X(t) | X-1(r) f(r) dr. 


Example 8.3.1: Find a particular solution of 
. 1 et 1 
x(t) = P(t) x(t) + f(t), where P(t) = 3 , &(t)= ae 


given that 


is a fundamental matrix for the complementary system x = P(t)x. 
Solution. We seek a particular solution x,(t) of the given nonhomogeneous vector differential equation in the 
form 


xp(t) = X(t)u(t), 


where u(t) = (ui(t), u2(t))? is an unknown 2-vector of functions ui(t), u2(t) to be determined. Substituting x,(t) 
into the given equation leads to 


X(tju(t) =f(t) or = u(t) = X 1 (4) (0). 


The inverse of the fundamental matrix is 


1 e2t 1 1 
-1 
Xx (= 7a | 4t » | =5 


ono [F SS] [S]-[ 7] 


By integrating and eliminating the constants of integration, we get 


-f[T Jee] 


Therefore, 


Hence, a particular solution becomes 


3 et 2t 2t 
P@=| e* 2 e& |, FH=] & |, 
e 7% et 4 —2 
given that 
5t e2t 0) 
X(t)=|] e* O e 


et -1 —-1 


is a fundamental matrix for the complementary system x = P(t)x. 
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Solution. The determinant of X(t) is the Wronskian 
det X(t) = 3e% £0. 


Therefore, the inverse matrix exists everywhere and 


i et e2t est ‘ eat e74t  e73t 
x") _ ; a 2 ett —ert — bt i a 2e72t —et —1 
€ —e4t edt _ bt —e-2t e-t =i 


We seek a particular solution in the form x,(t) = X(t)u(t), where d = X~1(t)f(t). Integration yields 


; a: ert est e2t S 
v= i X71 (t)f(t) dt +e = a / Ze? —e! et! | et | dt+ lee 
ett 9¢5t _-6t|] |_9 ©3 
l 0 0 
=3 i} 3 | dt+c=] t | +e, 
3 t 
where c = (c1,C2,¢3)7 is an arbitrary constant column vector. Substituting this result into the solution form 
x(t) = X(t)u(t), we get the general solution 
git ce" + cp e2t 
x(t) =X(t)} t | +X@c=t] e | +] caret tcset 
t —2 c, e — cg — 63 


8.3.1 Equations with Constant Coefficients 


In this subsection, we will reconsider the driven vector differential equation (8.3.1) when the corresponding matrix 
has constant entries: 
y(t) =Ay(t) + f(t), t € |a, bl. (8.3.4) 


From Theorem 8.16, page 455, it follows 
Corollary 8.10: If A is a constant square matrix, then the initial value problem 


y=Ay(t)+f(t), — y(to) = yo 


has the unique solution: 


ie is 
yas vy, +f eA) t(7) dr = eA) yy + eM | e * f(r)idr, (8.3.5) 


to to 


The general solution of the nonhomogeneous vector equation (8.3.4) is the sum 


y(t) = yn(t) + yp(t), 


where y;,(t) = e“'c, with arbitrary constant vector c, is the general solution (which is usually referred to as the 
complementary function) of the corresponding homogeneous equation y = Ay, and y,(t) is a particular solution of 


the nonhomogeneous equation (8.3.4): 


t t 
Yp= 7 eAC—1) f(r) dr = a eA? £(r) dr, (8.3.6) 
to 


where tp is an initial point. We clarify the construction of the general solution with the following elaborative 
examples. 
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Example 8.3.3: Consider the following system of inhomogeneous constant coefficients differential equations y = 


Ay +f with 
1 1 et 
a-[2 4] om toe[S] 
Since the given matrix A has two distinct real eigenvalues A; = —3 and Ag = 2, we get the fundamental matrix 
(propagator) 


ot 1 [ 4e2 4+ 6-3 e2t — e—3t 
(t) —e = a F e2t — 4e-3t ¢2t 4. 4 e—3t 


Then we multiply e“—7) by f(r) to obtain 


4 ett T) a e7 3(t— T) e2(t—7T) _ e—3(t-T) eT 
4 el (t—T) — 4e-3t- 7) e2(t 7) +4e 3(t—T) 

A e2t— Te - e 3tt4ar Sis e2t—3T _ ebb ter 

ap 2t—-T _ e7 3t+4r aif e2t—3T ele se 32 


eA T) 


ot fe T 1 9} eft / oe e73T 1 ae —e2T 
e one Ee Tee e73T 7 ee 4e2t : 
Choosing the origin as the initial point when ¢ = 0 and using antiderivatives of the four functions 
t t 
af e dr=—e # +1, of e'dr=e*_-1, a=1,3, b=4,2, 
0 0 


we get 


Upon simplification, we obtain 


foronn-e [4] -4f))---Galtl-ab) 
ef] G-) 16-3) 


The last two terms can be dropped because they can be included in the complementary function e** ¢ with appro- 
priate constant vector ¢ = (c1,¢2)’. Substituting this integral into Eq. (8.3.6), we obtain a particular solution 


At 


Using Maxima, we can check the results: 

load(linearalgebra) $ 

A: matrix([1,1],[4,-2]); 

x0: matrix([a],[b]); /* arbitrary constants */ 
ev(matrixexp(A,t).x0); /* solution of the homogeneous equation */ 
f(t) := matrix([exp(t)], [exp(-t)]); 

integrate (matrixexp(A,t-s) .f(s),s,0,t); 


Example 8.3.4: Find a particular solution of the driven system 


11 -4 


Hy=avo +t, A=| 95 U9]. t= [SP]. 
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Solution. The general solution of the complementary system y = Ay is 


yn(t) = eA*e, 
where 
At _ t 1 + 10t —At a t 1 0 t 10 —4 _ C1 
— oe = 4 - igi | =" | o 1 | F 8" | os 19) 822 mle 


is a column vector of arbitrary constants. Therefore, the general solution of the homogeneous equation y = Ay 


becomes 
At. ot C1 t 10c, = 4c9 
yn(t) =e*"c=e a +te Bs 7 ite , 


We seek a particular solution y,(t) of the given inhomogeneous vector differential equation in the form 
_ Abie x] TRIO ~=ab ux(t) 
A aa | 25t 1—10t | | uo(t) |’ 
where u(t) = e~“' f(t). From Eq. (8.3.5), it follows that 


[ Aeraar 
er 
| 


dret-t fe + 10¢— 107) (14+ 2e7) —4(t-—r)(14 | 
Te |14.5¢(3 + 10e7 — 27) + 27 (57 — 7 — 25€7) 


_ [et (344+ 102?) — 3 — 4¢ 
~ jet (7+ 5¢ + 257) — 7— 11¢] ° 


To check our calculations, we differentiate y,(t) to obtain 


y(t) et 7+24t+10t7| [4 
YOM) = © N94 55 4 2542 ti? 


On the other hand, 


he | = ,[3+4t+ 1007] [3+ 4¢ 
Yp~lo5 —9| \© |7 + 5¢4 252 Aad 
— [5+4+24¢+ 10] , | 5 
~ 1124 55¢+ 2542/}° ~ 1244] 


sO 


2 t 
De roe ‘ | 5 eed 


12+ 55¢+ 2542/° ~ |12+¢ 14¢ | ye) 


Mathematica is helpful to verify the particular solution obtained: 


A = {{11, -4}, {25, -9}} 
f[t_] = {{1 + 2*E"t}, {1 + t}} 
Integrate [MatrixExp[A*(t - s)].f[s], {s, 0, t}] 


Example 8.3.5: Consider the nonhomogeneous equation 


dy(t) 


MO — ayo +O, A=[7 3]. #O=| cae |: 


cost 


Cute 


The characteristic polynomial y(A) = det(AI — A) has two complex nulls (eigenvalues of the matrix A) Ai. = +j. 
The fundamental matrix is 


gate cost + 2sint —d5sint 
— sint cost —2sint | 
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Therefore, the general solution y(t) is the sum of a particular solution, 


xf hee _f —5cos(r) sin(t — 7) 
yp(t) = | ent) f(r) dr = | a ee ers 7) — 2sin(t — =| 


a —5t sint 
~ 9 |t cost — 2t sint+sint] ’ 


and the general solution y;(t) of the associated homogeneous equation 


yi) =e ese | cost + 2sint | Sy | —5sint | . 


sint cost — 2sint 


where c = (ci, 2)? is a column vector of two arbitrary constants c, and cz. The following Maple commands were 
used to check the calculations. 


with(LinearAlgebra) : 

A := Matrix(2, 2, [2, -5, 1, -2]) 

funct := t-> <0, cos(t)> 

u := MatrixExponential(A, t-s).funct(s) 
simplify(map(int, u, s=0.. t)) 


1. In each of the exercises, (a) through (h), find the general solution of the nonhomogeneous planar system of equations 
x(t) = P(t) x(t) + f(t), given that X(t) is a fundamental 2 x 2 matrix for the complementary system. 


1/2 ¢ Pe 
@ POse| 5 ole 2s) 4 |» Roel a oy 
— tan 2t sec 2t sin 2t sint cost 
(by Pt -| sec 2t ek . 1 FE AY) = cost al 
l-e-** et et et cosht 
@ Pe=[" fe fox]. t=[4 |. xe-[o oh |, 
1 


1 1 2t 2t tcos2t —t sin2t 
ou =| oe 1 Ik beac KO=| (oo t cos 2t li 
1 [sec? 2t (2 + cos 4t — sec? 2t cos 4t 8 
(e) Pw=s |e, BE a , f= | 
2 3 csc” 2t cos 4t 2 esc 4t (2 + cos 4t) 24 


? 


sec 2t — sin 2t 
i a ; csc 2t cos 2t 


(g) PW) =|) ale gst] 7 x =| ele 
(h) 2O=2 11 0 be f=" a x=] > 7]. 


2. In each of the exercises, (a) through (f), find a particular solution of the nonhomogeneous system of equations x(t) = 
P(t)x(¢) + f, given that X(t) is a fundamental 3 x 3 matrix for the complementary system. 


(a) 
1 3 e* -1 et 0 et 2 et 
P=5 Qe’ 4 —2e'|, X(t)=!]0 ef e*!|, fale]; 
1 -e* 1 e 1 0 —e* 
(b) 
Qet_—et ett 4 4 00-%_4 eat 
1 
~ 2 sinht a u 
ee aet —8e' sinh? t e =e 
et et e2t 1 
KOH=/8 1 Dl, fete}; 
—t t —2t t 
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(c) 
6t? + 4 —8t* 8? 
ae ee ee 
3 t 2t t ? 
1+ 2¢ 0 0 0 
2t 60488 
x@=|1t 12 of, f= late ah 
0 1 1 
(d) 
1 1 2t O 1 cos 2t sin 2t 1 
VA oars —t 1 ti, X(t)=t]O —sin2t cos2t|], f=2t}1] ; 
QO -—2t 1 1 -—cos2t —sin2t 1 
(e) 
6—1/t e **(6t—1)-8 e** (1 — 6t) 
P(t) = 4 —6—1/t—4te~™ —4e7%# 
4 Ate ** — 8 2 1/t—4ter* 
Ber =e-2 2t 3 
X(t) = et e~ oO}, f=t]}2]; 
t ett — _p-2t 9. 2t 2 
(f) 
+-6 347 3 a a ig 
2 
P(t)=|-12-2 6+2 3], X(t)= |2¢2 @ -1], f=3]t 
12 -6 O 0 22 2 0 


3. In exercises (a) through (h), use the method of variation of parameters to find the general solution to the vector equation 


y = Ay +f with a constant 2 x 2 matrix A. 


(a) ie AF f= eae (e) ie | 
Oley aly 225]: Olas] 
(c) lag ab c ek (e) | = : | 
(8). ee fay, w [$3] 


4. In exercises (a) through (d), use the method of variation of parameters to find the general solution to the vector equation 


y(t) = Ay(t) + f(t) with a constant 3 x 3 matrix A. 


2 =A 


—2 —2 sinht —2 -2 

(a) |} 4 2 -2), f= |10cosht] ; (c) | -2 1 
-1 3 1 5 -4 -2 

2 6 -2 50 e! 3 -2 

(b) 6 2 -2|, f= ]21le*]; (d) i =i 
-1 6 1 9 —2 2 


4 0 

> rae erties as (aD 
6 et 

3 

2 cf. | Der? 
—2 0 


5. In exercises (a) through (d), use the method of variation of parameters to find a particular solution to the vector 
equation y(t) = Ay(t) + f(t) with a constant 2 x 2 matrix A that satisfies the initial conditions y(0) = (1,—1)”. 


[43] ita) o [fat] = [e) 
| ea wets | @ [2 


F 
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8.4 Method of Undetermined Coefficients 


In this section, we present the technique applicable to nonhomogeneous constant coefficient differential equations 
when the forcing functions are of a special form (compare with §4.7.2). Namely, we consider the inhomogeneous 
vector linear differential equation 


Yay) +t@), — t€ (ad), (8.4.1) 


with a constant square matrix A and the nonhomogeneous term f(t), also called the driving term or forcing term, 
to be of a special form: it is assumed that f(t) is a solution of some undriven constant coefficient system of 
differential equations. In other words, the forcing term f(t) in Eq. (8.1.7) is a linear combination (with constant 
vector coefficients) of products of polynomials, exponential functions, and sines and cosines because only in this 
case f(t) = Mf(t) for some constant square matrix M. The main idea of the method of undetermined coefficients is 
essentially the same as for a single linear differential equation when we make an intelligent guess about the general 
form of a particular solution. Namely, the method considers the following two cases: 


e The forcing vector function f(t) is not a solution of the complementary equation y(t) = A y(t). 
Then a particular solution has the same form as the driving function f(t). 


e The input term f(t) is a solution of the complementary equation y(t) = A y(t). Then a particular 
solution has the same form as the driven function f(t) multiplied by a polynomial in t of a degree equal to the 
multiplicity of the corresponding eigenvalue. 


To understand the method of undetermined coefficients better, we recommend the reader go over the following 
examples; however, before doing this, we must first make some observations. It is convenient to introduce the 
derivative operator D (where Dg = dg/dt = g for any differentiable function g) and rewrite the given vector equation 
(8.4.1) in operator form: 

(DI — A) y(t) = F(t), 
where I is the identity matrix. If we choose its solution as a power function y(t) = at? with some constant column 


vector a, then 
(DI — A) at? = apt?~! — Aat?. 


The right-hand side is always a polynomial in t of degree p unless Aa = 0, that is, a belongs to the kernel of the 
matrix A (consult §7.2.1). So, if the matrix in Eq. (8.4.1) is not singular (det A # 0) and y(t) is a polynomial in t, 
then (DI — A) y is a polynomial in ¢ of the same degree as the vector y(t). 


Example 8.4.1: Using the method of undetermined coefficients, solve the system of differential equations 
ve =xt+2yt+t, y =22e+y-t. 


Solution. This system can be rewritten in a vector form: 


. act dy(t) _ _ {1 2 = 1 
Since the eigenvalues of the matrix A are 41; = —1 and A2 = 3, and the control number (consult §4.7.2) of the 


column vector f is 0, we are looking for a particular solution as a linear function: 
yp(t) =a+tb, 


where a and b are some constant column vectors to be determined. Substitution into the given system of equations 
leads to 
y=b=Aa+tAb+fi(t). 


Equating coefficients of like powers of t, we obtain 
b=Aa and Ab+f/t=0. 


So 
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oes (t 3)ED-G) 


Therefore, a particular solution becomes 
-1 1 
Ei] [3] 


Now we solve the given vector equation using the variation of parameters method. First, we determine the exponential 
matrix using Sylvester’s method: 


and 


= A-83 1 
eAt =. t Z(-1) + ert Z(3); Z(—1) — —=3 — 3 


A particular solution of this system of differential equations is 
y(t) = eA*u(t), where u(t) = / eAtE(t) dt = 2] i te’ dt = 2] e(t —1), 


because Z3)f(t) = 0. Therefore, 
- 1])/1 -1 1 al |1 1 th es 1 
vn=u-n8 2, 3 [afeened LS] <0-0 [3] 
Example 8.4.2: We consider a problem of finding a particular solution to the nonhomogeneous vector equation 


y(t) =Ay(t)+f(t), A= | ; ; | , f(t)= 7| ae ' 


Solution. We break f(t) into the sum of a polynomial (having a control number o = 0) and the exponential 
function (with control number o = —2): 


= —5 0 7 _ 24 

F(t) = | ft }+7| 1 |f7| 0 on, 
Since the eigenvalues of the matrix A are \; = 5 and Ay = —2, the second eigenvalue matches the control number 
og = —2 of the exponential term. Therefore, we seek a particular solution y,(t) of the given nonhomogeneous vector 


differential equation in a similar form as the forcing function f(t) and add an additional term to incorporate this 
match: 
yp(t)=atbt+ce *+dte™, 


where a, b, c, and d are some vectors to be determined later. Differentiation yields 
Yp(t) =b—2ce-* + de * — 2dte™* = b+ (d— 2c) e~™* — 2dte~™. 

We substitute the assumed solution into the given inhomogeneous vector differential equation and get 
b + (d — 2c) e~* — 2dte~** = Aa+ Abt + Ace~* + Adte~*! + f(t). 


By collecting similar terms, we obtain the following algebraic equations for a, b, c, and d: 


—35 
b=Aa+t | bs (8.4.2) 
Abs | ols (8.4.3) 
als A, 
d— 2c = Ac+ | z | : (8.4.4) 


—2d = Ad. (8.4.5) 
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From Eq. (8.4.3), we find the vector b: 


eee ciara 


because 


Solving Eq. (8.4.2), we obtain 


2 eed _{[-385]\)_ 7 | 2-4} } 1} 7 | 388] _ |-266 
os (» E — "Fo [3 1) |-9} ~~ 70 |-12| ~ | 8.4 |” 
From Eq. (8.4.5), it follows that d is an eigenvector of the matrix A corresponding to the eigenvalue \ = —2. 
Therefore d = a(4,—3)", where a is any nonzero constant. From Eq. (8.4.4), we obtain 


ame-e-[8]-o[4]-[8] 


This system of algebraic equations can be rewritten in the form 
3 4 4a — 49 
[3 4]en[ 52]. 40) 


Since the matrix B= A+ 2I= | : : | is singular, the algebraic equation (8.4.6) has a nontrivial solution if and 


only if the right-hand side vector (4a — 49, —3a)" is orthogonal to solutions of the adjoint problem: 

B’z=0 or F [== (al: 
Hence, z = (21,21) = 21(1,—1)” is the general solution of B’z = 0 with arbitrary constant 2;, and the vector 
(4a — 49, -3a)? must be orthogonal to the vector (1,—1)7: 

(4a — 49, —3a) 1 (1, -1) => da — 49 + 3a = 0. 
The latter defines the value of a to be a = 7, which finally identifies the vectors d = 7(4, —3)7 = (28, —21)” and 
—7 —4 
=[o]+e[3] 

where cz is an arbitrary constant. Note that the vector c2(—4, 3)" is an eigenvector of the matrix A corresponding to 


the eigenvalue \ = —2. Therefore, this vector can be included in the general solution of the homogeneous equation 
y(t) = Ay(t). Finally, we get the general solution: 


_ [-26.6 eed = ee i oe ls =e 
vi =[aa +7] e+ [el ¢ +74] te +alife +c 3 |e 


1. In exercises (a) through (d), apply the technique of undetermined coefficients to find a particular solution of the 
nonhomogeneous system of equations y(t) = Ay/(t) + f(t), given the constant 2 x 2 matrix A and the forcing vector 
function f(t). 


1 5 10 sinht 10 -9 Je" 
(a) Fe 3, mie Be a ) F | t= > S| 


ow) [% a]. e=[rel: @) [Fy] t=[am]- 
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8.5 The Laplace Transformation 


Consider a nonhomogeneous system of ordinary differential equations with constant coefficients subject to the 
initial condition: 
y(t) = Ay(t)+f(t), — y(0) = yo, (8.5.1) 


where y = dy/dt is the derivative with respect to t, and f(t) is a given vector function. For some matrices A and 
forcing terms f(t), the system (8.5.1) may have a unique constant or periodic solution with the property that all 
solutions of the system approach this unique solution as t +> +00. This constant or periodic solution is called the 
steady state solution. We analyze when it happens using the Laplace transform. Note that the initial conditions 
are irrelevant in the long run because eventually any solution will look like a steady state trajectory. 

Recall that the Laplace transform of a function y(t) is denoted as 


eiy)(a) =") # | * eMy(t) dt. (8.5.2) 


Since the Laplace transform of the derivative is L[y] = Ay” — y(0), we reduce the given initial value problem (8.5.1) 
to the algebraic vector problem 


Ay’ —y(0)=Ay" +f" or (AT—A)y” =f" +y(0), 
where I is the identity square matrix. Solving the above system of algebraic equations, we get 
y” = (AI— A)! (y(0) + £") = yx +z, (8.5.3) 
where R)(A) = (AI— A)! is the resolvent to the matrix A, and 
yh =(AI—A)"y0), yp = (AL- A) £”. 
Application of the inverse Laplace transform to both sides of Eq. (8.5.3) yields 
y(t) = £7" [Ry(A)] y(0) + £7 [Ry (A)f”] = yn) + yp). (8.5.4) 


In accordance with Theorem 8.6, page 434, the vector function y;,(t) = C~'[R)(A)] y(0) is the solution of the 
corresponding initial value problem for the homogeneous equation 


Yn(t) = Aynlt), — yn(0) = yo, (8.5.5) 
and the function y,(t) is the solution of the driven equation subject to the homogeneous initial condition 
Yp(t) = Ayp(t)+£(t), — yp(0) = 9. (8.5.6) 
The formula (8.2.4), page 442, gives us the explicit solution of the IVP (8.5.5) : 
yn(t) = e“* yo, 


which is equal to £~1[R)(A)] y(0). Therefore, we establish the relation between the resolvent Ry(A) of a constant 
square matrix A and the exponential matrix: 


eAt=cCl(R\(A)} or = R\(A) = (AI- A) = £ [e"). (8.5.7) 
Finally, the solution (8.5.4) of the initial value problem (8.5.1) is the sum of two functions: 


yn(t) =£-*[Ry(A)] y(0) = e** y(0), 
yp(t) =£~* [Ry(A) f”]. 


These formulas allow us to establish the existence of a steady state solution. 
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Theorem 8.17: If all eigenvalues of the constant matrix A have negative real parts, and the smooth forcing 
vector function f(t) is periodic with period T, then the system (8.5.1) has a unique periodic steady state solution 
and T is its period. 


Example 8.5.1: Let us consider the initial value problem (8.5.1) with the following matrices: 


21 10 —-2 0 0 
A=] -22 -l1l 2 and f(t) = |sint], yo= |0 
110 9 11, 0 0 


Since the resolvent of the matrix A is 

1 d? + 22+ 103 10\ + 92 —2(\+1) 
R)(A) = ———_..—_ —22(\ + 1) \? —10A— 11 2(A + 1) , 
(A+1) (7 +97) | q19y41012 = 9A+ 911-4? - 10-11 
we get from Eq. (8.5.9) that 


_, [| 120A + 92, 42 — 10\ — 11,9 4+. 911)" 
1 ee ee ee 


y(t) =f [R(A) £”] =L +1) (2 +92) (02 +1) 


Careful evaluation of the inverse Laplace transforms of each component of y(t) = (yi(t), yo(t), y3(t))” leads to 


1 1 1 
yi(t) = 5 et + rT (51 sint — 41 cost) + ah (9 cos 9t — 11 sin 9¢) , 
1 
yo(t) = 730 (9 cost — 99 sint — 9 cos9t + 11 sin 9¢) , 
11 1 1 
y3(t) = oy e¢ + 30 (460 sint — 451 cos t) + 720 (99 cos 9t — 20 sin 9t) , 


Obviously, the term tet (1,0, 1)" dies out, leaving the steady state solution, which has two periods: 27 caused 
by the driving term f(t) and 27/9 inherited from the eigenvalues +9j of the matrix A. 


Example 8.5.2: When the Laplace transformation is applied to a system of linear differential equations, it does 


not matter whether the given system is in normal form or not. Let us consider the following initial value problem: 
z—y =4y — 3x +sin(t), 
See Og) =1, yO)=-8 

24 + y = 2a — 4y — cos(t), 


Application of the Laplace transform to both sides of the differential equations gives 
da’ —1— dy +8 = dy” — 32% + (142) "," 
a? og ban” A, 


where x” and y” are Laplace transforms of the unknown functions x(t) and y(t), respectively; here L {sin(t)] = 
(1+ 7)~! and £[cos(t)] = A(1+ )* are Laplace transforms of the driving terms. Solving this system of 
algebraic equations with respect to 2” and y”, we get 

, 4—A+3\ z 18+ 19\ + 15\? + 24)? 


“C4300 = "GE Ceayd 


Application of the inverse Laplace transform provides the answer: 


1 
x(t) = 5 tee — 2 cost —sint|] H(t), 
1384 4, , 56 a3 


22 31 
) = |—— = —~ = cost — — sint| H(t 
y(t) re + ere gg cost — ge sin (t), 


where H(t) is the Heaviside function, Eq. (5.1.5) on page 274. One can easily identify the transient part (exponential 
terms) and steady state part (trigonometric functions). 
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Example 8.5.3: (Discontinuous forcing input) Consider the following initial value problem with an intermittent 
forcing function: 


H)= Av) +H) +H), y@)=0, A= 7 29], 


where 
H(t) — H(t -1) ‘ 
fi(t) = 0 » Alt) = | sint [H(t) — H(t — 7)| | | 


Here H(t) is the Heaviside function, Eq. (5.1.5) on page 274. Application of the Laplace transform to the given 
initial value problem yields the algebraic equation 


(L— A)y” =ff +42, 


where y” is the Laplace transform (8.5.2) of the unknown function y(t), and 


l1—e> fb 1 0 
0 ’ 2 142 J—er**]" 


Calculations show that the resolvent Ry(A) = (AI— A)~' of the matrix A is 


= At a” ee A+2 —5 
R,(A) = L[e j@-=| eve dt= 347 i An? |* 


Therefore, 


y” = R)(A) (fy + £2) 
1 A+2 _ 1 —5 —\n 
sory PT] O-2) + aeeag la] O27: 


Let us introduce the auxiliary functions that are defined as the inverse Laplace transforms of the following expressions: 


g(t) =L7" Pees = (2+ sint — 2cost) H(t), 
m(t) = £7 |] = (1 00st) HE, 
gH] Cae =6§ (cost 1+ 5 sint) H(t), 


1 t 
no= cL eae = (2 cost + tsint —2+ 5 sint — 5 cost) H(t). 
Then we express the required solution through these functions: 


a -a-2))... InO~att—) 
y(t) = £~ [x] = ee =ase= i > Be 4 | : 


1. In each exercise (a) through (h), use the Laplace transform to solve the vector differential equation y(t) = Ay(t) +f 
subject to the initial condition y(0) = (1,1)7, where a constant 2 x 2 matrix A and the driving term f are specified. 


ie) 
| 
we 
ew 
| NT | 
ar) 
II 
eri! | 
ee 
ao 
oo 
om | 
+ 
es 
im Ge 
oo 
fa 
<— 
| rs: | 
Ve 
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wo ot 
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Mh 
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2. Use the method of Laplace transform to solve the given initial value problems. Here « and y denote derivatives with 


respect to ¢. 


et y—-Qy=e"*+e™, (0) = 1, 

(a) Qy — 3¢ — 4a + By =e —e**, y(0) =0 
(b) 34 — y + By = 9e** +1, x(0) =0, 
y+2¢—-3y-c=7t_-Ut; y(0) =3 
e+y—-—x+y=3sint+cost, x(0) = 0, 

(c) y — 2% —x — 2y=sint — 2 cost; y(0) =—-1. 
(a) &+2y —y = 12t - 3, x(0) = 0, 
Qy—&+y 4+ 6x = 6? +3; y(0) =1. 

34 + y — 6a = —6¢”, a(0)=1, #0) =2, 

fe) y+ 3% +y = —9e"*; y(0) =—3, y(0) =—12 
(t) & + 8y — 32 = 90, «(0)=—1, #(0) =2, 
y+ 3y — 3c =3 +90"; y(0)=—2, y(0) =0. 

&—3y +22 =-2, a(0)=—1, #(0) =0, 

(s) j+4—6y =0; y(0)=0, (0) =2. 
(h) €+y4+ 2x =5, x(0)=3, «(0)=-1, 
y+ 6t+y=t; y(0) =3, (0) = —3. 


3. In each exercise (a) through (h), use the Laplace transform to solve the vector differential equation y(t) = Ay(t)+f 
subject to the initial condition y(0) = (0,1,0)", where a constant 3 x 3 matrix A and the driving term f are specified. 


Bi Ds 4 27 et 
(a) [45 4 45], f= |34e™| ; 
4 2 8 76 
1 -9 0 32t 
(c) |-1 3 —3]/,f= | 16]; 
0 2 1 —16¢ 
3.2 4 22 
(e) |10 4 10], f=e' }21] ; 
4° 2 28 6 
1 —8 1 52 cost 
(g -l1 3 —3/, f=] 26 sint 
1 2 1 —52 sint 


) 


3 4 10 15t 
(b) |10 1 —10], f = |45¢] ; 
2 2 -5 0 
—-4 8 4 8e! 
(d) |3 1 3],f=] 0]; 
-1 1 -9 et 
3 4 -6 10 
(f) [6 5 -9|,f=e'] 8]; 
Dy A hi 3 
-4 8 4 117 
(h) | % 1 71),f=e°| 56 
-1 5 -9 17 


4. For each of the following matrices and given vector function f, solve the inhomogeneous system of equations of second 
order x(t) + Ax(t) = f(t) using Laplace transform. Initial conditions can be chosen homogeneous. 


(a) fe li 


(©) ik al 
© [Sa 
(8) ie on 


(b) K "| _ fs eal 
o Bee 
pense 
ee 
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8.6 Second Order Linear Systems 


For a constant n x n matrix A, let us consider a second order vector equation 


—,+Ax=0 or x+Ax =O, (8.6.1) 


where x(t) is a vector function with n components to be determined. Recall that all vectors are assumed to be 
column vectors. Any nonsingular matrix ®(t) that satisfies the matrix differential equation 

$+ A&=0 (8.6.2) 
is called the fundamental matrix for the vector equation (8.6.1). When a fundamental matrix is known, the 


general solution of the vector equation (8.6.1) can be expressed through this matrix. 


Theorem 8.18: If ®(¢) is a fundamental matrix for Eq. (8.6.1), then for any two constant column vectors 
a = (a1, @2,...,@n)? and b = (bj, b2,...,bn)", the n-vector u = ®(t)a + ®(t)b is the general solution of the 
vector equation (8.6.1). 


Fortunately, a fundamental matrix can be constructed explicitly when A is a constant matrix in the system 
(8.6.1). 


Theorem 8.19: Any solution of the vector equation (8.6.1) can be represented as 


sin (tvA) 


x(t) = cos («vA) x(0) + = x00) (8.6.3) 


where x(0) and x(0) are specified initial column vectors. 


Applying the Laplace transform to the initial value problem 
x+Ax=0O, x(0) = XQ, x(0) = vo, 


we obtain 


Mx” — x(0) —Ax(0)+ Ax?’ = 


0, 
where x“(A) is the Laplace transform of the unknown vector function x(t). The above algebraic system of equations 
is not hard to solve: 


x 1 
L : 
— ipa EA 
Application of the inverse Laplace transform yields Eq. (8.6.3) because 
sin (tvA) 7 Pa | 1 = 1 co} | il 1 
VA MI+A]  25/A M—jVA AL+jVAI’ 


r 1 1 1 
Wa) =£" | aA a) =$6" |S taal. B 
cos ( ) MI+A| 2 M—-jVA A+jVA 


Here we used factorization \7I+ A = (1 - jvA) (1 +jVA) , assuming that a root V/A exists. As usual, j denotes 


(0). 


the unit vector in the positive vertical direction in the complex plane, so j? = —1. The matrix equation (8.6.2) has 
two linearly independent fundamental matrices 
-1 

&(1)=(VA) sin(tVA), — &3(t) =cos(tVA), (8.6.4) 


each of which is a derivative of another: @2(t) = ®; and ®,(t) = —A®». Both matrices do not depend which root 
VA has been chosen and they are solutions of the second order matrix differential equation (8.6.2), subject to the 
initial conditions 

6,(0) =I, #,(0)=0 and ,(0)=0, &,(0)=I, 
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where I is the identity n x n matrix and 0 is the zero square matrix. Using these two fundamental matrices (8.6.4), 
we solve the initial value problem 


x+ Ax = 0, x(0) = Xo, x(0) = vo 


explicitly: 
x(t) = Bo(t) xo + Pi(t) vo. (8.6.5) 


Now we turn our attention to the initial value problem for a driven vector equation 
x+Ax =f, x(0) =xo, x(0) =vo, (8.6.6) 


where xg and vo are specified vectors. Its solution is the sum of two vector functions: x(t) = x;,(t) + x,(t), where 
x),(t) is the complementary function, which is the solution to the IVP x + Ax = 0, x(0) = xo, x(0) = vo. The 
second term, x,(t), is a solution of the IVP (8.6.6) with x9 = 0 and vo = O. Since the complementary function is 
given explicitly by Eq. (8.6.3), our main concern is a particular solution x,(t). 

If the forcing vector function f(t) is an intermittent one, the most appropriate method to determine x,(t) is the 
Laplace transformation. Using it, we get 


-1 l L 
x,(t)=L reat | ; (8.6.7) 
If the driving function is smooth, it is convenient to apply the variation of parameter method. Note that the 
Lagrange method works for an arbitrary input, but practical application of the variation of parameters method 
becomes cumbersome when the forcing function f(t) is piecewise continuous. 
According to the variation of parameters method, we seek a particular solution of the nonhomogeneous equation 
in the form 
Xp(t) = ®, (t) uy (t) + ®,(t) Up2(t), (8.6.8) 


where the fundamental matrix functions ®)(¢) and ®2(t) are given in Eq. (8.6.4), and vector functions u;(t) and 
uz(t) are to be determined. Next we proceed in a way similar to the scalar case (see §4.8). This leads to a system 
of algebraic equations for first derivatives of u,(¢) and ug(t): 


Its solution is 
ti (t) = Bo(t) F(t), a(t) = — i (t) f(t) 
because their Wronskian is ®,(t)®»(t) — &,(t)#2(t) = 1. Integrating, we get 
sin (tVA) 


u(t) = ‘| cos (1vA) f(t)dt, u(t) =— / qe fat, (8.6.9) 


Substituting these formulas into Eq. (8.6.8), we obtain 
¢ sin (( — 7)VA) 
X,(t) = 

(= | Te 


As an example, consider the mechanical system of two bodies and three springs discussed in §6.1.2. It was shown 
(see page 345) that this system is modeled by the vector differential equation: 


f(r) dr = = [ sin (G = r)VA) f(r) dr. (8.6.10) 


dx} 
1 Ge (ki + kg)ai — koro = 0, 
drs 
Mz a + (ko + kg)t2 — kot = 0, 
dt 
where ki, kg, and ks are spring constants, and x1, x2 are displacements (from their equilibrium positions) of two 


bodies with masses m, and mz, respectively. 
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The given system of differential equations can be written either as the second order vector equation 


d?x kitke _ ko Q1 
Pres +Ax=0, with A= a coe la = | ne , (8.6.11) 
m2 m2 
or as the first order vector equation 

0 0 1 0 Yi 

dy : 0 0 0 1 Yo 

a 7 BY: with B= bt ag of Y= ly] > (8.6.12) 
Ae mths 9 0 yA 


where y| = X1, y2 = X2 are displacements of two bodies, and y3 = 41, y4 = £2 are their velocities. In this particular 
physical problem, the obtained matrix A is positive, which means that all its eigenvalues are positive. For a positive 
matrix, the square roots of its eigenvalues are called (circular) natural frequencies. Let 1, Az be two distinct 
positive eigenvalues of the matrix A given in Eq. (8.6.11), then the 4 x 4 matrix B has four distinct pure imaginary 
eigenvalues: +jw, and +jwo, where w; = V/A; and we = Wrz are natural frequencies, and the solution of Eq. (8.6.12) 
becomes 


y(t) =e” yo, (8.6.13) 
where 4-column vector yo is comprised from the initial displacement xp and the initial velocity vo: 


yo = (X0»V0)- 
Example 8.6.1: (A Multiple Spring-Mass System) Suppose that the spring constants for this system have 
the following values: k, = 8, kg = 2, kz = 8, and the masses are m = 2/3, mg = 1. Then the system of equations 
(8.6.11) becomes 


d?x 15 =3 a(t) 
qe +Ax=0, where A= | 2 10 | , x(t) = ee : (8.6.14) 
The matrix of this system is positive because its eigenvalues are \; = 16 and Ay = 9. The general solution 


is expressed through the formula (8.6.3), where x(0) = x9 = (a1,a@2)" is the initial displacement at t = 0 and 
x(0) = vo = (bi, b2)” is the initial velocity. Using the Sylvester auxiliary matrices 


| 6 | 


teen 6 | 2M aeeg Gla 


A-16 1/1 3 
9-16 7 


we construct two fundamental matrices: 


sin (¢ VA) sin (3t) sin (4t) 
®)(t) = er a Z9) + — Zv16)> 


®,(t) =cos(t VA) = cos(3t Z 9) + cos (4t) Z6). 
(9) (16) 


According Eq. (8.6.5), we express the solution of the initial value problem for the vector equation (8.6.14) explicitly: 


1/1 3 sin 3t 1/6 -3 sin 4t 
x()=2)) | (x0 cos3¥ + w 3 )+|8 1 | (x0 cosat + v Z i 


The given mechanical system has two natural frequencies w; = 3 and w2 = 4. Therefore, the physical system is 
comprised of two natural modes of oscillations. For instance, if the initial displacement is x(0) = xo = (7,0)7 and 
the initial velocity is x(0) = vo = (0, 28)7, the vector x(t) becomes 


x(t) = A cos 3t + | sin 3t + y cos 4t + | | sin 4t 


_ | V17 cos (3¢t — a) + 875 sin (4t — 8), 
| 2V17 cos (3t — a) — V5 sin (4t — 8), 
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where a = arccos (1/1/17) © 1.32582 and 8 = arccos (1/5) + 1.10715. If we transfer the given problem (8.6.14) to 
the first order vector equation (8.6.12), its solution is expressed through the exponential (4 x 4)-matrix B: 


1 3 0 0 6 -3 0 0 
Br cosst |2 6 0 0 cos4t |—-2 1 0 O 
F100 LB 7 0 0 6 -8 
0 0 2 6 0 O -2 1 
0 0 1/3 1 0 0 3/2 —3/4 
sin 3t | 0 0 2/3 2 sin4t | 0 0 -1/2 1/4 
7 |-3 -3 O 0 7 |-24 -12 0 0 
-6 -18 0 0 -8 —4 0 0 


The next application of eP* to the initial vector yo = (7,0,0, 28)" yields the same solution. 


Example 8.6.2: (Hypocycloid) Consider a plane curve generated by the trace of a fixed point on a small circle 
of radius r = 1 that rolls within a larger circle of radius R = 5. Such a curve is called hypocycloid. The coordinates 
of this curve are expressed as 


u(t) = 4 cos(t) + cos(4t), 
y(t) = 4 sin(t) — sin(4t), 


where t¢ is the angle of inclination of the center of the small circle. Indeed, an equation of a circle of radius r in polar 
coordinates is 
Z=2X+r cos#, y=yotr sind, 


where (20, yo) is its center. Since the small circle has radius r = 1 and it is rolled inside the large circle, its center 
is moved along the circumference of radius 4. Therefore, zo = 4 cos(t) and yo = 4 sin(t). Now we relate the angle 0 
with inclination ¢ of the line that connects the center of the small circle with the origin. A pivot point on the small 
circle starts at point S and moves along the hypocycloid curve by rotating in a negative direction with respect to 
its own center (Zo, yo). In polar coordinates, the length of any arc is proportional to the angle it ascribes. Since the 
center of the small circle moves in a positive direction with respect to the origin, and the small circle moves in the 
opposite direction, we get 6 = —(5t— t) = —4t. 

Now we show that the coordinates (2, y) of the hypocycloid are solutions of the following initial value problem: 


z— 3y+4e=0, y+3ar+4y =0, x(0)=5, y(0) = y(0) = (0) = 0. 
Upon application of the Laplace transform, we get 


Ma’ —5rA—3Ay" +42” =0, My” + 3Aac” — 154+ 4y” =0. 
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Solving this system of algebraic equations with respect to z’ and y’ and applying the inverse Laplace transform, 
we obtain the coordinates of the hypocycloid: 


60 
At +17A\2 + 16 


5A (A + 13) 


x(t) = co} soo =4cost+ cos(4t), y(t) = Lot | | =4sint — sin(4¢). 


Example 8.6.3: (Forced Oscillations) Consider a spring-mass system that consists of two masses m, and ma, 
connected by two springs to a wall, and there is an external periodic force, f(t) = sin(wt), acting on the mass, m, 
(see Fig. 8.14 on text page). 


Figure 8.14: Spring-mass system with external force. 


kay kg (a2 — #1) + f(t) kg (a2 — 21) 


Tm |—— 
Figure 8.15: The free-body diagram. 


Using Newton’s second law and referring to Fig. 8.15, we derive the system of linear second order equations that 
simulate the motion of the given spring-mass system: 


My L= kx, t ko (x9 X1) { f(t), Mot = —ko (x9 = x1). 


In the system, assume that k, = 8, kg = 4, m; = 1, and w = 2. However, we are going to choose the value of the 
second mass mz in such a way that in the resulting steady periodic motion the mass m, will oscillate due to its own 
natural frequencies. If we denote the ratio kz/m2 = 4/mg by a, the given system can be written in vector form: 


a fx 4 12 —4] |x] _ |sinwt 

dt? |y —a allyl — 0 |}? 
where the initial conditions can be chosen homogeneous. Application of the Laplace transform reduces the given 
vector differential equation to the algebraic system of equations: 

2.1 L L_ - 2D i Be 
Nay + 12x27 — dy =r et My” — ax + ay” = 0, 

where x’ and y” are Laplace transforms of unknown functions x(t) and y(t), respectively. The above system of 
algebraic equations can be solved without a problem, which gives 


LOW a+? LW a 
240? Bat(atlaete!’? 4 ~ 2402 8at(atl2 etry 
To find the inverse Laplace transforms of 2(\) and y”(A), we need to determine the points where their common 
denominator is zero. This denominator is a product of two terms, \? + w?, which is due to the oscillating external 
force f(t) = sinwt, and 8a + (a + 12)A? + \+, which defines natural frequencies of oscillations of the mechanical 
system. Taking the inverse Laplace transform, we obtain 


v(t) =au(t)+2n(t), — y(t) = y(t) + yn (2), 
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where %,,(t) and y(t) are the steady periodic terms caused by the external force f(t) = sinwt: 


2 s t) 
_(t) <2 5 i (w a) sin(w 
u(t) eae Ne w (8a — (12 + a)w? + w4)’ 
ae 
ae - thes sin(w 
Yo(t) = Pig Are w (8a — (12 + a)w? + w4)’ 


and the terms x#,(t) and y,(t) represent natural oscillations because they are sums of residues over all four pure 
imaginary roots +jw, (k = 1,2) of the equation 8a + (a + 12)\? + A* = 0: 


2 2 
Silt) = 2KS > Res x” (A) e**, yn(t) = 2R5— Res y”(A) e™. 
k=1 k=1 


= Jee — jwr 


When a = w? (so mz = 1), the first mass will not contain the oscillating part with frequency w and it will oscillate 
due to its own natural frequencies only, which are approximately 1.53073 and 3.69552; therefore, in this mechanical 
system, mass mz will neutralize the effect of the periodic force on the first mass. 

This example can be used to model the effect of an earthquake on a multi-story building. Typically, the structural 
elements in large buildings are made of steel, a highly elastic material. We suppose that the k-th floor of a building 
has mass m, and that successive floors are connected by an elastic connector whose effect resembles that of a spring. 
This example also has an electrical analogy that cable companies use to prevent unsubscribed users from seeing some 
TV channels. 


1. Consider a molecule of carbon dioxide (COz2) that consists of three atoms. Numbering the atoms from 1 to 3, starting 
from the left, we have three position coordinates 71, x2, and 73. We also assume that the atoms in the molecule act as 
three particles that are connected by springs. The atoms are in the positions, with equilibrium distances between them 
denoted as di2 and d23, called the equilibrium bond lengths. For small displacements, the force is proportional to the 
amount that the distances between the atoms differ from the equilibrium distance. Show that the equations of motion 
for the three atoms of masses m1, m2, and m3 are 

mii1 = ki2 (a2 — 41 — diz) 


Mote = ke3 (x3 — G2 — do3) — ki2 (w2 — 41 — diz), 


m3i3 = —k23 ("3 — V2 — daa). 


Show that this system of equations can be reduced to a homogeneous system 


fe Kaa 44) 

t=, wows 
kos k 

ie = (ys — ya) — (ve — mn), 
k23 

¥3 = ——— (ys — ya), 
m3 


where y1 = %1, yo = X2 — diz, and y3 = x3 — diz — deg. Rewrite this system in the vector form y(t) + Ay(t) = 0, 
where y(t) = (y1,y2,y3)". In reality, the numerical values of coefficients of the matrix A are always truncated rational 
numbers. For this particular example, choose m1 = 3, m2 = 15, m3 = 45/2, ki2 = 20 , ko3 = 45, di2 = 3, and d23 = 1. 


Find the general solution of the corresponding vector equation y(t) + Ay(t) = 0. 


9) SSS SSG 


A molecule of carbon dioxide 
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2. Construct two fundamental matrices for the given system of differential equations of the second order x + Ax = 0, 
where the square matrix A is specified. 


50 14 37 «1 8 4 14 10 
e) fe A py fe a (c) k | i?) Fe | 
50 14 50 «61 60 11 67 3 
(e) E ra (f) i ra ; (8) Ee = my) Ke | 
3. In each exercise, (a) through (d), solve the initial value problem. 
(a) £-2y=0, y+8r4+17y=0, 2(0)=0, «(0)=4 y(0)=15, ¥(0) = —32. 
(b) #+11l¢a4+10y=0, y+14%+15y=0, 2(0)=-1, 4(0)=0 y(0) =1, y(0) =6. 
(c) ¢+6x-—3y=0, y—5r+4y=0, 2x(0) =3, £(0)= 24, y(0) =5, y(0) =0. 
(d) #+15¢—-—2ly=0, y—14%+22y=0, 2(0)=5, (0) =3, y(0) =0, 9¥(0) =2. 
4. Consider a mechanical system of two masses m1 and m2 connected by three elastic springs with spring constants k1, ka, 


and k3, respectively (see Fig. 6.4, page 344). Find the natural frequencies of the mass-and-spring system and determine 
the general solution of the corresponding system of differential equations 


m,&+ kia — ke(y—2x) =0, m2y + kay + ke(y — 2) =0. 


(a) m1 =5, m2 =1, ki = 230, ko = 50, kg = 11. 
(b) my = 13, m2 =11, ky = 13, ko = 143, kg = 11. 
(c) my = 25, m2 = 23, ki = 25, ko = 575, kg = 23. 
(d) mi = 9, m2 = 20, ki = 306, ko = 180, k3 = 740. 
(e) mi = 31, m2 = 32, ki = 31, ke = 992, kg = 32. 
(f) M1 1, m2 i ky Le i) AO, kg i, 
(g) mi = 21, mz = 19, ky = 21, ko = 798, k3 = 19. 
(h) my 1, m2 1, ky 6, ko 2. kz 3. 
5. For each of the following matrices and given vector function f, solve the inhomogeneous system of equations x+ Ax = f. 
Initial conditions can be chosen homogeneous. 


oe) cof m fg). cof) 
© [2 a). so- feet @ [Eco F 
o Ea) co-f) © [af eo et] 


6. The Fermi-Pasta-Ulam (FPU) problem bears the name of the three scientists who were looking for a theoretical physics 
problem suitable for an investigation with one of the very first computers. The original idea, proposed by Enrico Fermi, 
was to simulate the one-dimensional analogue of atoms in a crystal: a long chain of particles linked by springs that 
obey Hooke’s law (a linear interaction), but with a weak nonlinear correction 


tin = (Un+1 + Un-1 — 2Un) +a [(un+1 — un)? — (Un — Un—1)°| ‘ (8.6.15) 


where a is a parameter. Solve the linear three-dimensional version of the FPU-system assuming that a = 0. 


7. Prove Theorem 8.18, page 469. 


Summary for Chapter 8 


1. The system of linear differential equations in normal form 


dx(t) 


ae P(t) x(t) + f(t) or x(t) = P(t) x(t) + f(t), (8.1.1) 
where P(t) is n x n matrix, is called a nonhomogeneous system of equations. The homogeneous system 
x(t) = P(t) x(t) (8.1.2) 


is called the complementary system for Eq. (8.1.1), and its solution is referred to as a complementary vector-function 
(which depends on n arbitrary constants). The column vector f(t) is usually called the nonhomogeneous or driving 
term or the forcing or input function. 
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10. 


Summary for Chapter 8 


Let xi(t), x2(t), ..., xXx(t) be a set of solution vectors of the homogeneous system (8.1.2) on an interval |a, b|. Their 
linear combination 

x(t) = c1 x1 (t) + co Xo(t) +--+ + en xx (8), 
where cj, i= 1,2,...,k, are arbitrary constants, is also a solution to Eq. (8.1.2) on the same interval. 


Let P(t) be an n x n matrix that is continuous on an open interval (a,b). Any set of n solutions x1(t), x2(t), ..., Xn(#) 
to the equation x(t) = P(t) x(t) that is linearly independent on the interval (a,b) is called a fundamental set of 
solutions (or fundamental solution set). The n x n matrix 


where its column vectors are solution vectors of the vector equation x(t) = P(t) x(t), is called a fundamental matrix 
if det X(t) £0. 


Let P(t) be an nxn matrix with entries p,;;(¢) that are continuous functions on some interval. Let xz(t), k = 1,2,...,n, 
be n column solutions to the vector differential equation x = P(t)x. Then for an n xn matrix formed from these column 
solutions 

X(t) = [xi(t), xe(t), ..., Xn(t)], 


we define its Wronskian as the determinant 


W(t) = det X(t) = C exp (/ trP(t) at), trP(t) = pir + p22 +++: + pan, 


where C’' is some constant. Therefore det X(t) is either never zero, if C # 0, or else identically zero, if C = 0. The above 
formula is named after Abel. 


Let X(t) be a fundamental matrix for the homogeneous linear system x = P(t)x. Then the unique solution of the 
initial value problem 


x = P(t) x(t), x(to) = Xo (8.1.8) 
is explicitly expressed through the propagator 
x(t) = ®(t, to) xo = X(t)X7" (to) xo. (8.1.9) 
Let A be an n X n matrix with constant entries and let y,(t) be the k-th column of the exponential matrix e“!. Then 
the vector functions yi(t), yo(t), ..., yn(t) are linearly independent. 
Let X(t) = [yi(¢), ye(t), ..., yn(t)] be a fundamental solution set for the vector differential equation y(t) = A y(t) 


with constant square matrix A. Then 
eA(—to) — K(t)K 7} (to). 


For any constant n x n matrix A, the vector 


y(t) =e c=cryi+coy2t+:::+enyn 


is the general solution of the linear vector differential equation y = A y(t). Here yx(t) is the k-th column of the 
exponential matrix ®(t) = e’ and c = (c1,c2,...,Cn)? is a constant column vector with entries cy, k = 1,2,...,n. 
Moreover, the column vector 


satisfies the initial condition y(0) = xo. 


If all solutions of the homogeneous linear vector differential equations with constant coefficients 


y(t) = Ay(t), (8.2.1) 


where A is an n xX n constant matrix, that starts in a small neighborhood of the origin approach zero when t is large, 
we call the origin the attractor or sink. If opposite, all solutions leave the origin, we call it a repeller or source and 
refer to the origin as unstable. 


A critical point x* of an autonomous vector equation x = f(x) is stable if all solutions that start sufficiently close to 
x* remain close to it. A critical point is said to be asymptotically stable if all solutions originated in a neighborhood 
of x* approach it. There are four kinds of stability: 


e A center is stable, but not asymptotically stable. 


e A sink is asymptotically stable. 
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e A source is unstable and all trajectories recede from the critical point. 


e A saddle point is unstable, although some trajectories are drawn to the critical point and other trajectories recede. 


11. There are several types of critical points: 


e Proper node (stable or unstable) 

e Improper node (stable or unstable) 
e Spiral (stable or unstable) 

e Center (always stable) 


e Saddle point (always unstable) 


12. A particular solution of the nonhomogeneous linear vector equation with variable coefficients 
y(t) =P(t)y +£(0), (8.3.1) 


can be obtained explicitly 


yt) =X) f XA Har= x [ X7!(r) £(r) dr, 


provided that the fundamental matrix X(t) is known. 


13. If A is a constant square matrix, a particular solution to the vector equation y(t) = A y(t) + f(¢) is 


t t 
woth = / eA) f(r) dr = et / ot ear 


to to 


14. The method of undetermined coefficients is a special technique used to find a particular solution of the driven vector 
equation x’(t) = Ax+f(t) with constant matrix A when the driving term f(t) is a linear combination (with constant 
vector coefficients) of products of polynomials, exponential functions, and sines and cosines. 


15. The resolvent of a constant square matrix A is the Laplace transform of the exponential matrix: R (A) = (AI— A)7* 
L le. 


16. The second order constant coefficient vector differential equation x + Ax = 0 has two fundamental matrices: 
AW? sin (ta-"”?) and cos (ta-¥?). Their definitions do not depend on the chosen root. 


Review Questions for Chapter 8 


Section 8.1 


1. In each exercise, verify that two given matrix functions, X(t) and Y(t), are fundamental matrices for the given vector 
differential equation x = P(t)x, with specified square matrix P(t). Find a constant nonsingular matrix C such that 


X(t) = Y(t)C. 
14+¢' -t1 t e 1[3e'—4t 2t—e' 
(a) P=| — S|. x=([2 wee =% Bet — 442-242 — et | 3 
1 ¢ -1 et -1 l1+e'/6 
r= 4 ce) a | ; == be —1/t |? 


1 [#72 +¢ 1 -t t-1 1—3¢t 5t/2—-1 
P=3|' } rm x= |p t ie va t — 3t?/2]° 


2. Prove that if x(t) = u(t) + jv(t) is a complex-valued solution of the vector equation x’(t) = P(t) x(t) (det P 4 0,) 
for a given real-valued square matrix P(t) with continuous coefficients, then its real part u(t) = Rex = Rx(t) and its 
imaginary part v(t) = Imx = Sx(t) are real-valued solutions to this equation. 


3. Show that X(t) is a fundamental matrix for the linear vector system x = P(t) x if and only if X~1(t) is a fundamental 
matrix for the system x7 = —x7 P(t). 


4. Suppose that x(t) is a solution of the n x n system x = P(t) x on some interval |a, b|, and that the n x n matrix A(t) 
is not singular (det A 4 0) and differentiable on |a,b|. Find a matrix B such that the function y = Ax is a solution of 
y = By on |a, d|. 
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5. Rewrite the second order Euler equation at? + bt y + cy = 0 in a vector form tx = Ax, with a constant matrix A. 


6. In exercises (a) through (h), find a general solution to the given Euler vector equation (8.1.14) for t > 0 with the given 
2x 2 matrix A. 


2 -1 1 -1 1 3 5. 3 
@ [Fool O) | 4) © [i 3) @) [> 4): 
3.2 1. <2 -1 7 1 6 
© ft 3): © (a) @ fy jp a) |p §): 
7. Let yi(t), ..., Yn(t) be solutions of the homogeneous equation corresponding to (8.1.10), and x1 = (yi, y4,--- ye VE F 
nay X= (Ge Vas oe ye Dye be solutions of the equivalent vector equation x = P(t) x, where the square matrix P(t) 
is given in Eq. (8.1.13). Show that the Wronskian of the set of functions {y1,..., yn} and the Wronskian of the column 
vectors X1,..., Xn are the same. 


Section 8.2 of Chapter 8 (Review) 


1. In each exercise (a) through (d), verify that the matrix function written in vector form Y(t) = [e*!'vi, e*?'va] is the 
fundamental matrix for the given vector differential equation y = Ay, with specified constant diagonalizable matrix A. 
Then find a constant nonsingular matrix C such that Y(t) = e@'C. 


—_ {-3 —15 _ { |cos6t — 8 sin 6t 3 cos 6t¢ + 2 sin 6t 
(a) a=|; 3 |. x= (5 eeaist sin 6t |): 


mab vol (AD) 
a“ woe (Gh fl) 


ipl, =8 aot 4 cos 2t 2 cos 2t + 2 sin 2t 
ts ; | ; a (es 2t + cos ‘| : sin 2t i 


2. Compute the propagator matrix e“! for each system y(t) = A y(t) given in problems (a) through (d). 


(a) t=2—3y, y = 4x — 12y; (c) =a2+y, y = —4z + by; 
(b) =a —- 2y, y = 2x — 3y; (d) & = 3x — 2y, y = 5a — 3y. 


3. In each of exercises (a) through (h): 


e Find the general solution of the given system of homogeneous equations and describe the behavior of its solutions 
as t + +00. 


e Draw a direction field and plot a few trajectories of the planar system. 


@ le al ) [75 3): eb ok @) [} =); 
© [t 7]: © |? 3]: Cara a) [} 22). 


4. Find the general solution of the homogeneous system of differential equations x = Ax for the given square matrix A 
and determine the stability or instability of the origin. 


@ |i 2); ) [oe 7]; ele 6 |e @ ioe 
@ |S Ta): Oe @ |p =) [7 ae 


5. In each of exercises (a) — (f), solve the initial value problem y = Ay, y(0) = yo for the given 3 x 3 matrix A and 3 x 1 
initial vector yo. 


2 1 0 1 14 66 —42 1 
(a) Jo 2 0], y(o)= 2]; (b) [4 24 -14], y(o) = |-1]; 
0 0 1 3 10 55 —33 —1 
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0 1 O 2, —2 0 3 1 
(c) [4 3 -4], yo)=] 2]; (a) |o 4 of, yo=| 2]; 
1 2 -1 —1 —6 0 7 —1 
306«(COO 2 1 1 2 O 2 
(e) |O 3 —-2), y(0)=]1]; (f) | 0 1 Of, y(0)=]-1 
2 —2 1 1 -3 3 °5 2 


6. Find the values of the real constant a for which x = 0 is the steady state of x = Ax. 
1 -a 2 -a —8 13 
Ok Oh el ES Th 


7. A matrix A is said to be nilpotent if there exists some positive integer p such that A? = 0. Show that the following 
2 x 2 matrices are nilpotent and plot the phase portraits for the corresponding vector equation y(t) = A y(t). 


@ a=[) 2): @a=[5 |) ja-[i oo: qa) a=[5 oj]: 
als a] a(t of wale a] mane 


8. Show that the matrix A from Example 8.2.14 is not nilpotent. Evaluate the trace and determinant of this matrix. 


9. Find the general solution of the homogeneous system of differential equations y = Ay for the given square matrix A 
and determine the stability or instability of the origin. 


> tk 2 t 2 2 —5 5 2 
(a) |-1 0 -9|; (by |=t =2 Oz (c) |-10 5 4]; 
0 0 1 i © 4 —20 10 9 
101 =i 12: 2 5 1 
(ad) |1 0 1], ik & -G: Sis Gi ia f ls 
011 —5 4 8 -4 -§ -3 
2 71 14 2 i =o 
(g) |-1 4 1]; (h) |2 1 4/; (i) {1 3 2I; 
Ss i 3 2 6 2 1 4 
4-3 1 5 42 8 a ae 
) j1 2 3]; (k) }1 2 3]; (i) |3 1 2 
ae i 2-8 4 2 3 


10. Draw a phase portrait for each of the following linear systems of linear differential equations y(t) = A y(t), where 


1 1 1 1 2 - 
@) a=[) jf: w) a=[4 4): © a=|i o]. 
Then solve these systems of equations. Is there a line y = ka that separates solutions into different categories of 


behavior? 


Section 8.3 of Chapter 8 (Review) 


1. In exercises (a) through (d), use the method of variation of parameters to find a particular solution to the vector 
equation y(t) = Ay(t) + f(t) with a constant 3 x 3 matrix A that satisfies the initial conditions y(0) = (1, 2,3)”. 


3-3 1 0 2 1 -1 5 sint 
(a) |O 2 2], f= |29e*!; (b) ]O 1 1], £= ]-10 cost] ; 
5 1 1 39e" Qe a 2 
—3. 2. 5 sin 2t —3 1 =8 a 
(c) | 1 —-5 -3), f= |5 cos2t| ; (dy [4 +2 2), fe /4 
-3 7 8 23 e* 4 -2 8 4 
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2. In exercises (a) through (f), use the method of variation of parameters to find a particular solution to the vector equation 
y(t) = Ay(t) + f(t) with a constant matrix A. 


-16 9 4 3e! -1 -4 0 12 e%# 
(a) |-14 7 4], f= |4e-*]; (b) |-1 1 -1], f=] e |; 
—38 18 Il 4e* —-2 4 -3 det 
—2 16 4 53 cos 2t -3 8 -5 3e" 
(c) |-1 6 1], f= 13 : (d) |3 -1 3], f= | 6e*]; 
6 1 2 5 8 -8 10 9e? 
2 -1 4 2 sint 4 -4 -4 5 sin 2t 
(e 1 -1 1), f= ]2 cost]; (f) 6 —-8 -12], f= ]|5 cos2t 
-1 1 -1 1 -4 6 9 1 


3. In exercises (a) through (f), find a particular solution to the Euler vector equation ty(t) = A y(t) + f(t). Before using 
the method of variation of parameters, find a complementary solution of the homogeneous equation t y(t) = Ay(t). 


of J. e-e[3} ofS) eof 
of gh el) off Gi) 
off). emf of 2) [a 


4. In exercises (a) through (d), find the solution to the Euler vector equation ty(t) = Ay(t) + f(t). Before using the 
method of variation of parameters, find a complementary solution of the homogeneous equation t y(t) = Ay/(t). 


1 5 O 4 —2 3 1 12¢° 
(a) |} 1 0 1], f=(]4/; (c) |-8 13 5],f=]| 9# |; 
i =? 2 4 11 -17 -6 6 
1 10 O 9 3. -8 —10 Ot 
(b) | 1 O 1], f=]¢]; (d) | -2 7 9 |, f= | 2¢? 
I =2 2 1? 2 -6 -8 30¢3 


Section 8.4 of Chapter 8 (Review) 


1. In exercises (a) through (f), use the method of undetermined coefficients to determine only the form of a particular for 
the system x(t) = Ax(t) + f(¢) with a given constant 3 x 3 matrix A, and specified vector f. 


1 12 -8 8 1 12 -8 78 
(a) |-1 9 -4|,f=e*|8]; (b) |-1 9 —4], f =e” |43]; 
—2 9 -2 1 —7 6 -7 77 
1 -1 -2 24 e! —20 ll 13 29 
(c 1 -2 -—3],f=] 4e |; (d) | 16 O —-8], f=e' |96/; 
-4 1 -1 4e —48 21 31 42 
3-3 -4 12 -3 8 -5 1 
(e 9 —§ +12), f>e'] 4/5 (f) |3 -1 3], £=3e%}1 
-4 6 9 =3 8 -8 10 1 


2. Using matrix algebra technique and the method of undetermined coefficients, solve the nonhomogeneous vector equations 
subject to the initial condition y(0) = (1,0)7. 


.  |8 13 8sint+ 12cost] | .  |t —5 St — 7] . 
a =(; ¥ly- 3sint — cost F (4) y=} elyt [tae] 
(b) y= [2 Z]y— [0-24]. (e) y= [8 ]y— [2-24]. 

~la 12)/%~ | 447]? Y= \9 8) |r+a5¢]? 

. [7 —-4 st [2] . ._[-7 4 64e7** 
(c) =e s]yte RE (f) pals s)y + |e 


3. In exercises (a) through (d), apply the technique of undetermined coefficients to find a particular solution of the 


nonhomogeneous system of equations y(t) = Ay(t) + f(t), given the constant 2 x 2 matrix A and the forcing vector 
function f(t). 
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(a) 3 er as soe] (<) kK =i ie ale 
(b) E ile = [ia e-a| (a) s =I; pe Be 


4. Apply the technique of undetermined coefficients to find a particular solution of the nonhomogeneous system of equations 
y(t) = Ay(t) + f(t), given the constant 3 x 3 matrix A and the forcing vector function f(t). 


| i —16 69 A 3 
(a) |4 2 -2],f=e'|] 2 |; (d) |-14 7 4],f=e'|] 2]; 
8 4 O =e al —38 18 11 —2 
1 4 40 4 —19 12 84 3 
(6) (2. =? 2), foe }2] 4 (e) 2 5 2], f=e' ]1]; 
y 4.3 4 -8 4A 33 
1 4 #1 1 10 —8 2 —16 
(c) 3 -6 -1], f=37e" JO] ; (fy) |-2 1 2), f=] 2 
a 1 2 2 10 —8 


Section 8.5 of Chapter 8 (Review) 


1. Solve the initial value problems with intermittent driving terms, containing the Heaviside function H(t) (see Definition 


5.3 on page 274). 


£=-2e+y+4+ H(t) -— H(t-1), x(0) = 3, 
a) § = 6x + 3y +3H(t) —3H(t—2); y(0) =1 
b) & = 3x — 2y +17 cos2t[H(t) — H(t— 7), x(0) = 2, 
y =4c —y4+17 sin2t[H(t) — H(t—7)]; y(0) =1 
& = 3x —2y4 2t[H(t)-— H(t— 1), x(0) = 2, 
°) 4 = 5a —3y + H(t) — H(t— 1); y(0) =0 
a) & = 3x — 2y+2 sint (H(t) -— H(t—7)], x(0) = 1, 
y = 2e —y+4 cost (H(t) — H(t—7)]; y(0) =1 


Section 8.6 of Chapter 8 (Review) 


1. Construct two fundamental matrices (8.6.4) for the given system of differential equations of the second order X+Ax = 0, 


where the square matrix A is specified. 


of obs oA ow f8 yh 
(e) te ne (f) BE ae (s) E Ar (h) Ei a 


2. Construct two fundamental matrices (8.6.4) for the given system of differential equations of the second order X+ Ax = 0, 
where the square matrix A is specified. 


tt 4 9 0 -9 it 2. O 
(a) |O0 1 —-11; (b) |0 9 —-9]; (c) |-2 4 -2 
—. if a a. 0 -1 1 


3. In each of exercises (a) through (d), solve the initial value problem. 


(a) #+15¢—Ty=0, 7—42x4+22y=0, x(0)=0, 4(0)=1, y(0)=—2, (0) =2. 
(b) #+15a—2y=0, —147r+22y=0, x(0) =2, &(0) =0, y(0) =—2, 4(0) =3. 
(c) #+32a—-y=0, y—2e+2y=0, 2(0)=1, 2(0)=2, y(0) =—3, 9(0)= 

(d) #+5¢—2y=0, y—8xe+5y=0, x(0)=1, £(0)=2, y(0) =2, y(0) =0. 
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4. Consider a plane curve, called hypocycloid, generated by the trace of a fixed point on a small circle of radius r that 
rolls within a larger circle of radius R > r. Show that the coordinates of this curve are solutions of the initial value 
problem 


&—(R-2r)y/r+(R—-1)a2/r =0, 


x(0) = R, «(0) = y(0) = 9(0) = O. 
y+ (R-2r)a/r+(R—-r)y/r =0, (0) (0) = 4(0) = 9(0) 
5. Consider a mechanical system of two masses m1 and m2 connected by three elastic springs with spring constants k1, ka, 
and kz, respectively (see Fig. 6.4, page 344). Find the natural frequencies of the mass-and-spring system and determine 
the general solution of the corresponding system of differential equations 


m,#+ kx — ke(y— 2x) =0, m2y+k3y + ke(y— 2x) =0. 


(a) m1 1, m2 3, ky 1, i) 6, kg 3. 

(b) My 7, m2 8, ky 7, ko 56, k3 8. 

(c) mi =7, m2 = 2, ki = 427, ko = 14, kg = 86. 
(d) M1 4, m2 11, ky 4, ko 44, kg 11. 

(e) mi =1, m2=1, ki = 34, ko =6, kz = 39. 
(f) mi =1, m2 =4, ki =57, ko =8, kg = 312. 
(g) mi =11, m2 =5, ki = 539, ko = 110, kg = 245. 
(h) mi = 34, m2 = 29, ki = 34, ko = 986, kg = 29. 
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Phase portrait of a damped pendulum 


Qualitative Theory of Differential 
Equations 


Nonlinear differential equations and systems of simultaneous nonlinear differential equations have been encountered 
in many applications. It is often difficult, if not impossible, to solve a given nonlinear differential equation or system 
of equations. This is not simply because ingenuity fails, but because the repertory of standard functions in terms 
of which solutions may be expressed is too limited to accommodate the variety of solutions to differential equations 
encountered in practice. Even if a solution can be found, its expression is often too complicated to display clearly 
the principal features of the solution. 

This chapter gives an introduction to qualitative theory of ordinary differential equations when properties of 
solutions can be determined without actually solving equations explicitly or implicitly. Its theory originated from 
the independent work of two mathematicians at the turn of 20th century, A. M. Lyapunov and H. Poincaré. We are 
often primarily interested in certain properties of the solutions as, for example, growing without bound as t > om, 
or approaching a finite limit, or having a periodic solution, and so on. Also, some attention is given to the influence 
of coefficients on the solutions of the systems. 


9.1 Autonomous Systems 


Some solution graphs and phase plots for nonautonomous vector equations x(t) = f(t, x) are so irregular that they 
display very little apparent order. Since it is hard to visualize solutions of nonautonomous equations even in the 
two-dimensional case, we concentrate our attention on autonomous vector differential equations in normal form: 


x = f(x), (9.1.1) 
where 
x(t) fila, 22,...,@n) 
x(t) = eer f(x) = eal) 
an(t) fn(@1,@2,--52n) 
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are n-column vectors, and a dot stands for the derivative, x = dx/dt, with respect to variable t, or time. The term 
autonomous means self-governing, justified by the absence of the time variable t in the vector function f(x). It is 
assumed that components of the vector function f(x) are continuously differentiable (or at least Lipschitz continuous) 
in some region of n-dimensional space. Then, according to the existence and uniqueness theorem 6.4, page 372, there 
exists a unique solution x(t) of the initial value problem 


x(t) = f(x), x(to) = Xo, (9.1.2) 


that is defined in some open interval containing tp. A maximum interval in which the solution exists is called the 
validity interval. Solutions to autonomous systems have a “time-shift immunity” in the sense that the function 
x(t — c) is a solution of the given system for an arbitrary c provided that x(t) is a solution. 


Definition 9.1: A point x* where all components of the rate vector function f are zeroes, f(x*) = 0, is called a 
critical point, or equilibrium point, of the autonomous system (9.1.1). The corresponding constant solution 
x(t) = x* is called an equilibrium or stationary solution. The set of all critical points is called the critical 
point set. 


Definition 9.2: A critical point x* of the autonomous system dx/dt = f(x) is called an isolated equilibrium 
point if there are no other stationary points arbitrarily closed to it. 


The concept of equilibrium plays a central role in various applied sciences, such as physics (especially mechanics), 
economics, engineering, transportation, sociology, chemistry, biology, and other fields. If one can formulate a problem 
as a mathematical model, its equilibrium solutions can be used for forecasting the future behavior of very complex 
systems and also for correcting the current state of the system under control. There is no supporting theory 
to find equilibria for all possible vector functions f(x) in Eq. (9.1.1). However, there is a rich library of special 
numerical methods for solving systems of nonlinear algebraic equations, including celebrated numerical methods 
such as Newton’s or Chebyshev’s methods, which are usually applied in combination with the bisection method. 
Nevertheless, sometimes numerical algorithms fail to determine all critical points due to a machine’s inability to 
perform exact calculations involving irrational numbers. As an alternative, computer algebra systems offer convenient 
codes to solve the equations, when possible, including symbolic solutions. 

Critical points of the autonomous equation x = f(x) are simultaneous solutions of the vector equation f(x) = 0. 
If f(x) = (fi(x), fo(x),---,fn(x))7 and x = (x1,22,...,2n)", then equilibrium points are solutions of the system 
of n algebraic equations 

fi(ai, Xa, see tn) => 0, 
0 


fo(a1,02,++-,@n) = (9.1.3) 


Prd Di My a 65 ae) = 0, 


with n unknowns £1, %2,..., Y,. The solution set of each equation fx (71, %2,...,U%n) =0, k = 1,2,...,n, is referred 
to as the k-th nullcline. Therefore, equilibrium or stationary points are intersections of all k-nullclines. Each k-th 
nullcline separates solutions into disjoint subsets: in one of them the k-th component of the vector solution always 
increase, while in the other subsets the k-th component decreases. 

The equilibrium points can be classified by the behavior of solutions in their neighborhoods. Imagine a circle with 
radius € around a critical point x*—it is the set of points x such that ||x—x*|| < ¢, where ||x|| = \/v? + 73 +---+22. 
Next imagine a second smaller circle with radius 6. Let us take a point xo in the 6-circle. If the trajectory starting 
at Xo leaves the e-circle for every ¢ > 0, then the critical point is unstable. If, however, for every ¢-circle we can 
find a 6-circle such that the orbit starting at an arbitrary point xp in the d-circle remains within the ¢-circle, the 
critical point is stable. Now we give definitions of such points, attributed to A. Lyapunov. 


Definition 9.3: Let x* be an isolated equilibrium point for the autonomous vector equation (9.1.1), that is, 
f(x*) = 0. Then the critical point x* is said to be stable if, given any ¢ > 0, there exists a 6 > 0 such that 
whenever the initial condition x(0) is inside a neighborhood of x* of distance 6, that is, 


|| x0) — x" || < 6, (9.1.4) 
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Example 9.1.1: The equilibrium points of the nonlinear 
system 

& =4a—y’, 

y=rr+y, 
are found by solving the algebraic system 

Ar — y? = 0, 

x+y =0. 
The z-nullcline is a parabola 4% = y?, and the y-nullcline 1 0 2 3 _ 
is the straight line y = —ax. Their intersection consists of 
two stationary points: the origin (0,0) and (4, —4). As seen Figure 9.1: Example 9.1.1. 


from Fig. 9.1, these nullclines separate the plane into sub- 
domains where one of the components increases or declines. 


then every solution x(t) of the initial value problem 


x= f(x), x(0) = xo 


satisfies || x(t) — x*|| < ¢ for all t > 0. 


Definition 9.4: A critical point x* is called asymptotically stable or an attractor if, in addition to being 
stable, there exists a 6 > 0 such that whenever the initial position satisfies Eq. (9.1.4), we have for every solution 
x(t) of the initial value problem (9.1.2): 
lim || x(t) — x*|| =0 => lim x(t) = x". 
t—+00 too 

If all solutions approach the equilibrium point, then the critical point is globally asymptotically stable. If 
instead there are solutions starting arbitrary close to x* that are infinitely often a fixed distance away from it, 
then the critical point is said to be unstable or a repeller. 


For an asymptotically stable point, trajectories approach the stationary point only when they happen to be in 
close proximity of the equilibrium. On the other hand, if a solution curve stays around the critical point, then it is 
stable, but not asymptotically stable. 


Definition 9.5: A set of states S C R” is called an invariant set if for all xo € S, the solution of the initial 
value problem (9.1.2) belongs to S for all t > to. 


Therefore, if an invariant set S contains the initial value, it must contain the entire solution trajectory from that 
point on. 


Definition 9.6: For every asymptotically stable critical point, its basin of attraction or the region of 


asymptotic stability consists of all initial points leading to long-time behavior that approaches that stationary 
point. A trajectory that bounds a basin of attraction is called a separatrix. 


9.1.1 Two-Dimensional Autonomous Equations 


In this subsection, we consider only autonomous systems, given by a pair of autonomous differential equations, 


&(t) = f(a(t), y(t), w(t) = g(a(t), vy), (9.1.6) 
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for unknown functions x(t) and y(t). This system of dimension 2 is called a planar autonomous system. By 
introducing 2 x 1 vector functions 
x(t) | | f(x,y) | 
x(t) = ; f(x) = : ; 
@) | y(t) ©) = | g(z,y) 


we can rewrite Eq. (9.1.6) in the vector form (9.1.1), page 483. A solution of such a system is a pair of points 
(x(t), y(t)) on the xy-plane, called a phase plane, for every t. As t increases, the point (a(t), y(t)) will trace out 
some path, which is called a trajectory of the system. Planetary orbits and particle paths in fluid flows have 
historically been prime examples of solution curves, so the terms orbit, path, or streamline are often used instead 
of trajectory. 

Let us consider a curve in a three-dimensional space parameterized by t +> (t, x(t), y(t)). At each point of this 
curve, there is a tangent vector, which is obtained by differentiating with respect to ¢: 


(1, £(t), 9) = (1, F(a(t), y)), 9(@@), y(4)))- 


This tangent vector can be computed at any point (t,x, y) in R® without knowing the solution to the system (9.1.6). 
Therefore, the set of such tangent vectors is called the direction field for the system (9.1.6). It is completely 
analogous to the definition of the direction field for a single equation. The solution curve to Eq. (9.1.6) must be 
tangent to the direction vector (1, f(x(to), y(to)), g(a(to), y(to))) at any point (to, x(to), y(to)). 

A picture that shows the critical points together with the collection of typical solution curves in the xy-plane is 
called a phase portrait or phase diagram. Usually, these trajectories are plotted with small arrows indicating 
the direction of traversal of a point on the solution curve. When it is not possible to attach arrows to streamlines, 
the phase portrait is plotted along with a corresponding direction field. Such pictures provide a great illumination 
of solution behaviors. See, for instance, the phase portrait of the damped pendulum on the opening page 483. 

While a solution of the planar system (9.1.6) is a pair of functions (a(t), y(£)), solutions to the autonomous 
systems can be depicted on one graph. Another way of visualizing the solutions to the autonomous system is to 
construct a slope or direction field in the xy-plane by drawing typical line segments having slope 

dy _ dy/dt _ g(x,y) 


dx da/dt f(x,y)’ 


(9.1.7) 


However, the trajectory of a solution to Eq. (9.1.7) in the xy-plane contains less information than the original graphs 
because t-dependence has been suppressed. To restore this information, we indicate the direction of time increase 
with arrowheads on the curve, which show the evolution of a point traveling along the trajectory. 
Equation (9.1.7) may have singular points where f(x,y) = 0 but g(z,y) #0. Points where f(x,y) and g(a, y) 
are both zero 
f(z,y)=0, g(x,y) =0 (9.1.8) 


are called points of equilibrium, or fixed points, or critical points of a system x = f(x). If (a*, y*) is a solution 
of Eq. (9.1.8), then 

at)=2", y(t)=y" 
are constant solutions of Eq. (9.1.6). In applied literature, it may be called a stationary point, steady state, or 
rest point. Assuming uniqueness of the initial value problem for the system (9.1.6), no other trajectory x(t) = 
(x(t), y(t)) in the phase plane can touch or cross an equilibrium point (a*,y*) or each other. 

The points where the nullclines cross are precisely the critical points. In some cases the complete picture of 
the solutions of Eq. (9.1.6) can be established just by considering the nullclines, steady states, and how the sign 
of dy/dx = y/« changes as we go between regions demarcated by nullclines. However, this level of detail may be 
insufficient, and we must study more carefully how solutions behave near a stationary point. For example, solutions 
may approach a steady state directly or may show an oscillatory behavior by spiraling into the equilibrium solution. 

A second order differential equation 

& = f(a,2,t) 
can be interpreted as an equation of motion for a mechanical system, in which x(t) represents displacement of 
a particle of unit mass at time t, & its velocity, % its acceleration, and f the applied force. After introducing a 
new variable y = &, the given second order equation % = f(z,#,t) becomes equivalent to the system of first order 
equations 
E=Y, y = f(z,y,t). 
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A mechanical system is in equilibrium if its state does not change with time. This implies that « = 0 at any constant 
solution. Such constant solutions are therefore the constant solutions (if any) of the equation 


Fe, 0, t) = 0. 
A typical nonautonomous equation models the damped linear oscillator with a harmonic forcing term 
G+ke+ wear = F cosut, 


in which f(x, %,t) = —k& —wax+ F coswt, where k, wo, w, and F are positive constants. We convert this equation 
to the system of first order equations: 


£=y, y = —ky — wert F cosut. (9.1.9) 


To visualize, we ask Maple to plot the corresponding phase portrait for some particular values of coefficients k = 0.1, 
2 2 
we =wi =4,P=3: 


with(DEtools): 
DE1:=diff (x(t) ,t)=y(t); DE2:=diff (y(t) ,t)=-.1*y(t)-4*x(t)+3*cos (4*t) ; 
phaseportrait([DE1, DE2],[x, y],t =-3..3, [[x(0)=1,y(0)=0], 
[x(0)=0,y(0)=2]] ,x=-2.2..2, color=blue, linecolor=black, stepsize=0.05) 
DEplot3d({DE1,DE2},{x(t),y(t)},t = 0 .. 11, [[x(0)=1,y(0)=0]], 

scene = [t, x, y], linecolor = black, stepsize = 0.005) 


Figure 9.2: Phase portrait of a damped harmonic oscillator, Eq. (9.1.9), plotted with Maple: (a) 3D plot and (b) its 
projection on the zy-plane. 


As we see from Fig. 9.2(b), projections of solutions on the xy-plane can cross each other, so the notion of direction 
field is no longer meaningful. Of course, we could make the equation autonomous by inserting a third variable, but 
we can’t plot three-dimensional phase portraits with this function. 

There are no equilibrium states for the system (9.1.9). Stationary points are not usually associated with nonau- 
tonomous equations, although they can occur, for instance, in the Mathieu equation 


&+(a+ 8 cost)a =0, 
in which the origin is an equilibrium state. 


Example 9.1.2: (Pendulum) We visualize the stability concepts and basin of attraction with an example of an 
oscillating pendulum, which consists of a bob of mass m that is attached to one end of a rigid, but weightless, rod 
of length @. The other end of the rod is supported at a point O, which we choose as the origin of the coordinate 
system associated with our problem. The rod is free to rotate with respect to the pivot O. If pendulum oscillations 
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occur in one plane, the position of the bob is uniquely determined by the angle @ of inclination of the rod and the 
downward vertical direction, with the counterclockwise direction taken as positive (see Fig. 6.5 on page 348). 

The motion of the bob is determined by two forces acting on it: the gravitational force mg, which acts downward, 
and the resistance force acting in a direction always contrary to the direction of motion. Assuming that the damping 
force is proportional to the velocity, the equation of motion for the pendulum can be obtained by equating the sum 
of the moments about pivot O, —@ (6 + mg sin 6), to the product of the pendulum’s moment of inertia (mé?) and 


angular acceleration (see details in [14]). The resulting equation becomes 
m6 = —¢ (kb + mg sin 6) or 64+76+w?sind =0, (9.1.10) 


where y = k/(mé), w? = g/€, g is the acceleration due to gravity, and k is the positive coefficient of proportionality 
used to model the damped force. 
We convert the pendulum equation into a system of two first order autonomous equations by introducing new 
variables: + = 8 and y = 6. Then 
f=y, y=—w*sing— yy. (9.1.11) 
The critical points of Eq. (9.1.11) are found by solving the equations 


y = 0, —w*sing-—yy=0 <> sinzg=0. 


SS 
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Figure 9.3: Example 9.1.2: The basin of attraction for a damped pendulum, plotted with MATLAB®. 


Therefore, we have countably many critical points (n7,0), n = 0,+1,+2,..., but they correspond to only two 
distinct physical equilibrium positions of the bob: either downward (x = 6 = 0) or upward (x = 6 = 7m). If the 
mass is slightly displaced from the lower equilibrium position, the bob will oscillate back and forth with gradually 
decreasing amplitude due to dissipation of energy caused by the resistance force. Since the bob will eventually stop 
in the downward position, this type of motion illustrates asymptotic stability. 

On the other hand, if the mass is slightly displaced from the upper equilibrium position 6 = 7, it will rapidly 
fall, under the influence of gravity, and will ultimately converge to the downward position. This type of motion 
illustrates the instability of the critical point x = 6 = 7. By plotting the trajectories starting at various initial points 
in the phase plane, we obtain the phase portrait shown in the opening figure on page 483. A half of stationary 
points (2k7,0), where k = 0+1,+2,..., correspond to the downward stable position of the pendulum, and the other 
half, ((2k + 1)7,0), where k = 0+1,+2,..., correspond to the upward unstable position. Near each asymptotically 
stable critical point, the orbits are clockwise spirals that represent a decaying oscillation. The basins of attraction 
for asymptotically stable points are shown in Fig. 9.3. It is bounded by the trajectories that enter the two adjacent 
saddle points ((24 +1)z,0). The bounding streamlines are separatrices. Each asymptotically stable equilibrium 
point has its own region of asymptotic stability, which is bounded by the separatrices entering the two neighboring 
saddle points. 

Now we consider the case when the resistance force is proportional to the square of the velocity, —k6|6| instead 
of —k@. Then the modeling system of differential equations becomes 


g=y, y= -w’sing—yylyl. 
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This system has the same set of critical points (na,0), n = 0,+1,+2,..., half of them (for n even) stable, and the 
other half (odd n) unstable. The phase portrait presented in Fig. 9.4 confirms this. 

Finally, consider the ideal pendulum when the resistance force is neglected, which corresponds to the case 
when k = y = 0. If the bob is displaced slightly from its lower equilibrium position, it will oscillate indefinitely 
with constant amplitude about the equilibrium position « = 6 = 0. Since the damping force is absent, there is no 
dissipation of energy, and the mass will remain near the equilibrium position but will not approach it asymptotically. 
This type of motion is stable but not asymptotically stable (see Fig. 9.21 on page 516). 
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Figure 9.4: Example 9.1.2: Phase portrait for a damped pendulum when the resistance force is proportional to the 
velocity squared, plotted with Mathematica. 
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Figure 9.5: Example 9.1.3, phase portrait of the non- Figure 9.6: Basin of attraction for the spiral point 
linear system, plotted with Mathematica. (0,1) is shaded, plotted with Mathematica. 


Example 9.1.3: Locate the equilibrium points, and sketch the phase paths and the basin of attraction to the 
autonomous system of equations 


&=(24y—2)(l-y-a2/2), y=au(5+2y). 
Solution. The stationary points occur at the simultaneous solutions of 


(2y—2x)(1—y— 2/2) =0, x(5+2y) =0. 
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These are four solution pairs of the above algebraic system of equations: (0,0), (—5,—2.5), (0,1), (7,-2.5). By 
plotting a direction field and critical points, we see from Fig. 9.5 that (0, 1) is a spiral point, (—5, —2.5) is an attractor 
(asymptotically stable node), the origin is a saddle point, and (7,-2.5) is an unstable node. 


Problems| In all problems, dot stands for the derivative with respect to t. 


Locate all the equilibrium points for each of the systems in Problems 1 through 8. 


1 L=Yy, 9 & = 22 — 2? — 32y, 
| lga=aty’—2; © (y= 38y — 4y? — ay; 
C= —Y, é=y(l—-2’), 
10. <. 5 
y=a(4-c2 iF 


Th & =4a — 2? — cy, 


a 13. 


= 2(2— 
a a(2—y), iZ 
&=—4a-—a7%—-«xy, 
y= yy — 32); 

é=a(x7+2y—-1), 
y=y(l—y—32). 


15. 


1 


ee 


y = by — 4y? — cy; 


Each of the systems in Problems 17—19 has infinitely many equilibrium points. Find all of them. 


g=y’—-1, & = cos(y), & = sin(x) + cos(y), 
et (; = sin(x); (; = sin(z); He . = sin(x) + sin(y). 


In Problems 20 through 25, rewrite the given scalar differential equation as a first order system and find the equilibrium 
points. 


20. #-a? +4¢=0. 22. &— 24+ 202 = 0. 24. &+ (cosx)& +sina = 1. 
21. +524 3¢ = 0. 23. #-4eé+a° =0. 25. #+ ~i3y — Qn? =0. 


26. Compare the phase diagrams of the systems 


L=Y, £= vy, 
. and ‘ 
Y=, y =—-“. 
In Problems 27 through 30, use the information provided to determine the unspecified constants. 


27. The system 
&=ax+ 2xy, y = by — 3xy 


has equilibrium points at (a, y) = (0,0) and (1, 2). 


28. The system 
&= ax + bry + 3, y = cx + dy? —4 
has critical points at (x,y) = (3,0) and (1,1). 
29. Consider the system 
&=x+ay’, y=ba? +y—y’. 
The slopes of the phase-plane orbits passing through the points (x, y) = (1,2) and (—1,1) are 9 and 3, respectively. 


9.1. Autonomous Systems 


30. Consider the system 


& = ax” + by — 2, 


yacotyty’. 
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3l. 


The slopes of the phase-plane orbits passing through the points (x, y) = (1,2) and (—2,3) are 1 and 14/11, respectively. 


Consider a ball of radius R that can float on the water surface, so its density p is less than 1. 


(a) Suppose that the center of the ball is beyond k units of the water’s level when the ball is in equilibrium position. 
Find the algebraic equation of degree 3 for k using Archimedes’ law of buoyancy stating that the upward force 
acting upon an object at the given instant is the weight of the water displaced at that instant. 


(b) Suppose that the ball is disturbed from equilibrium at some instant. Its position will be identified by the distance 
z(t) of the center from its equilibrium at time t. Apply Newton’s second law of motion to show that the function 
z(t) is a solution of the differential equation (9.1.12), where g represents the acceleration due to gravity. Your 


(9.1.12) 


derivation should be based on equating the acting force mzZ, where m is the mass of the ball, to the net downward 
force (which is the ball weight minus the upward buoyant force). 


(c) Rewrite differential equation (9.1.12) as an equivalent two-dimensional first order system. 


In each of the systems in Problems 32 through 43: 


( 
(b 


) 
(c) 
) 


a) Locate all critical points. 


Use a computer to draw a direction field and phase portrait for the system. 


Determine whether each critical point is asymptotically stable, stable, or unstable. 


(d) Describe the basin of attraction for each asymptotically stable equilibrium solution. 

32. ¢=2(1-y), y=y(44+z2). 40. = (4+ a2)(3y—-2), y=(4-—c2)(y+ 52). 
33. £=(5+ y)Qr+y), y=y(3-2). 4l. ¢=(3-—2)(3y—2), y=y(2—x2-2”). 
34. @= 32-42? -—ay, y=4y—y? — day. 42. ¢=ax2(1-2y+32), y= 2y+ay—-z. 
35. ¢=34+y, y=1— 42’. 43. ¢=a2(1-2y+32), y=(1—y)(3+2). 
36. = y(38—-—2x-2y), y= —xe-—2y—cy. 44, ¢ =1+4+2y—2?, y=y-a. 

37. &¢=5ay—2, y=l0y—y?—2?. 45. & = 2y — 32? +4, y = 38y + 3a? —a. 
38. ¢=(34+2)(y—32), yoy(8+2r—-2°). 46. & =1+2y — az’, y=y—3e. 

39. ¢=y+4e, you—tar—hy. 47. &é=a+2y— 2”, y=yt4e’. 


48. 
49. 


50. 


51. 


The differential equation # + « — 4a + «* = 0 has three distinct constant solutions. What are they? 


Let f(x) be a continuous vector field in Eq. (9.1.1), page 483. Prove that if either of the limits lim:.4.. x(t) or 
limy—.o x(t) exists, then the limit vector is an equilibrium solution for the system (9.1.1). 


Consider the harmonic oscillator equation ¥ + sign(y) + y = 0 modified by the Coulomb friction. Here sign(v) gives —1, 
0, or 1 depending on whether v is negative, zero, or positive. Rewrite this differential equation as a system of first order 
differential equations. By plotting phase portraits show that the motion stops completely in finite time regardless of 
the initial conditions. 


Every nonautonomous system of differential equations can be transferred to “equivalent” autonomous system of equa- 


tions by introducing additional dependent variable. For example, consider the forced Duffing equation (or Duffing 
oscillator, named after Georg Wilhelm Christian Caspar Duffing (1861-1944)) 


Etakz—x+2° =A cosut, 
where a is the friction coefficient, A is the strength of the driving force which oscillates at a frequency w, t represents 


time and « = da/dt. Convert Duffing’s equation into autonomous system of first order differential equation upon 
introducing new variables v = « and 0 = wt. 
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9.2 Linearization 


We will restrict ourselves by considering isolated equilibrium points, that is, equilibrium solutions that do not have 
other critical points arbitrarily close to it. All functions are assumed to have as many derivatives as needed. 

Suppose f(21,%2,...,%n) is a real-valued function of n variables 71,272,...,@%p that takes values in R, i-e., 
f:R” 6 R. To illustrate the Taylor series expansion in the multi-dimensional case, we consider first f as a function 
of two variables. The Taylor series of second order about the point (a, b) is 


flay) ®& F(a,b) + fol a,b) (w — a) + fy(a,0) (ya) 
+5 Ufex(a,8) (w ~ a)? +2 fay(a,b) (@—a)(y~b) + fyy(a,0) (y— 6), 


where f,(a,b) denotes the partial derivative of f with respect to x evaluated at (a,b), etc. Now, let a = (a,b) and 
x = (x,y) be the 2-vectors in R?. Then the above expansion can be written in compact form as 


(x) = fla) + VF(a) (x—a) +5 (xa)? D*f (a) (xa) +, (9.2.1) 


where Vf(a) = (f2(a,b), fy(a,b)) is the gradient of f evaluated at a, and D? f(a) = [D,D;f(x)],_,, is the Hessian 


matrix’® of second partial derivatives evaluated at the point a (D; = 0/0z;). Once the vector notation is in use, the 
formula (9.2.1) becomes valid for arbitrary n-dimensional space. The last term in Eq. (9.2.1) is 


5 (x — a)? D?f(a) (x — a) = O (\|x—al?), 
where the O (||x||?) means the Landau symbol: 
g=O(llxI?) <=> Ig(x)| < C[|x|I? 


for some positive constant C. Recall that ||x|| is the norm of x, i-e., the distance of x from the origin: ||x||? = 
x? +23+---+a2. The second order term in Eq. (9.2.1) is bounded by a constant times the square of the distance of 
x from the point a. Actually, the formula (9.2.1) follows from the one-dimensional Taylor series if one will consider 
g (t(x — a)) and expand it into the Maclaurin series in t. 


Example 9.2.1: Consider a function of two variables f(x,y) = cos(a) sin(y). Using Maclaurin’s series for trigono- 
metric functions, we obtain 


_[oenre| [eur] 1, 1, 
f(x,y) = » » Geen | ~~ ge eget 


Suppose we want to find the Taylor series about the point (7,4). Evaluating the gradient Vf = (sin(z) sin(y), cos(x) 
cos(y)) at that point, we get its Taylor series expansion at (x, Zz) to be 


fey) =-14+5(@—a) +5 (y-Z) +. 


Let us consider the nonlinear vector differential equation x = f(x), where f(x) is a vector-valued function. To 
derive the Taylor series for a vector function f(x) = (f1,..., fn)” in a neighborhood of a point x*, we simply expand 
each of the functions f;(x), 7 =1,2,...,n, in a Taylor series using formula (9.2.1). The higher order terms are again 
more complicated but we disregard them and keep only terms of the first order: 


F(x) = F(x") + DE(x") (x — x") +0 (x — x"). 


Here Df(x*) is the Jacobian matrix evaluated at the point x*: 


[D f(x*)] ig S12 aon: (9.2.2) 


i ~ Ox; 


x=x* 


“6Named after a German mathematician, Ludwig Otto Hesse (1811-1874). 
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Thus, for x near x”, 
f(x) = f(x*) + Df(x*) (x — x*). 


This is the linearization of a vector-valued function of several variables. Notice that upon neglecting terms of higher 
than first order, the linear approximation will only potentially give a good indication of the vector function f(x) 
while ||x — x*|| remains small. 

Consider the autonomous equation for a smooth vector function f(x): 


x = f(x). (9.2.3) 


Suppose x* is an equilibrium point, meaning that f(x*) = 0. Therefore, if the solution starts at x*, it stays there 
forever. The question we want to answer is: What happens if we start near x*? Will it approach the equilibrium? 
Will it behave in some regular way? To answer these questions, we expand the function f(x) into the Taylor series 


x = f(x*)+Df(x*)(x—x*)+---. 
ae 


“co 


If we disregard the “---” terms, then 


x = Df(x*) (x — x*). 


This is the same as F 
— (x — x*) = Df(x*) (x — x”). 
& (= xt) DECK") (x — 2°) 
Upon introducing a new dependent variable y = x(t) — x*, the solutions of Eq. (9.2.3) may be approximated near 
the equilibrium point by the solutions of the linear system 


y=Jy, y(t) = x(t) — x", (9.2.4) 


where J = Df(x*) is the Jacobian matrix (9.2.2) evaluated at the equilibrium point. When will this be true? The 
answer gives us the linearization theorem, also known” as the Grobman—Hartman Theorem. It says that 
as long as Df(x*) is hyperbolic, meaning that none of its eigenvalues are purely imaginary, then the solutions of 
Eq. (9.2.3) may be mapped to solutions of Eq. (9.2.4) by a 1-to-1 and continuous function. In other words, the 
behavior of a dynamical system near a hyperbolic equilibrium point is qualitatively the same as the behavior of 
its linearization near this equilibrium point, so solutions of nonlinear vector differential equation (9.2.3) may be 
approximated by solutions of linear system (9.2.4) near x*, and the approximation is better the closer you get to 
x*. Equation (9.2.4) is called the linearization of the system (9.2.3) at the point x*. 

Note that the linearization procedure cannot determine the stability of an autonomous nonlinear system but only 
asymptotic stability or instability. A heuristic argument that the stability properties for a linearized system around 
a critical point should be the same as the stability properties of the original nonlinear system becomes more clear 
when we turn our attention to the plane case. 


9.2.1 Two-Dimensional Autonomous Equations 


The two-dimensional case is singled out first for detailed attention because of its relevance to the systems associated 
with the widely applicable second order autonomous equation a = f(x,«#). However, the most extreme instabilities 
occur for systems of dimensions more than one. 


Definition 9.7: Let the origin (0,0) be a critical point of the autonomous system 


eee F 


y =ca+dy4 


“TThis theorem was first proved in 1959 by the Russian mathematician David Matveevich Grobman (born in 1922) from Moscow 
University, student of Nemytskii. The next year, Philip Hartman (born in 1915) at Johns Hopkins University (USA) independently 
confirmed this result. 
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where a, b, c, d are constants and F(x, y), G(x, y) are continuous functions in a neighborhood about the origin. 
Assume that ad # bc so that the origin is an isolated critical point for the corresponding linear system, which 
we obtain by setting F = G =0. The system (9.2.5) is said to be almost linear near the origin if 


F(x, y) 


+0 and AD, as V/a2+y? > 0. 


We now state a classical result discovered by Poincaré” that relates the stability of an almost linear planar 
system to the stability at the origin of the corresponding linear system. 


Theorem 9.1: Let A,, A2 be the roots of the characteristic equation 


7 = (a+ d)X+ (ad — bc) —0 


d |a(t) a 0b] |x(t) 

eee = 9.2.6 

zal le ally ey 
corresponding to the almost linear system (9.2.5). Then the stability properties of the critical point at the 
origin for the almost linear system are the same as the stability properties of the origin for the corresponding 


linear system with one exception: When (a — d)? + 4bc = 0, the roots of the characteristic equation are purely 
imaginary, and the stability properties for the almost linear system cannot be deduced from the linear system. 


for the linear system 


Corollary 9.1: Let J* = J(x*) = Df(x*) be the Jacobian matrix (9.2.2) evaluated at the equilibrium point x*. 
Its characteristic polynomial reads as 
d? — (tr J*) A + det J* = 0. 


1. If det J* < 0, then J* has eigenvalues of opposite sign and x* is a saddle point. 


2. If det J* > O and the trace of the matrix J* is positive, then the real parts of the eigenvalues of J* are 
positive and x* is unstable. 


3. If det J* > 0 and the trace of the matrix J* is negative, then the real parts of the eigenvalues of J* are 
negative and x” is locally stable. 


4. If det J* > 0 and (tr J*)? > 4det J*, then x* is a node; and if (trJ*)? < 4det J*, then x* is a spiral. 


5. If for each (x, y) € R?, det I(x, y) > 0 and (tr J)(z, y) < 0, then x* is a global attractor. 


Example 9.2.2: Consider the system 
&= e+ y’, y = —2y + 42. 


Linearization of this system around (0,0) yields 


: 2 O 
x(t) =IJx(t), | where J(0,0) = F ‘) , x= | 
Phase portraits for the original nonlinear system and its linearization are presented in Figures 9.7 and 9.8, respec- 
tively. Since the eigenvalues of the matrix J(0,0) are real numbers of opposite signs, the origin is an unstable saddle 
point. 
The given system of differential equations has another critical point 


(a*,y") = (-271, 21/8) ~ (—0.793701, 1.25992). 


“8 Jules Henri Poincaré (1854-1912) was a French mathematician who made many original fundamental contributions to pure and 
applied mathematics, mathematical physics, and celestial mechanics. 


9.2. Linearization 495 


The Jacobian at this point 
os 2 2y 9. gale 
J(x*,y")= B 4) = |_ger _9 
r=a* ,y=y* 


has two purely imaginary eigenvalues +2jV/3. Therefore, this point is not hyperbolic, and linearization is inconclusive. 


Example 9.2.3: (Electric Circuit) Consider an electric circuit consisting of a capacitor, a resistor, and an 
inductor, in series. Suppose that these elements are connected in a closed loop. The effect of each component of 
the circuit is measured in terms of the relationship between current and voltage. An ideal model gives the following 
relations: 

vr =f(ir) (resistor), 

Lip =v, (inductor), 

Cic =ic (capacitor), 


where vg represents the voltage across the resistor, 7g represents the current through the resistor, and so on. The 
function f(x) is called the v-i characteristic of the resistor. For a passive resistor, the function f(x) has the same 
sign as x; however, in active resistor, f(x) and x have opposite signs. In the classical linear model of the RLC-circuit, 
it is assumed that f(a) = Ra, where R > 0 is the resistance. 

According to Kirchhoff’s current law, the sum of the currents flowing into a node equal the sum of the currents 
flowing out: 

ip =tL =1¢. 
Kirchhoff’s voltage law states that the sum of voltage drops along a closed loop must add up to zero: 
UR +v, +uc = 0. 

Assuming that f(x) = 23 + Rx, we introduce two new variables: 1 = ig =iz = ic and y = vo. Since vz is known 


to be vy = Liz = L &, we find ve = —vc — vy, = —y — Liz. This allows us to model such a circuit as the system of 
first order differential equations: 


Li = —-ax? — Rr-y, 
Cy =x. 


Since this system has the only one critical point, the origin, we linearize the system around this point, with the 
corresponding Jacobian 


te =—Rht—y, 


7 —, ge ae 1 
y= 2%; 


i/c 0 


Since the Jacobian matrix J has two real eigenvalues of different signs, 1,2 = -# oe x \/ R? + 4/C, the origin is 
an unstable saddle point. 


Example 9.2.4: (FitzHugh—Nagumo model) Neurons are cells in the body that transmit information to the 
brain by amplifying an incoming stimulus (electric charge input) and transmitting it to neighboring neurons, then 
turning off to be ready for the next stimulus. Neurons have fast and slow mechanisms to open ion channels in response 
to electrical charges. Neurons use changes of sodium and potassium ions across the cell membrane to amplify and 
transmit information. Voltage-gated channels exist for each kind of ion. They are closed in a resting neuron, but 
may be open in response to voltage differences. When a burst of positive charge enters the cell, making the potential 
less negative, the voltage-gated sodium channels open. Since there is an excess of sodium ions outside the cell, more 
sodium ions enter, increasing the potential until it eventually becomes positive. Next, a slow mechanism acts to 
open voltage-gated potassium channels. Both of these diminish the buildup of positive charge by blocking sodium 
ions from entering and allowing excess potassium ions to leave. When the potential decreases to or below the resting 
potential, these slow mechanisms turn off, and then the process can start over. If the electrical excitation reaches a 
sufficiently high level, called an action potential, the neuron fires and transmits the excitations to other neurons. 
The most successful and widely used model” of neurons has been developed from Hodgkin and Huxley’s 1952 
work. Using data from the giant squid axon, they applied a Markov kinetic approach to derive a realistic and 
biophysically sound four-dimensional model that bears their names. Their ideas have been extended and applied to 


"9The British neuroscientists Alan Hodgkin (1914-1998) and Andrew Huxley (1917-2012) were awarded the Nobel Prize in 1963. 
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Figure 9.8: Example 9.2.2, phase portrait of the lin- 
Figure 9.7: Example 9.2.2, phase portrait of the non- ear approximation at the origin, plotted with Math- 
linear system, plotted with Mathematica. ematica. 


0.0F 


Figure 9.9: Phase portrait in Example 9.2.4, plotted Figure 9.10: Nullclines in the FitzHugh-Nagumo 
with Mathematica. model, plotted with Mathematica. 


a wide variety of excitable cells. Sweeping simplifications to Hodgkin—Huxley were introduced®® by FitzHugh and 
Nagumo in 1961 and 1962: 


V=V-V?/3-WtIlext, W=a(V+b-cW), 
where experimentally estimated dimensionless positive parameters are a = 0.08, b = 0.7, c = 0.8, and Jext is an 
external stimulus current. Here V is the membrane potential and W denotes the strength of the blocking mechanism 
with W = 0 (turned off) when V = 0. 

To plot the phase portrait of the FitzHugh—Nagumo (FN for short) model, we first identify critical points by 
plotting V-nullcline, which is the N-shaped cubic curve obtained from the condition V =0, and W-nullcline, which 
is a straight line obtained from the condition W = 0. As seen from Fig. 9.10, the equilibrium point (V*,W*) is not 
at the origin (for instance, if [ox = 0.2, the critical point is at (—1.06939, —0.46174)). We can check the stability of 
the equilibria by linearizing around the critical point and computing the eigenvalues of the linear system 


dilo), 1-(v*)? -1] [v 

dt |w}] a —ac| |w|- 
The eigenvalues can be computed easily, obtaining two complex conjugate values A;,2 ~ —0.1 + 0.28j. Therefore, 
the origin is an asymptotically stable spiral point and the system will oscillate before reaching it. 


80Richard FitzHugh (1922-2007) from Johns Hopkins University created one of the most influential models of excitable dynamics. 
Jin-Ichi Nagumo (1926-1999) from the University of Tokyo, Japan, made fundamental contributions in the fields of nonlinear circuit 
theory, bioengineering, and mathematical biology. 
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It is convenient to scale variables, which leads to the following homogeneous system of equations: 


v=-v(v-—a)(v—-1)-u, w =e(v — wv), (9.2.7) 


where the parameter 0 < a < 1. The only equilibrium of the scaled FitzHugh—Nagumo system (9.2.7) is the origin. 
To analyze the response of the FN model on an instantaneous current pulse J, we replace the system (9.2.7) with 


v=-v(v—a)(vu-1)-—wt J, w = e(v — Ew). (9.2.8) 


When J increases, the equilibrium point (0,0) moves into the first quadrant of the (v, w)-plane. This equilibrium is 
asymptotically stable for small values of J but becomes unstable for larger values of J. This happens when the real 
part of the eigenvalues changes sign at two locations V{ = +./1— ac. Once the real part becomes zero and then 
positive, even infinitesimally small perturbations will become amplified and diverge away from the equilibrium. In 
this case, the model exhibits periodic (tonic spiking) activity (see Fig 9.9). 


9.2.2 Scalar Equations 


In this subsection, we illustrate linearization stability analysis for some second order nonlinear autonomous scalar 
differential equations = f(z,#). For any its solution x(t), the vector-valued function (x(t), «(t)) describes a path 
in the (a,#)—plane, called the Poincaré phase plane. 


Example 9.2.5: (Pendulum) The pendulum equation 6+ Y 6 +w2 sin@ = 0 can be rewritten as an autonomous 
(nonlinear) system of differential equations: 


L=y, y= —yy —w? sing. 
For small values of « = 0, we may replace sinz by «x to obtain a linear system of equations: 


w= y, y= -yy-w* 2, 


til etal Gp 


When + = 0, we obtain the ideal pendulum (linear) equation 6 +w?6 = 0, which can be converted to a system of 
equations of first order by substitution: y, = w6, yo = y1 = w6. Therefore, 


alee oll 


Example 9.2.6: (Duffing Equation) When sin@ is replaced by its two-term Taylor approximation, the ideal 
pendulum equation 6 + w? sin@ = 0 becomes 


which can be written in vector form: 


6+? — w? 63/6 = 0. (9.2.9) 


A standard mass-spring harmonic equation # + w?x = 0 is derived by applying Hooke’s law: the restoring force 


exerted by a spring under tension or compression is proportional to the displacement. This assumption is valid only 


2kd 
when displacements are small. For larger ones, a restoring force can be modeled by fr(a) = ——— tan (=). where 
1 


the value 6 represents the maximum amount of x the spring can be stretched or compressed. This leads to the 


nonlinear differential equation 


.  2kd Tx 
mé+— tan (7) =0 (-d <a <0), 


where m is the mass of a particle. Assuming that the ratio |72/(20)| is small, we can replace the tangent function 
by its two-term Maclaurin approximation: 


. 2kb (ra leraey3 
mé+ oe (E45 (=) }=o (9.2.10) 
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Equations (9.2.9) and (9.2.10) can be united into one equation, called*! the Duffing equation: 


#+ws + Bx* =0. (9.2.11) 


ad[ey]_ y 
dt | y | | —w2a — Ba? |° 


The term 8x° could be thought of as a small perturbation of the standard mass-spring harmonic oscillator equation. 
If 6 < 0, then we have a soft spring and the corresponding solution is no longer bounded. 

When 6 = 0, we get an ideal spring—mass harmonic oscillator. 

For 6 > 0, it is a hard spring, so its slope is always negative: 


It can be rewritten as a system: 


dy  dy/dt _ _ wa + Ba? 
dx  da/dt — y 


The general solution of the latter is 2(y? + wa?) + Bat = C. 


1. Convert the following single equation of the third order into a system of differential equations of the first order; then 
find all its critical points and classify them. 
(a) gl!” _ 6'69" nis 962 — 8; (b) g!” (0)? 9” 93 —_ 8; (c) gl!” = (Ch 4 93 = 1. 
2. Convert the following single equation of the second order into a system of differential equations of the first order; then 
find all its critical points and classify them. 


(a) 646464 =1; (d) 6+ 67 —40=0; (g) 6+604+6° =8; 
(b) 6+ 26/0+0 =2; (e) 6+060— 40 =0; (h) 6 = 6? — 96; 
(c) 6—0+cos6 = 0; (f) 6 — 20 —sind = 0; (i) 6+0-06*=0. 


3. Find linearization of the following systems at the origin. 


&=a+a7?4+2y, z= xcosy, 
(a) ah (d) i 


y=yty y=y(cosz—1). 
Oneaek On aahie 

y=ytysing. c=. 

= 7e¥ r= =z 1 (3 4 2 
CE ene (eee ae tae 

y=y(e”—-1). y=r+t—-3ly+o'y) 


4. In each system of nonlinear differential equations, determine all critical points, and then apply the linearization theorem 
to classify them. 


Co (oo 7 ed eta 
y=y(3-—2-y). y=y(34+2y). 
é&=2(1—y), e) Je =e(3—y—2ay), 

oy ae ee (e) eames. 

« Jt=ty-4 g=2(2-ay+y’), 

a ae ae ce ae oe 


5. Each system of linear differential equations has a single stationary point. Apply Theorem 9.1 to classify this critical 
point by type and stability. 


(a) f = —5r+8y+1, (b) {5 = —3r + by, 


y=—4e +Ty—1. y=—22+5y—1. 


81 The Duffing equation is named after the German electrical engineer Georg Duffing (1861-1944). 
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&=4a+y-2, z= 7x+6y+7, 
(c) 9. 2 

y= 3x+2y+1. y = 2a — Ay + 2. 

& = 4a — 10y + 2, & = 5x — 2y — 2, 
(Ce (g) 4, 

y=u—2y+l. y=ut+3yts. 

&=2-5y+1, & = 10x — 24y — 4, 
(e) 4. (hy 9 

ya=uts3y+1. y = 4a — 10y+ 2. 


6. Consider the linear system of differential equations containing a parameter e: 
“z= ex —Ay, y = 9x + ey. 


Show that the critical point (0,0) is (a) a stable spiral point if e < 0; (b) a center if € = 0; (c) an unstable spiral point 
if € > 0. The value of parameter € = 0 is called bifurcation. 


7. Building upon the FitzHugh—Nagumo model, Hindmarsh and Rose proposed in 1984 a model of neuronal activity 
described by three coupled first order differential equations: 


&=y+3a" x z+1I, y=1 5a" Y; z=r(a(e+2)-2), 


=a +y?+2? andr 10’. Find eigenvalues of the linearized system at the critical point when J = 1. 


where r? 


8. Show that the critical point x* of a single differential equation « = f(x) is asymptotically stable if f’(x*) < 0 and 
unstable if f’(x*) > 0. 


9. Consider a pendulum of length @ revolving about a vertical axis at constant speed wo (radians/sec) and swinging 
horizontally in the plane perpendicular to the rod. Denoting its lumped mass by m, we get the equation of motion (see 
details in [14]) 

6= wo sin 0 cos 6 — w’ sind — 76, 
where 6 is the angular displacement of the pendulum in the vertical direction, w? = g /£, g is the acceleration due to 
gravity, y = «/(m2), the damping force is assumed to be approximately proportional to the angular velocity with the 
coefficient «. Find all critical points depending on the bifurcation parameter wo and consider two cases when wo < w 
and when wo > w. 


10. Consider the linear system of differential equations containing a parameter € 
roe — 2, y= a — 3y. 
Find all bifurcation points for parameter € and analyze stability depending on the values of this parameter. 


11. Show that for arbitrary positive ¢, the origin is a global attractor for the system 


t=(e—-y)-ex, y=(r—y)?-ey. 


12. Show that the two systems 
t=a(et+y)-y, yaya? ty)+2 


and 


2 2 . 2 2 
b=-a(0?+y)-y,  p=0-y(o +94) 
both have the same linearizations at the origin, but that their phase portraits are qualitatively different. 


13. Show that the origin is not a hyperbolic critical point of the system « = y, y = —a°. By plotting the phase portrait, 
verify that the origin is a stable stationary point but not asymptotically stable. 


14. Consider the three systems 
: 2 : 3 i 2 
E=x +y, L=y-x, ee EE 
(a) 4. 9 (b) 4. 2 (Gh ag 
yrr-y, yrrry,; y=y —@. 
All three have an equilibrium point at (0,0). Which two systems have a phase portrait with the same “local picture” 


near the origin? 


15. Consider the system of two differential equations 
. 2 . 
C=“, Y=y-. 
(a) Show that the origin is the only one equilibrium solution of the given system. 


(b) Find eigenvalues of the linearized system around the origin—equlibrium solution. 


(c) Find the general solution to the given system of differential equations. 
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9.3. Population Models 


About 2,500 years ago, an ancient Greek philosopher, Heraclitus, stated that nature is always in a state of flux. 
Today, rapid change in nature is an idea we all accept, if not welcome. Building a successful mathematical model 
that can predict species’ population levels remains a great challenge. The models must be adjusted to each specific 
population—dynamic systems cannot be used blindly. A mathematical model must predict behavior that does not 
contradict valid observations, else it is flawed. 

A remarkable variety of population models are known and used to describe specific interactions between species. 
Their derivation is usually based on suppressing or ignoring other factors that do not play a significant role. We 
need to remember Einstein’s warning that “everything should be made as simple as possible, but not simpler.” 

In this section, we present some continuous models from population biology that are part of a larger class of 
dynamic models, called Kolmogorov’s systems. Such models take the form #; = «;fi(@1,...,@n) for i = 1,...,n, 
where n is the number of species and the smooth functions f; describe the per capita growth rate for the ith species. 
In the planar case, the orbits of Kolmogorov’s systems when starting on the axes stay on the axes, and interior 
trajectories cannot reach the axes in finite time. 

Kolmogorov’s systems provide examples of autonomous systems that model multi-species populations like several 
species of trout, who compete for food from the same resource pool, as well as foxes, wolves, and rabbits, who interact 
in a predator-prey environment. These models originated in the first part of the twentieth century through works®? 
by Alfred Lotka (1925), Vito Volterra (1920s), Georgii Gause (1934), and Andrew Kolmogorov (1936). Later these 
systems were extended, generalized, and adapted to ecological models and areas not related to biology (for instance, 
economics and criminology). 


9.3.1 Competing Species 


In the absence of competitors, it is reasonable to model their population growth via a logistic equation (developed 
by Belgian mathematician Pierre Verhulst in 1838): 


P =dP/dt =rP —aP?. 


Here P = P(t) is the population size at time t, r is the intrinsic growth rate, and a <r is a measure of the strength 
of resource limitations. The smaller a is, the more room there is to grow. However, Verhulst’s model does not take 
into account many other factors that affect population growth. Species do not exist in isolation of one another. The 
simple models of exponential and logistic growth fail to capture the fact that species can assist or emulate members 
of the same population and fight, exclude, or kill species from another population. 

The competition between two or more species for some limited resource is called interspecific competition. 
This limited resource can be food or nutrients, space, mates, nesting sites—anything for which demand is greater 
than supply. When one species is a better competitor, interspecific competition negatively influences the other 
species by reducing its population size, which in turn affects the growth rate of the competitor. To be more specific, 
let us start with the following example of competition between hardwood and softwood trees, which one can observe 
in any unmanaged piece of forest area. 


Example 9.3.1: Hardwood trees grow slowly, but are more durable, more resistant to disease, and produce more 
valuable timber. Softwood trees compete with the hardwoods by growing rapidly and consuming the available water 
and soil nutrients. 

Competition is caused by resource limitations. The presence of softwood trees limits the amount of sunlight, 
water, land, etc., available for the hardwood, and vice versa. The loss in growth rate due to competition depends on 
the size of both populations. A simple assumption is that this loss is proportional to the product of the two. Given 
these assumptions about population growth and competition, we would like to know whether one species will die 
out over time, or whether there exist equilibrium populations. 


82 Alfred James Lotka was born 1880 in Lemberg, Austria-Hungary (now Lvov, Ukraine), and died in 1949 in New York. Lotka’s parents 
were US nationals and he moved to the USA in 1902, where he pursued his carrier as mathematician, physical chemist, biophysicist, 
and statistician. Vito Volterra (1860-1940) was a distinguished Italian mathematician and physicist, famous for his contributions to 
mathematical biology and integral equations. Even though he was born in a very poor family, Vito became a professor of mechanics at 
the University of Turin in 1892 and then, in 1900, professor of mathematical physics at the University of Rome. Georgii Frantsevich 
Gause (1910-1986), was a Russian biologist. Later, Gause devoted most of his life to the research of antibiotics. Andrey Nikolaevich 
Kolmogorov (1903-1987) was one of the most famous Russian mathematicians of the 20th century. His accomplishments included solving 
two of Hilbert’s problems, founding the axioms of probability theory, and many others. 
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Let x(t) denote the population of hardwood trees at time ¢t and x(t) be the population of softwood trees. 
Assuming that state variables x, > 0, 22 > 0, we consider the following logistic model: 


ry =7T1X1—- airy — bix1 22, (9.3.1) 


Le = rg%2 — azr3 — bo%1 22. 


In the growth rate rpxy — apxyxR — DEEL; (k = 1,2), the first term, r,v,, represents unrestricted growth, the 
second term, a,27, represents the effect of competition within a population, and the third term, b,v,x;, models 
competition between populations of different species. When the model is in the equilibrium point, we say that the 
system is in steady state because it remains there forever. At this point, all of the rates of change are equal to zero, 
and all of the forces acting on the system are in balance. Our first step is to locate equilibrium solutions by solving 
the algebraic system of equations: 


= ayxt — bx, 22 = 0, (9.3.2) 


T2X2 — ax — box%1 22 = 0. 
Factoring out 21 from the first equation and x2 from the second, we find three obvious (extinguishing) solutions: 


(0, 0), 
(0, r/a2), 
(71/a1,0), 


and the fourth at the intersection of these two lines: 


ie = 41%, + b1 22, (9.3.3) 


r= box + a2%. 


If these two lines do not cross inside the first quadrant (21, 22 > 0), then there are only three equilibria. In this case, 
the two species cannot coexist in peaceful equilibrium, and at least one of them will die out. This case is referred to 
as the competitive exclusion principle, Gause’s law of competitive exclusion, or just Gause’s law because it was 
originally formulated by Gause in 1934 on the basis of experimental evidence. It states that two species competing 
for the same resources cannot coexist if other ecological factors are constant. For instance, when gray squirrels were 
introduced to Britain in about 30 sites between 1876 and 1929, they dominated native red squirrels, which led to 
their extinction. Another example gives a competition in the 20th century between the USA and USSR, which 
resulted in disintegration of the latter. However, there is considerable doubt about the universality of the Gause law 
because it is a consequence of linearity of per capita growth rates, and it is not a biological principle. 

Only nonnegative population sizes are meaningful. We are interested in knowing the conditions under which 
%1,%2 > 0 and whether there exist equilibrium populations. It is reasonable to assume that a, > by, for k = 1,2 
since the effect of competition between members of the same species should prevail over the competition between 
distinct species. Therefore, a,a2 — b1b2 > 0, and the solution of Eq. (9.3.3) is 


agr, — bre a1r2 — bery 


1 ay,ag — by bo’ 7 a a2 — bbe ( ) 
The conditions for coexistence become 
a2r, — bre 2 0, 
ayrg — ber, 2 0, 
or, in other words, 
r r r r 
=e end ey (9.3.5) 
a2 by ay be 


The ratios K = 1r,/a; and M = r2/ag are saturation levels or carrying capacities in the absence of competition 
between species: the population stops growing of its own accord. However, these definitions become confusing for 
organisms whose population dynamics are determined by the balance of reproduction and mortality processes (e.g., 
most insect populations). In this case, these ratios have no clear biological meaning. Similarly, if we neglect the 
factor of competition within a population, the net growth becomes 


ThLlk — betRX; = 0) (rz _ bpX;) , k,j = 1,2. 
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In this case, the ratio r,/b; represents the level of population j necessary to put an end to the growth of population k. 
Therefore, peaceful coexistence is observed when each population reaches the point where it limits its own saturation 
level before it reaches the point where it limits the competitor’s growth. 


Now we analyze stability of the system (9.3.1) using a linearization technique (see §9.2). We consider each 
equilibrium solution separately based on the properties of the corresponding Jacobian matrix: 


J (x1, 22) = 


Ty — 2a,x1 — bi x2 —b\ 41 | (9 3 6) 


—bor2 r2 — 2a2%2 = box 


I. The origin (0,0), with the Jacobian matrix (which sometimes is also called the community matrix) 


30,0) =| ap 


r2 
has to be an unstable node because it has two positive eigenvalues. 


II. (K,0), with K =1r,/a,. The Jacobian matrix is 


I(K,0) = | =r, —biri/ay | . 


0 (a1r2 — bers) /a4 
with one negative eigenvalue A; = —r; and another one Az = (aire — bert) /at. 


III. (0,M), with M =r2/az. The Jacobian matrix is 


—_ (ria —= bire)/a2 0 
J(0, M) = | —bar2/a2 —T2 ’ 


with real eigenvalues Ay = (1r1a2 — bir2)/ag and Ay = —r2 < 0. 


IV. (x},23), with the Jacobian matrix 


I(at, 23) = | ea ae bai | (9.3.7) 


—box5 rq — 2ag@> — box} 


We will distinguish four cases, corresponding to the four possible sign combinations for numerators in Eq. (9.3.4), 
agar, — byrg and ayr2 — bgr;, but ignoring the possibilities of x] = 0 and x} =0. As Problem 1, page 511, shows, it 
is not possible for the system (9.3.1) to have a spiral point or a center. 
bo r2 a2 : 34% : ase 
Case 1: — < — < — and ajaz — bjb2 > 0. The asymptotically stable critical point (xj, 23) is in the first 
ay ry 1 
quadrant of the phase plane. Since the Jacobian matrices for the equilibria (K,0) = (r1/a2,0) and (0,M) = 
(0,r2/a1) both have negative determinants, these stationary points are saddle points (see phase portrait in 
Fig. 9.11). 


If interspecific competition is not too strong, the two populations can cohabit, but at lower sizes than their 
respective saturation levels. While the species may coexist, the price that they pay for competing with each 
other is that they do not reach their carrying capacities when the other species are absent. 

Case 2: = < = < 2 and a,a2 — b1b2 < 0. The critical point (9.3.4) is in the first quadrant of the phase plane, 
but acide thie deceriniieint of the Jacobian matrix is negative (so the discriminant is positive and greater 
than its trace), this stationary point is a saddle point—see Eq. (8.2.13) on page 452. The equilibria (K,0) and 
(0, M) are both asymptotically stable nodes. The steady state (0,0) is unstable. The phase portrait in Fig. 
9.12 shows the separatrices (in black) going through (xj,x3). The separatrix splits the phase plane into two 
regions; interior trajectories above the separatrix go to the steady state (0,72/a2) and below the separatrix 
they approach the stationary point (r1/a1, 0). 


From an ecological point of view, interspecific competition is aggressive and ultimately one population wins, 
while the other is driven to extinction. The winner depends upon which has the starting advantage. 
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Figure 9.11: Phase portrait in Example 9.3.2, plot- Figure 9.12: Phase portrait in Example 9.3.3, plot- 
ted with Mathematica. ted with Mathematica. 

r2 a2 T2 be . Saban’ . : ‘ : 
Case 3: — < — and — < —. There is no equilibrium in the interior of the first quadrant. The stationary 


ry 1 rl ay 
point (K,0) is asymptotically stable, while the critical point (0, 1/) is a saddle point (Fig. 9.13). All orbits 
tend to (K,0) as t + oo, corresponding to extinction of the x2 species and survival of the x, species for all 
initial population sizes. 
a2 r2 Wp) r2 : ee : ae : a 
Case 4: 3 < — and — < —. This case is similar to the previous one—no equilibrium solutions inside of the 
-— as 1 TY a1 ry 
first quadrant. Now (4,0) is a saddle point while (0, 1/) is an asymptotically stable node (Fig. 9.14). 
Interspecific competition of one species dominates the other and, since the stable node in each case is globally 
stable, the species with the strongest competition always drives the other to extinction. | 


Conditions in cases 2 through 4 rule out the possibility of coexistence of two species. The experimental evidence is 
somewhat equivocal and there are known models of Kolmogorov’s type for which the Gause principle does not work. 


Example 9.3.2: (Case 1) Determine the outcome of a competition modeled by the system 
& = 2 (180-—32-y), y = y (100 — x — 2y). 
Solution. A coexisting equilibrium is found by solving the system of algebraic equations 
180 — 3x —y=0, 100 — x — 2y = 0. 
By eliminating one variable, we obtain the critical point (52,24). The Jacobian matrix at this point becomes 


—156 —52 | 


3(62,24) = | a as 


with two negative eigenvalues A = —102 + 2/1041. The other equilibrium points (0,0), (0,50), and (60,0) are 
unstable stationary points. 


Example 9.3.3: (Case 2) Determine the outcome of a competition modeled by the system 
&=a(2-—2x-y), y=y(3—4a—-y). 
Solution. The critical points are obtained from the system 
2—2¢—y=0, 3—4¢—y=0. 


This system has one stationary point (4,1) in the first quadrant, which is a saddle point. The equilibria (1,0) and 
(0,3) are asymptotically stable nodes, which correspond to the situation of extinction for one species. The origin is 
always an unstable node (see Fig. 9.12). 


504 Chapter 9. Qualitative Theory of Differential Equations 


EL LL aa 


uw 


VL LOE EE a 
FEE ee ee ee eee 
Oe ee ee ee ee ee 
WA ee lg i le le le ig ie ea he 
\ RA +-4-4- 2-2-2 ee ee es a a 
K Kn <<< <2 ee eee ee 
RKO eee 


‘ KKK Ae 4 ee eee 


VAR KN RAR 
VAR AKKAKARKA 
VAN ANNARKA 


4—-<— <<< <4 -4 - s 


VAR KKK KR RRR 
LVR KR KKK RRA 
VAN KKK RRR 

VAN KKK RRR 
\ 
\ 
\ 
A 
if 
} 
f 
t 
t 
t 
i 
t 
f 
f 
t 
t 
t 
f 
i 


WRRRRR RRR RRR HH 
WRK RRR RRR RRR 
PAK RRR RRR RRR 


PALRRRRKR RRR RRR RK 
PYAAR RR RRR RRR RR 


mp 


15 2 2.5 ol 1 2 3 4 5 


x 


Figure 9.13: Phase portrait in Example 9.3.4, plot- Figure 9.14: Phase portrait in Example 9.3.5, plot- 
ted with Maple. ted with Maple. 


Example 9.3.4: (Case 3) Determine the outcome of a competition modeled by the system 
b=a2(2-2e-y), y=y(l-3c-y). 


Solution. This system has only three critical points inside the first quadrant: (1,0), (0,1), and the origin. According 
to the Grobman—Hartman theorem (see §9.2), we linearize the given system in neighborhoods of every point. The 


2 _ 
0 -2 
this critical point is asymptotically stable, and all trajectories approach the carrying capacity. The Jacobian matrix 


point (1,0) is a proper node because the community matrix has a double negative eigenvalue. Therefore, 


1 0 : ; ‘ ee ; te . 
Ee E| at another stationary point (0,1) has two real eigenvalues of distinct signs, so it is a saddle point. 


Example 9.3.5: (Case 4) Determine the outcome of a competition modeled by the system 


b=a2(3-—32-2y), g=y(4—ac-y). 
Solution. This system has only three critical points inside the first quadrant: (1,0), (0,4), and the origin (which 
is unstable). The former is a saddle point (unstable) because the corresponding community matrix a Fa has 


positive and negative eigenvalues. Another stationary point (0,4) is asymptotically stable because the Jacobian at 


: .,. |-d 0 
this point is i A 


Only stable equilibria are significant because they represent the population sizes for cohabitation. Unstable 
stationary solutions cannot be observed in practice. A point in the phase space that is not an equilibrium point 
corresponds to population sizes that change with time. These points on the phase plane present snapshots of 
population sizes that are subject to flux. In this case, biologists expect population sizes of two species to undergo 
change until they reach approximately the observable values. 

Competitive interactions between organisms can have a great deal of influence on species evolution, the structuring 
of communities (which species coexist, which don’t, relative abundances, etc.), and the distributions of species (where 
they occur). Modeling these interactions provides a useful framework for predicting outcomes. 


9.3.2 Predator-Prey Equations 


This subsection is concerned with the functional dependence of one species on another, where the first species 
depends on the second for its food. Such a situation occurs when a predator lives off its prey or a parasite lives 
off its host, harming it and possibly causing death. A standard example is a population of robins and worms that 
cohabit an ecosystem. The robins eat the worms, which are their only source of food. A few examples of parasites 
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are tapeworms, fleas, and barnacles. Tapeworms are segmented flatworms that attach themselves to the insides of 
the intestines of humans and animals such as cows, pigs. 

In 1925, during a conversation with Vito Volterra, a young zoologist by the name®* of Umberto D’Ancona, 
Volterra’s future son-in-law, asked Vito to explain his observation that the proportion of predator fish caught in the 
Upper Adriatic sea was up from before, whereas the proportion of prey fish was down. This phenomenon was later 
predicted by one of Volterra’s models. In the same year, A. Lotka published a book titled “Elements of Physical 
Biology” where he utilized the same model, which now is known as the Lotka—Volterra model. It is interesting 
that the predator-prey model was initially proposed by Alfred Lotka in the theory of autocatalytic chemical reactions 
in 1910. 

We denote by a(t) and y(t) the populations (or biomass or density) of the prey and predator, respectively, at 
time t. In 1926, Volterra came up with a model to describe the evolution of predator and prey based on the following 
assumptions: 


1. in the absence of predators the per capita prey growth rate is a constant, but linearly falls as a function of the 
predator population when predation is present; 


2. in the absence of prey, the per capita growth rate of the predator is a negative constant, and increases linearly 
with the prey population when prey is present. 


This leads to the following system of differential equations: 


1 da 1 dy 
-—~=r-— by(t), Ta + Ba(t). 


The positive constants r, b, 4, 8 represent the following: r is the per capita growth rate of the prey population when 
predators are not present and yu is the per capita death rate of predators when there is no food. The constants b 
and £ represent the effect of interaction between two species. The above system can be rewritten as 


&=a(r—by), y=y(-pwt ba). (9.3.8) 


Since each of the equations in (9.3.8) is separable, we can solve the system explicitly: 


dy y _ y(-u+ Bx) roby 4 — ce + Be 
— == = —— dy = ——_ dz. 
dz «£ x (r — by) x 


However, we put this approach on the back burner (see Example 9.4.4, page 519) and pursue our main technique— 
linearization. First, solving the nullcline equations 
z(r—by)=0, y(—p+ fx) =0, 


we obtain two critical points: 


The Jacobian matrix at the origin, 
r 0 


has two real eigenvalues of opposite sign. Therefore, it is a saddle point. At another critical point, we have 
x(4 *) es —ba | | 0 ag 
B’ b By —p+ Ba a rB/b 0 : 
m 


ia 
Since its characteristic equation \? + ry = 0 has two pure imaginary roots, +j,/rp, the stationary point (4. r) is 


not hyperbolic, and the Grobman—Hartman theorem (see §9.2) is not applicable. We analyze this case in the next 
two sections, and now just plot the phase portrait. From Fig. 9.15, it follows that trajectories are closed curves in 
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Figure 9.15: Phase portrait of the Lotka—Volterra model (9.3.8), plotted with Mathematica. 


the first quadrant that spiral around the critical point (4. ): Since no orbit can cross a coordinate axis, every 
solution starting in the first quadrant remains there for all time. 

By plotting two populations of prey x(t) and predators y(t) with respect to time t, we see from Fig. 9.16 that the 
oscillation of the predator population lags behind that of the prey. Starting from a state in which both populations 
are small, the population of prey increases first because there is little predation. Later, the predators increase in 
size because of abundant food. This causes heavier predation, and the prey population shrinks. Finally, with a 
diminishing food supply, the predator population also decreases, and the system returns to the original state. 


Example 9.3.6: Consider the Lotka—Volterra system of differential equations 
& = x (0.3 — 0.024 y), y = y (0.02 a — 0.4) 


for x(t) and y(t) positive when t > 0. The critical points of this sytem are the solutions of the simultaneous algebraic 
equations 


«(0.3 —0.024y)=0,  —-y (0.02 — 0.4) =0, 


namely, the points (0,0) and (20,12.5). The Jacobian at the origin, J(0,0) = nm = | has two eigenvalues 
A = 0.3 and ’ = —0.4; therefore this point is a saddle point, unstable. The Jacobian at another stationary 
point J(20,12.5) = rn aa has two pure imaginary eigenvalues, so linearization does not provide enough 


information about its stability. The phase portrait on Fig. 9.15 confirms that the trajectories are closed curves, and 
this point is a center (stable, but not asymptotically stable). 


The Lotka—Volterra model of interspecific competition, also known as the predator-prey model, is frequently used 
to describe the dynamics of biological systems in which two species interact, one as a predator and the other as prey. 
Even though it is a very simple model, it explains cyclic variations in populations of two species observed in reality. 
However, its simplicity leads to some obvious flaws: 


83Umberto D’Ancona (1896-1964) wrote more than 300 scientific articles and textbooks. His field of interest was extremely vast and 
ranged from physiology, to embryology, hydrobiology, oceanography, and evolutionary theory. Umberto interrupted his study at the 
University of Rome during World War I to go and fight as an artillery officer, where he was wounded and decorated. In 1926 he married 
Luisa Volterra. 
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Figure 9.16: Variations of the prey (in black) and predator (in dashed blue) populations with time for the Lotka— 
Volterra system (9.3.8). 


1. There is no possibility of either population being driven to extinction. 


2. Changing the birth r and death zs rates does nothing but change the period of the oscillation, i-e., none can 
dominate. 


3. A prey population in the absence of predators would grow exponentially toward infinity. 


4. Population orbits always go in a counterclockwise direction while there are known quantitative data for which 
the direction should be clockwise. 


5. The population size of prey usually oscillates even in the absence of predators, most likely due to climatic 
variations and to epidemics. 


There are many known refinements to the Lotka—Volterra (LV for short) model that we cannot analyze due to size 
constraints. Therefore, we consider only one more model by introducing a self-limiting term for the growth of the 
prey and predator, reducing the equations to a logistic model: 


nie a (9.3.9) 
y=y(—n- ay + Ba), 


where, as usual, x(t) denotes the population of prey, and y(t) stands for the population of the predator. Let us 
consider the nullclines, which are solutions to 


z£=0: x=0 or r—axr—by=0, 
y=0: y=0 or -p-ayt+ Cr =0. 


This system of algebraic equations has four critical points: 


00, (@-§) (0), (Greeters) 


Since all coefficients are positive, we have to disregard (0,—u/a) as being biologically meaningless. Therefore, this 
system has at most three nonnegative solutions, but only one 
» rat bu 


* rp — ap 
<oe eae 9.3.10 
7 = aa+b8 609» ~ aa tbs ( ) 
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Figure 9.17: Phase portrait of predator-prey model Figure 9.18: Phase portrait of predator-prey model 
when r6 > ap, plotted with Mathematica. when rf < ap, plotted with Mathematica. 


will be in the first quadrant when rf — ay > 0. When r@ < ap, we have two critical points on the boundary of the 
first quadrant. Phase portraits of these two possible cases when rf > ay and rf < ap are plotted in Figures 9.17 
and 9.18, respectively. On the boundary y = 0 (which corresponds to extinction of predators), we have one critical 
point « = r/a, which is unstable when rf > ay and asymptotically stable when rB < ay. There is no (positive) 
critical point on another boundary x = 0. The origin (0,0) is always an unstable saddle point. 

It is not obvious whether the trajectories of the LV model are closed paths or spirals (or something else?) in a 
neighborhood of the stationary points. To complete the phase plots, we need to determine the correct behavior of 
the trajectories near steady states, i.e., perform the linear stability analysis. For the Jacobian matrix, we obtain 


— 2ax — by —ba 
(x,y) = |" 
(79) | By — pe — 2ay + as| 
Hence, at (0,0), we have 
r 0 


so that the eigenvalues r and —y are of opposite sign, showing that the origin is a saddle point. At (r/a,0), we have 


r —r —rb/a 
J (<. 0) = 
a | 0 -—p+ Br/ | 
The eigenvalues of J(r/a,0) are thus Ay = —r < 0 and Ag = —y + Br/a. In the case when Gr > ap, the eigenvalue 
A2 is positive, so that there is no interior steady state, and the critical point (r/a,0) is unstable (saddle). When 
Gr < ap, both eigenvalues \1,2 are negative, and the stationary point (r/a,0) is a stable node, so the interior steady 


state exists. 
Finally, we consider the linear stability of (a*,y*). We have 


ra + bu : 
* * < ad + bB Bag 
I(x" ,y") = By" 3aap + 2bBu — raB 
a aa + bB 


When rf > ap, the matrix J(x*,y*) has two complex conjugate eigenvalues with a negative real part. Therefore, 
the stationary point (2*, y*) is an asymptotically stable spiral point (see Fig. 9.17). 


Example 9.3.7: Consider a simple example of interactions of parasites on hosts modeled by the following system 
of differential equations: 
h = h(a — bp), p =p(ch? —d-sp), 
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where h(t) is the population density of the hosts, p(t) is the mean number of parasites per host, and a, b, c, d, s 
are some positive constants. The parasite-host system has two critical points in the first quadrant: the origin and 


rae bd+as a ; : _ ja—by —bx . 
(a*,y*) = (\ ob $) The Jacobian matrices J(x, y) = | ay ae Sd oey| at these points are 


5(0,0)=|5 9] ana sey") = |aaee a | 
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Figure 9.19: Example 9.3.7: phase portrait of the model describing interactions of parasites on hosts, plotted with 
Mathematica. 


Since the eigenvalues of the matrix J(0,0) are Ay = a > 0 and Ay = —d < 0, the origin is an unstable saddle point. 
At the stationary point («*,y*), the trace of the Jacobian matrix is tr J(2*,y*) = — < 0, and its determinant is 
det J(x*,y*) = 34 (bd +as) > 0. Therefore, (x*,y*) is always a stable equilibrium solution for any positive values 
of coefficients: it is a spiral sink when (tr J 7 < 4det J and it is an attractor when the latter inequality fails. 

Cycle variations of predator and prey as predicted by the Lotka—Volterra model (9.3.8) have been observed in 
nature. A classical example of interacting populations in which oscillations have been observed is the data collected 
by the Hudson’s Bay Company in Canada during the period 1821 — 1940 on furs of the snowshoe hare (prey) 
and Canadian lynx (predator). Even though the data may not accurately describe the total population sizes, the 
plotting graphs of the hare and lynx unambiguously indicate that their trajectories are going clockwise while models 
(9.3.8) and (9.3.9) predict the opposite direction. Various suggestions have been made to explain the anomaly. One 
possibility is that the predator-prey models are too sensitive to actual perturbation in populations: relationships 
among species are often complex and subtle, especially when the number of distinct species exceeds three. 


Nevertheless, mathematical models of biological systems have a long history of use not only in population 
dynamics but also in other disciplines such as economic theory. Frequently, a system of the Lotka—Volterra type is 
used to give some indication of the type of behavior one might expect in multidimensional cases. These models predict 
observable cohabitation of distinct species or industries. Biological experiments suggest that initial population sizes 
close to the equilibrium values cause populations to stay near the initial sizes, even though the populations oscillate 
periodically. Observations by biologists of large population variations seem to verify that individual populations 
oscillate periodically around the ideal cohabitation sizes. 
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9.3.3 Other Population Models 


There are situations in which the interaction of two species is mutually beneficial, for example, plant-pollinator 
systems. The interaction may be facultative, meaning that the two species could survive separately, or obligatory, 
meaning that each species will become extinct without the assistance of the other. A mutualistic system can be 
modeled by a pair of differential equations with linear per capita growth rates: 


&£=a(r—azr-+by), y=y(wt+ecx— dy). (9.3.11) 


In the above system, the mutualism of the interaction is modeled by the positive nature of the interaction terms cx 
and by. In a facultative interaction where two species can survive separately, the constants r and yp are positive, 
while in an obligatory relation these constants are negative. In each type of interaction there are two possibilities, 
depending on the relation between the slope a/b of the z-nullcline and the slope c/d of the y-nullcline. Since their 
analysis is very similar to those described in §9.3.1, we restrict ourselves to one example. 


Example 9.3.8: Consider the following model: 


g=2(1-2+3y), y=y(8+ar—-5y). 
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Figure 9.20: Example 9.3.8: phase portrait of the model describing mutually beneficial interaction of species, plotted 
with Mathematica. 


The above system has four critical points: 
(0,0), (0,06), (1,0), (7,2), 


but only one of them, (7,2), is biologically meaningful because it leads to mutual coexistence. The Jacobian at this 
point is 


J(7,2) = EB “al . wih $e, ace a= oe. 


Since this matrix has two negative real eigenvalues \y = —(17 + V177)/2 + —15.1521 and Ay = —(17 — V177)/2 & 
—1.84793, the critical point (7,2) is an attractor. 

Now we go in another direction and briefly discuss the impact of humans on population dynamics—a very 
important and rapidly developing subject. In studying models for competition between two species (§9.3.1), we 
begin with the system (9.3.1). We will consider only the harvesting of one of the two species, say z-species. With 
constant-yield harvesting, the model is 


&=a(ry— aya — by) — A, Y = Yy(r2 — agu — boy). 


The z-nullcline, instead of being the pair of lines « = 0, 71 = a,x + bry, is now the curve x (r1 — aia — biy) = H, 
which is a hyperbola having the lines « = 0 and ayx + biy = r, as asymptotes and which moves away from these 
asymptotes as H increases. 


Example 9.3.9: (Example 9.3.2 revisited) Determine the response of the system 
= «(180— 3x2-y), y = y (100 — x — 2y) 


to constant-yield harvesting of the x-species. 
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Solution. With no harvesting, there is an asymptotically stable equilibrium at (52,24). A harvesting system has 
the form 
& = x (180 — 32 — y) — HA, y = y (100 — a — 2y). 


Equilibria are given by the pair of equations 
x (180 — 3a — y) = H, x + 2y = 100. 
Replacing x by 100 — 2y in the first of these equations, we obtain a quadratic equation for y: 
(100 — 2y) (5y — 120) = H = y” — 74y + (120 + H/10) = 0. 


From the quadratic formula, we have 


y = 37+ 169 — H/10. 


For H = 0, these roots are y = 50 (which gives x = 0) and y = 24 (which gives x = 52). There is an asymptotically 
stable stationary point at (52,24) and a saddle point at (0,50). As H increases, these equilibria move along the line 
x + 2y = 100 until they coalesce at (26,74) for H = 1690. 


Systems of equations in Problems 1 and 2 can be interpreted as describing the interaction of two species with populations 
x(t) and y(t). In each of these problems, perform the following steps. 


(a) Find the critical points. 


(b) Find the corresponding linearization for each stationary point. Then find the eigenvalues of the linear system; classify 
each critical point as to type, and determine whether it is asymptotically stable, stable, or unstable. 


(c) Plot the phase portrait of the given nonlinear system. 


(d) Determine the limiting behavior of x(t) and y(t) as t + oo, and interpret the results in terms of the populations of the 
two species. 


1. Equations modeling competitions of species: 


oh eae eee © (panes 
o i.) 
Oo Oa eral 
Sr bee rem oe ea 
© [icyenaa iD eet ores 
enemies ae 
Oo Rie a ee 
2. Predator-prey vector equations 
eer oo ae mes 


on Rea 
yYy=yY Y=U\s z)- 
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3. 
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What is the outcome of a competition of two species modeled by the system 


é=2(12—2-y-27), x”)? 


y=y(16-3r-y 


4. Determine the qualitative behavior of a predator-prey interaction modeled by the Holling system. 


10. 


@ {220s ats), e) Ve" 


x 


g=y( 
oft Otte) 
Ofte a a 
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. Show that the stationary point (x*,y*) in the first quadrant («* > 0, y* > 0) of the predator-prey system modeled by 


hoe az aM 
c+A)’ ote A aA 


is unstable if K > A+2M, and asymptotically stable if M< K <A+2M. 


. Let (a*,y*) be a critical point (9.3.4) of the system (9.3.1). Show that the discriminant of the Jacobian matrix (9.3.7) 


does not contain quadratic terms: 
det J(xj, 23) = rire — berix{ — birex3. 


. The populations x(t) and y(t) of two species satisfy the system of equations 


t=2(7-30-2y), y=y(4—20-y) 


(after scaling). 
conditions? 


Find the stationary points of the system. What happens to the species in the cases with initial 


(a) c=1, y=3; (b) cx=3, y=1. 


. Suppose that you need to model populations of blue whales and fin whales that inhabit some part of Pacific ocean. 


Since they both rely on the same source of food, they can be thought of as competitors and their populations can be 
modeled by the system of equations (9.3.1), page 501, where x1(t) represents the population of blue whales and x2(t) 
stands for the population of fin whales. Units for the population sizes might be in thousands of species. The intrinsic 
growth rate of each species is estimated at 3% per year for the blue whales and 5% per year for the fin whale. The 
environmental carrying capacity is estimated at 15 thousand blues whales and 35 thousand fin whales. The extent to 
which the whales compete is unknown, and you simplify your model by choosing b; = bz = 6 in Eq. (9.3.1). 


(a) Estimate an interval of values of b for which coexistence of both types of whales is possible. 


(b) Find equilibrium solutions for b = 0.02 and determine the type and stability of each critical point. Describe what 
happens to the two populations over time. 


(c) Answer the previous question for b = 0.05. 


. One of the favorite foods of the blue whale is called krill. These tiny shrimp-like creatures are devoured in massive 


amounts to provide the principal food source for the huge whales. If x(t) denotes the biomass of krills and y(t) denotes 
the population of whales, then their interaction can be modeled by the following Lotka—Volterra equations: 
& = x (0.3 — 0.0075y) , y = y(—0.2 + 0.00252) . 


Determine the behavior of the two populations over time. 


Redo the previous problem based on the logistic equation 


¢=2 (0.3 aa 


0.0075y) ,  y=y(—0.2 +.0.00252). 
The maximum sustainable population for krill is 1200 tons/hectare. In the absence of predators, the krill population 


grows at a rate of 30% per year. 


9.3. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 
20. 
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Suppose your pond contains two kinds of freshwater fish: green sunfish and bluegill. You estimate their carrying 
capacities as 800 species of each population, and their unrestricted rate to be 20%. The corresponding competition 
model becomes 


. x y 
=ftetia-- 00 i = 0.2 (i- -0. i 
£=0 mat ao — 0.016 y g=0.2y soe — 0.008 « 


where z(t) is the population of green sunfish and y(t) is the population of bluegill. Which species appears to be the 
stronger competitor? What outcome would you predict if we started with 100 of each species? 


You have both green sunfish and bluegill available for stocking your pond. You model their interaction with the following 
system of equations: 
: x , y 
=0.2 (1- =~ - 0.06y), =0.12 (1-4 -008 i" 
7 “\"~ 7200 2 - YX" ~ 800 : 
Which species appears to be the stronger competitor? 
The robin and worm populations at time t years are denoted by r(t) and w(t), respectively. The equations governing 
the growth of the two populations are 
r(t) = 3w(t) — 30, 
w(t) = 15 — r(t). 
If initially 8 robins and 10 worms occupy the ecosystem, determine the behavior of the two populations over time. 


For competition described by the Holling—Tanner system of equations with r=s=1, K = 10, andh=5: 


= ar (1-2) - ye”, ius 4), 


which of two species outcompetes the other? 

In an ecosystem, let x(t) stand for the population of preys, and y(t) and z(t) denote populations of two distinct predator 
species that compete for the same source of food—prey x(t). Suppose that their interaction is modeled by the following 
system of equations: 


& =x (12 — 2y — 32), 
j= (-2-4 20 — 32), 
z=z(-l+z-y). 


Find all critical points in the first quadrant of the above system and classify them as stable, asymptotically stable, or 
unstable. 

In an ecosystem, let x(t) and y(t) stand for two populations of competing preys, and z(t) denote populations of predator 
species that prey on both preys, x(t) and y(t). Suppose that their interaction is modeled by the following system of 
equations: 


2 
3 
i=u($-2-y-2), 
2=2(4a + 3y —4). 
Find all critical points in the first quadrant of the above system and classify them as stable, asymptotically stable, or 


unstable. 


Termite assassin bugs use their long rostrum to inject a lethal saliva that liquefies the insides of the prey, which are 
then sucked out. We may be interested in quantifying the number of termites that can be consumed in an hour by the 
assassin bug. Determine the equilibrium behavior of the corresponding predator-prey system modeled by 


a j= an: 
7 30 epi)? ~""\eal0 3/7 


Consider the predator-prey system 


&=5xa—axcy+ex(3—-2), y = ry — 3y, 
containing a parameter e. For this system of equations, a bifurcation occurs at the value ¢ = 0. 


(a) Find all critical points for « = 1,—1, and 0. 
(b) Find the Jacobian matrices at all these points. 


(c) Determine stability at each equilibrium point. Identify cases where linearization does not work. 


In each exercise from Problem 1, determine the response of the system to constant-yield harvesting of the x-species. 


In each exercise from Problem 1, determine the response of the system to constant-yield harvesting of the y-species. 
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9.4 Conservative Systems 


Many differential equations arise from problems in mechanics, electrical engineering, quantum mechanics, and other 
areas where conservation laws may be applied. In particular, a conservation law states that some physical quantity, 
which is usually energy, remains constant. In reality, a physical system is never conservative. However, mathematical 
models often neglect effects such as friction, electrical resistance, or temperature fluctuation if they are small enough. 
Therefore, we operate with idealized mathematical models that may obey conservative laws. We will see later that 
in many cases mathematical expressions that have no physical meaning behave conservatively. In this section we 
analyze mathematical models using systems of autonomous differential equations for which conservation laws can 
be applied. 
Consider a mechanical system that is governed by Newton’s second law, 


F=mj, j=dy/d’, 


where the force F = F'(y) depends only on displacement y. By dividing the initial equation F = m¥% throughout by 
the bothersome mass m, we can rewrite it as 
9+ f(y) =9, (9.4.1) 
where f(y) = —F(y)/m. Now we show that Eq. (9.4.1) possesses a conservative law. We first multiply the equation 
by y, obtaining 
yi + fy)y=0. 


Recalling the chain rule of calculus, we see that the first term on the left-hand side is 


ee: eee 
=a, E (y) : 
Likewise, if II(y) denotes an antiderivative of f(y), we express the second term as 
fyI= FMW), with Sy) = sly) 
yy = dt Y), dy Y) = FY): 


Using these derivative expressions, we form the differential equation 


col fe ee 
~ |= (y)2 + M(y)| =0. 
3 [ga + | 
Recognizing in the expression (1/2)(y)?, the kinetic energy, and in II(y), the potential energy of the system, we see 
that the total energy is a constant (denoted by K): 


Ey, 9) = 5 (y)? +My) = K. (9.4.2) 
Equation (9.4.2) is the underlying conservative law. For our mechanical system, if y(t) represents a displacement, 


then the term (1/2)(y)? is kinetic energy per unit mass, and II(y) is potential energy per unit mass. 
Differential equation (9.4.1) can be recast as the first order autonomous system 


Ly = %2, 
LQ = =f (ai), 
where x(t) = y(t) and x2 = y(t). Thus, the conservative law (9.4.2) takes the form 
act lg d 
E(#1,%2) = 3 72 +21) = K, where a l(a) = f(a1). 
1 


The family of curves obtained by graphing E(21,22) = K for different energy levels K is a set of phase-plane 
trajectories describing the motion. It is called the Poincaré phase plane of the differential equation (9.4.1). Solving 
Eq. (9.4.2) with respect to the velocity v = y = x2, we obtain 


v =a, =4+V2(K- I(y))\/? ; Kc is a constant. 
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Hence, the (real-valued) velocity exists only when K —II(y) > 0. This leads to the following three types of trajectory 
behavior. 

1. The potential function II(y) has a local minimum at Ymin. Then the level curves E(a1,x72) = K, where K is 
slightly greater than II(ymin), are closed trajectories encircling the critical point (ymin, 0). Therefore, the equilibrium 
solution y = ymin is a center. 


2. The potential function H(y) has a local maximum at ymax. Then the level curves E(a1,22) = K ina 
neighborhood of the critical point (ymax,0) on the phase plane (21,22) = (y,y) are not closed trajectories. For 
K > U(ymax), there are trajectories moving away from the critical point because their velocities (v = +/2 x 


(kK — II(y))/ *) are either positive or negative . However, for K < I (ymax), there is an interval around it where 
we do not observe any solution (the root (K — II(y))/ * is purely imaginary). Therefore, the equilibrium solution 
Y = Ymax is a saddle point. 

3. Away from critical points of II(y), the level curves may be part of a closed trajectory or may be unbounded. 


Example 9.4.1: (Pendulum, Example 2.6.18 revisited) The differential equation 6 + w2sin@ = 0 used to 
model the motion of the ideal pendulum in the plane does not account for any resistive force, obeying the conservation 
law. It can be rewritten in the Poincaré phase plane form: 


L=yY, 


oe 2 s 
y= —w* sina, 


where xz = @ is the angle of the bob’s inclination. Now we express the dependence of the velocity y = 6 with respect 
to inclination x. To find it, we write 

dy  y  dy/dt — w? sing 

dx « da/dt y 


and separate variables 
y dy = —w* sin x dz. 


Then integrate both sides from + = 0 to « = x on the right-hand side and from y = 0 to y = y on the left-hand side. 
This yields 


1 
5 y’ +w" (1—cosz) = K (9.4.3) 


for some constant K. The term w?(1—cos@) = w?(1—cos) represents the potential energy of the pendulum 
bob, measured relative to a zero reference at the horizontal line of the downward position. Since the first term 
y’?/2 = «?/2 corresponds to the kinetic energy, Eq. (9.4.3) describes the conservation law of mechanical energy for 
the undamped pendulum. Using the trigonometric identity 1 — cosa = 2 sin?(«/2), we solve Eq. (9.4.3) and express 
y = & in terms of x explicitly: 

y= =+ (2K — du? sin? (x/2))””. (9.4.4) 


We set up a frame of Cartesian axes x, y, called the phase plane, and plot the one-parameter family of curves 
obtained from Eq. (9.4.4) by using different values of energy level K. This leads to the picture (see Fig. 9.21), called 
a phase diagram or a phase portrait for the problem, and the solution curves are called the phase paths or 
trajectories. Various types of phase paths can be identified in terms of K. The critical value of the energy K = 2w? 
corresponds to the paths joining (—7,0) and (7,0). These two curves 


y = £2w4/1 — sin?(x/2) = +2w \cos =| (K = 2w?) 


2 


separate the family of solutions into disjoint sets where K > w? and where K < w?, with different properties. 
Therefore, each of these curves (plotted in blue in Fig. 9.21) is called a separatrix because it separates phase curves 
representing two distinct behaviors: one is periodic, called oscillation, and plotted within shaded areas in Fig. 9.21 
where the pendulum swings back and forth, and the other is aperiodic, called rotation, where the pendulum swings 
over the top. Each separatrix approaches an equilibrium point without reaching it: they are unstable saddle points 
((2k + 1)a,0), & =0,41,+2,..., that correspond to the upward position of the pendulum. The basins of attraction 
for stable stationary points (km, 0) are shaded regions in Fig. 9.21 bounded by separatricies. Each of these points 
has its own region of asymptotic stability, which is bounded by separatricies entering the two neighboring unstable 
points. 
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Figure 9.21: Example 9.4.1: The level curves for an undamped pendulum, plotted with Mathematica. 


A given pair of values (x,y), or (x,%), represented by a point P(x,y) on the diagram is called a state of the 
system. A state gives the angular velocity y = & at a particular inclination x = 6. A given state (a,#) serves also 
as a pair of initial conditions for the original pendulum equation; therefore, a given state determines all subsequent 
states, which are obtained by following the trajectory that passes through the point P. 

The direction of the point moving along these trajectories coincides with the direction field arrows that are 
tangent to the phase paths. When y = « > 0, x must increase as t increases. This leads to the conclusion that the 
required directions are always from right to left in the upper half-plane. Similarly, the directions are always from 
left to right in the lower half-plane. 


9.4.1 Hamiltonian Systems 


We now discuss a class of autonomous first order systems that satisfy a conservative law. A system is called 
a Hamiltonian system™ if there exists a real-valued function that is constant along any solution of the system. 
They are completely described by a scalar function H(q, p), where both q and p are vectors with the same dimension, 
called generalized coordinates and momentum, respectively. Therefore, Hamiltonian systems necessarily have even 
dimensions. For simplicity, we consider only two-dimensional systems. 

Let H(x,y) be a smooth function of two independent variables. If we replace x and y in H(x,y) by functions 
x(t) and y(t), the composition H(a(t), y(t)) will become a function of the variable t. According to the chain rule, 
its derivative becomes 

H = © H(a(t),y() = SS 
x dt Oy dt 


If these two functions 7 = x(t) and y = y(t) are solutions of the two-dimensional autonomous system 


a f(x,y), (0.4.5) 
y= g(x,y), 
then this system is called a Hamiltonian system if 
OH OH 
po eyes i =, 9.4.6 
&= f(x,y) ay? g(x,y) Da (9.4.6) 


The function H(z, y) is called the Hamiltonian function or simply the Hamiltonian of the system (9.4.5). Since 
the composition H (a(t), y(t)) is a conservative quantity of the system, we get the general solution of Eq. (9.4.5): 


A(x(t), y(t) = ¢, 


for some constant c. The following theorem gives the necessary and sufficient conditions for a system (9.4.5) to be 
conservative. 


84Sir William Rowan Hamilton (1805-1865) was an Irish physicist, astronomer, and mathematician. Shortly before this death, he was 
elected the first foreign member of the United States National Academy of Science. 
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Figure 9.22: Example 9.4.2: The level curves, plotted with Mathematica. 


Theorem 9.2: Consider the two-dimensional autonomous system (9.4.5). Assume that f(x, y) and g(a, y) along 
with their first partial derivatives are continuous in the xy-plane. Then the system is a Hamiltonian system if 
and only if 


—= Sos for all (a, y). (9.4.7) 


Example 9.4.2: Consider the autonomous system 
& = 4y? + sinz, 
y = 32? +1-y cosa. 


Setting f(x,y) = 4y? +sinz and g(x,y) = 3x7 +1 —y cosa, we apply the Hamiltonian test (9.4.7): 


of Og 

— =cosr=-—. 

Ox Oy 

Since our system is Hamiltonian, there exists a function H(x, y) such that 

OH 
— = —9(x,y) = —327 —1+y cosz, 
Ox 

OH 

— = f(z,y) = 4y? +sinz. 

Oy 


For integration, we choose one of these equations, say the latter, and compute an anti-partial-derivative, obtaining 
H(z,y)=y* +y sina + k(2), 
where k(x) is an arbitrary differentiable function of x. Using the former equation, we get 


on 
Ox 


Therefore, the derivative of k(x) is 


=y cosz+ k'(x) = —g(a, y) = —327 —1+y cosz. 
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Figure 9.23: Example 9.4.3: The level curves, plotted with Mathematica. 


where we dropped an arbitrary constant of integration. Hence, the Hamiltonian becomes 


H(z,y) =y* +y sing — 2° — 2. 


Example 9.4.3: (Pendulum ) If we approximate sin@ by 0 — 63/3! in the pendulum equation 6+ w? sind = 0, 
we will arrive at the following nonlinear system: 


t=y, y=-ct+2°/6. 


The level curves of the trajectories can be obtained from the differential equation 


which, after separation of variables, gives 


1 1 1 il 
yay = (0% ~2) de = su 5° -5e te 
Therefore, the Hamiltonian of this system reads 
1 1 1 
H Se 2 Spa Se 4 
(y=sr +52 - 52 


To plot these level curves (Fig. 9.23), we use the following Mathematica script: 


ContourPlot[(1/2)*y*y + (1/2)*x*x - (1/24) *x*x*x*x, {x, -4, 
4}, fy, -3, 3}, ContourShading -> None, ContourLabels -> True, 
PlotRange -> {-5, 5}, Contours -> 19, AspectRatio -> Automatic] 
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Hamiltonian tar Lota-Voterre model: H(x.y)=beta x+D y-alphe In{x}-a tty} 


Figure 9.24: Example 9.4.4: Lotka—Volterra level curves, plotted with Mathematica. 


Example 9.4.4: (Lotka—Volterra model) Consider the predator-prey model (9.3.8): 


z=a(r—by), y=y(br-p). 


Upon multiplying the former equation by (Ga — 4) /x and the latter by (r — by) /y and subtracting the results, we 
obtain the exact equation 


— —-——_ — =0 
x dt y dt 
or 


qy (82 + by —wlne —r Iny) =0. 


Therefore, the Hamiltonian of the predator-prey equations is H(x,y) = 8a + by—yInx«—r In y. All orbits of the 
Lotka-Volterra model evolve so that they keep the Hamiltonian constant: H(x(t), y(t)) 
claim that H(z, y) is a concave function, which follows from the equations 


H(x(0),y(0)). Now we 


Ha=a2,. HyHae,, He Sa. = 6. 
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The stationary point is the point where VH = 0, that is, (a*,y*) = (4. r) is a critical point. Since at this 
point Hr Hy, — Hz, = ru/(«y)? > 0 and Hye > 0, Hyy > 0, the Hamiltonian attains a minimum at (2*,y*): 
H (a*,y*)=p—-—b-—p In§ —r ling. 

Therefore, all trajectories of the predator-prey equations are the projections of the level curves H(x,y) = Ho of 
the Hamiltonian. Notice that this corresponds to the unique steady state of the system (9.3.8), page 505. Since H is 
a strictly convex function with a unique minimum in the positive quadrant, every trajectory must be a closed curve. 
Thus, the orbits are a one-parameter (depending on the value of H) set of closed curves starting at the steady state. 


Example 9.4.5: Solve the system of autonomous equations 
é=y(1—-2’), y=a(y’—4). 
Solution. The stationary points occur at the simultaneous solutions of 


yU-—2) =0, a (y?—4) =0. 


These are five solution pairs of the above algebraic system of equations: (0,0), (1,2), (1,2), (—1,2), (—1, —2). 
The phase paths satisfy the differential equation 


dy _ z(y?—4) 
dx y(1—2?)’ 


which is a first order separable equation. Upon separation of variables and integration, we obtain the general solution: 


(y? —4) (1-2?) =c, a constant. 


Notice that there are special solutions (separatricies) along the lines x = +1 and y = +2 where c = 0. Streamlines 
cross the vertical axis x = 0 with zero slope, and paths cross the horizontal axis y = 0 with infinite slope. The 
directions of streamlines may be found by considering domains where « > 0 or & < 0, and similarly for y. 


1. For computing the motion of two bodies (planet and sun) which attract each other, we choose one of the bodies (sun) 
as the center of our coordinate system; the motion will then stay in a plane and we can use two-dimensional coordinates 
q = (a, 42) for the position of the second body. Newton’s laws, with a suitable normalization, then yield the following 


differential equations: 
q1 23 q2 


———;,  k=-——,;: 
3/2 3/2 
(q? + 43)*/ (q? + 3)*/ 


Find a Hamiltonian for this system of equations. 


a= 


2. Find the potential energy function H(y) and the energy function E(x, y) for the given equations. 


(a) #+ 2x —5a* = 0; (c) @+2e” -1=0; (e) &+cosax = 0; 
(b) #+3a? —6r+1=0; (d) #+2x/(1+ 27) =0; (f) +2 sina =0. 


3. Find Hamiltonians of the following systems. 


(a) & = Qn? — 3xy? + 4y3 — 2°, (c) & = 2ey sine — x’ siny, 
a c 
y=y? + 327y — 2ry; y= —y’ sine — 2x cosy — ry” cosa; 
= 2y (1 4? +4 y?) er ty & = —2y sin («? ae y?) 
(b) 5 2 2 2442 (d) . : 2 2 
y=2x(l+a +y7)er th; y = 2a sin (x +y’). 


4. For the following systems, determine whether they are Hamiltonian. If so, find the Hamiltonian function. 


(a) eee (b) more 


yay —2; 
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5. (Hénon—Heiles problem, 1964) The polynomial Hamiltonian®° in four degrees of freedom 


1 1 
Hap) =5 (a +02 +Pitp2) +92 - 34, a= (n,%), P= (p1,P2); 


3 


defines a Hamiltonian differential equation that can have chaotic solutions. Write the corresponding system of differential 
equations of first order and make a computational experiment by plotting some solutions. 


6. Find a Hamiltonian for the system # = y, y = x — 2”, and plot the level curves. What do you notice? 


7. Consider a solar system with N planets. Their motion can be modeled by a system of ordinary differential equations 
with a Hamiltonian 


n(a.p) => +p gy ei ae 
2m, =a Tae — asll’ 


where mo = 1 is the mass of the sun, and m; are masses relative to mo of the planets, g = 2.95 x 10~* is the gravitational 
constant. Write this system of equations explicitly, and make a computer experiment with N = 2 planets by solving 
numerically the system and plotting their solutions. 


8. Consider a second order linear system 
z= —azx — by, y = —bxa —cy, 


where a, 6, and c are constants. 


(a) Show that the system is equivalent to a first order Hamiltonian system, with Hamiltonian 
A(2,y,u,v) = 3 (az? + Qbry + cy? + u? +07), where & = u, y= v. 


(b) Show that the system has a stable equilibrium at the origin if a@ > 0 and ac > b?. 
9. Let q = (q1,q2,---;Qn),; P = (P1, p2,---,Pn). Consider a Hamiltonian 


N i-1 


Har)=$> 20h +2 Vs (lla - ail 


t=2 j=l 


that can be used to model the interaction between N pairs of neutral atoms or molecules with Lennard-Jones potential®® 


vse) men [(2)"— (2)"] =e (2) *-2(82)'] 


where ¢€ is the depth of the potential well, o is the finite distance at which the inter-particle potential is zero, r is the 
distance between the particles, and rm is the distance at which the potential reaches its minimum. At rm, the potential 
function has the value —e. The distances are related as rm = 2/65. Here q: and p; are position and momenta of the 
i-th atom of mass ™;. 

Write a system of differential equations with the corresponding Hamiltonian. 


10. Using the energy function E(x, y, z) = 27 + 4y? + 92’, show that E(z, y, z) is a constant along the motion of the system 


&=9rz—-—4yz, y=9yz+uz, 2=-2 Ay”. 


Then show that the equilibrium point (0,0,—a) (a > 0) is neutrally stable. 


11. When expressed in plane polar coordinates (r,@) given x = r cos0, y = r sin, show that the Hamiltonian equations 
(9.4.6) become 
_ 10H : 1 OH 
7 00’ or Or’ 
12. Sketch a phase portrait for each of the following Hamiltonian systems written in polar coordinates (r,@). Add the 
equation @ = 1. 


- JreaDe trad, ae |? ifr <4, 
(a) =f (b) =f 


0, otherwise; r(r—4), otherwise. 


85The corresponding system was published in a 1964 paper by French mathematician Michel Hénon (1931-2013) and American 
astrophysicist Carl Heiles (born in 1939). 

86Sir John Edward Lennard-Jones (1894-1954) was a British mathematician who was a professor of theoretical physics at Bristol 
University and then of theoretical science at Cambridge University. He may be regarded as the founder of modern computational 
chemistry. 
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9.5 Lyapunov’s Second Method 


The linearization procedure, discussed in §9.2, can be used for determination of the stability of a critical point only 
if a corresponding linear system has the leading matrix without purely imaginary eigenvalues. This approach is of 
limited use because it provides only local information: the results are valid only for solutions that have initial values 
in a neighborhood of a critical point. There is no information about solutions that start far away from equilibrium 
points. 

In this section we discuss a global method that provides information about stability, asymptotic stability, and 
instability of any autonomous system of equations, not necessarily almost linear systems. This method*” is referred 
to as Lyapunov’s second method in honor of Alexander Lyapunov, who derived it in 1892. 

The Lyapunov method is also called the direct method because it can be applied to differential equations 
without any knowledge of the solutions. Furthermore, the method gives us a way of estimating the regions of 
asymptotic stability. This is something the linear approximation would never be able to do. 


Definition 9.8: Let x* be an isolated critical point for the autonomous vector differential equation 
x = f(x), xER", (9.5.1) 


so f (x*) = 0. A continuously differentiable function V : U ++ R, where U C R” is an open set with x* € U, is 
called a Lyapunov function (also called a weak Lyapunov function) for the differential equation (9.5.1) at x* 
provided that 


1. V(x*) =0, 
2. V(x) > 0 forx €U \ {x*}, 


3. the function x 4 VV(x) is continuous for x € U \ {x*}, and, on this set, V(x) = VV(x) - f(x) < 0. 
If, in addition, 


4. V(x) <0 for x € U \ {x*}, then V is called a strong (strict) Lyapunov function. 


Theorem 9.3: [Lyapunov] If x* is an isolated critical point for the differential equation (9.5.1) and there 
exists a Lyapunov function V for the system at x*, then x* is stable. If, in addition, V is a strong Lyapunov 
function, then x* is asymptotically stable. 


We outline the proof only for a two-dimensional case. The idea of Lyapunov’s method is very simple. 
Let f(t) = (x(t), y(t)) denote a trajectory of the vector equation 


It is reasonable to assume that the level set S = {(x,y) € R? : V(x, y) =c} of the Lyapunov function V is a closed 
curve in the xy-plane and contains the only critical point of the system (9.5.2). The gradient VV is an outer normal 
for the level curve S = {x € R” : V(x) =c} because it points in the direction where the function V(x, y) increases 
fastest. If @(t) = (x(t), y(x)) is a trajectory of the system (9.5.2), then T(t) = «(t)i+ y(t)j is a tangent vector to 
it at time t. The derivative of V with respect to the given system (9.5.2) is a dot product of two vectors: 


V(t) = Velw,y) &(t) + V,(x,y) g(t) = VV(2,y) T(t) = |VV] - IL] cosa, 


where 6 is the angle between VV and T at time t. Recall that cos@ < 0 for 7/2 < 6 < 37/2; therefore, V is negative 
when the trajectory passing through the level curve V(x, y) = c is going inward. At points where V =0, the orbit 
is tangent to the level curve. Thus, V is not increasing on the curve t +> (x(t), y(t)) at t = 0, and, as a result, 
the image of this curve either lies in the level surface S, or the set {(a(t), y(t)) : # > O} is a subset of the set in 
the plane with outer boundary S. The same result is true for every point on S. Therefore, a solution starting on 
S is trapped: it either stays in S, or it stays in the set {x € R? : V(x) < c}. So the region where V(x) < 0 is 


87 Aleksandr Mikhailovich Lyapunov (1857-1918) was a famous Russian mathematician, mechanician, and physicist who made great 
contributions to probability theory, mathematical physics, and the theory of dynamic systems. In 1917 he moved to Odessa because of 
his wife’s frail health. Shortly after her death Lyapunov committed suicide. 
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contained in the basin of attraction of x*. The stability of the critical point follows easily from this result. If V is 
a strict Lyapunov function, then the solution curve definitely crosses the level surface S and remains inside the set 
{x € R? : V(x) < c} for all t > 0. Because the same property holds at all level sets “inside” S$, the equilibrium 
point x* is asymptotically stable. oO 


A Lyapunov function V can be thought of as a generalized energy function for a system. A function V(x) 
that satisfies the first two conditions in Definition 9.8 is usually called positive definite on the domain U. If 
the inequality in the second condition V(x) > 0 is replaced by V(x) > 0, then the function V(x) is referred 
to as positive semidefinite on U. By reversing inequalities in these definitions, we obtain negative definite 
(V(x) < 0 for x € U \ {x*}) and negative semidefinite (V(x) < 0 for x € U \ {x*}). Obviously, if V(x) is positive 
definite/semidefinite, then —V(x) is negative definite/semidefinite, and vice versa. Some simple positive definite 
and positive semidefinite polynomial functions are as follows: 


az?+by*, az?+by*, azt+by?, and az’, byt, («—y)?, 
respectively, where a and 0 are positive constants. Also the quadratic function 
V(a,y) =a2 +bay+cey? (9.5.3) 
is positive definite if and only if a > 0 and b? < 4ac (a, b, and c are constants). 


Example 9.5.1: In the two-dimensional case, the function V = 1 — cos (a? + 4y?) of two variables (x, y) is positive 
definite on the domain U : —7 < a2? + 4y? < 7 since V(0,0) = 0 and V(z, y) > 0 for all other points in U. 

However, the same function is positive semidefinite on R : —a < x? + 4y? < 3a because there exists the ellipse 
of points, x? + 4y? = 27, where the function V (sz, y) is zero. 


Theorem 9.4: [Lyapunov] Let x* be an isolated critical point of the autonomous system (9.5.1). Suppose that 
there exists a continuously differentiable function V(x) such that V(x*) = 0 and that in every neighborhood of 
x* there is at least one point at which V is positive. If there exists a region R 5 x* such that the derivative of 


V with respect to the system (9.5.1), V(x) = VV (x) - f(x), is positive definite on R, then x = x* is an unstable 
equilibrium solution. 


There are many versions of Lyapunov stability theorems that make different regularity assumptions. Their power 
comes from simplicity because one does not need to know any solutions. However, finding an appropriate Lyapunov 
function is generally considered as an art to come up with good candidates. 


Example 9.5.2: Lyapunov’s function for the system of equations 
é=axr—Byt+y’, y= Brt+ay—ay 


can be chosen as V(a, y) = ax? + by, with positive coefficients a, b. Then the derivative of V with respect to the 
given system is 
- dV OV daz OV dy 
V SSS s + ——— 
dt Ox dt Oy dt 
If we choose a = b > 0, then 


= 2a [ax? + by*] + 2xy*(a — b) + 2Baxy(b — a). 


V =2a [ax? + by? | = 2aa (a? + im) 
and the origin is stable if a < 0 and unstable if a > 0. 
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Figure 9.25: Example of a Lyapunov function, plotted with Mathematica. 


Example 9.5.3: (van der Pol Equation) Consider a modification of the harmonic oscillator equation # = —x 
by adding a term —e (x? — 1) ¢ that has a damping effect when || > 1 and an amplifying effect when |z| < 1: 


éte(2?-1)é+2=0, (9.5.4) 


where € is a positive parameter. This (unforced) differential equation was introduced by Lord Rayleigh in 1883. 
The Dutch electrical engineer and physicist Balthasar van der Pol (1889-1959) investigated this oscillator more 
extensively when he studied the equation in 1926 as a model of the voltage in a triode circuit. Since its introduction, 
this differential equation has been suggested as a model for many different physical phenomena (for instance, as a 
model for the human heartbeat). Rewriting it as an equivalent system of equations 


r=Y, 


y=-ate(l—2’)y, 


we see that it has an equilibrium solution at the origin. Now we choose a Lyapunov function in the form V(z, y) = 
ax? + by?, which is positive definite when constants a, b are positive. The derivative of V with respect to solutions 
of the van der Pol equation is 


V(a,y) = (2ax, 2by) - (y,-2 +e(1—27) y) = 2ary — 2bry + 2hey” (1 — x”). 


By taking a = b, we eliminate the first two terms in V. Then, assuming that € > 0, we get V(a,y) = 2bey? (1— a?) > 
0 for arbitrary (x,y) # (0,0) and for any b > 0, whenever || < 1. Evidently V(,y) > 0 inside the unit circle 
x? +y? <1, so a solution trajectory starting in or on a circle of radius 1 about the origin must leave the stationary 
point. Also V (a, y) > 0 when a? < 1 and y 40. The Lyapunov theorem guarantees that every trajectory starting 
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inside the unit circle not only departs the origin but tends to a closed loop (see Fig. 9.33 on page 535) because 
V(a,y) < 0 for x? > 1. Therefore, the origin is an unstable stationary point for small z. The actual region of 
asymptotic stability includes this unit circle. 

On the other hand, when z is large, the term «? becomes dominant and the damping becomes positive. In this 
case, V (a, y) <0. Therefore, the dynamics of the system is expected to be restricted in some area around the fixed 
point. Actually, the van der Pol equation satisfies the Levinson—Smith theorem 9.7, page 529, ensuring that there is 
a stable limit cycle in the phase space. The van der Pol system is therefore a Liénard equation (see §9.6). 


Example 9.5.4: Determine the stability of the critical point at the origin for the system 


L= -y?, y = —2°. 


Solution. ‘The origin is not an isolated equilibrium solution for the corresponding linear system. Therefore, we 
cannot apply the linearization technique from §9.2. To apply Lyapunov’s instability theorem 9.4, we try the function 


V(a,y) = —axy. It is continuous, V(0,0) = 0, and it is positive at all points in the second and fourth quadrants. 
Moreover, its derivative with respect to the solutions of the given system is 
~ OV OV 
V=—f4+—y= 24+, 
Ox Oy : : 


which is positive definite in any domain containing the origin. Hence, the origin is an unstable critical point. 


Example 9.5.5: (SIRS Epidemic Model) Consider a population of individuals infected with a nonfatal disease. 
A simple SIRS model with disease-induced death and bilinear incidence that describes the spread of the disease is 
as follows: 

S=A-—pS — BIS+6R, 

I=6IS—(y+p+t+o)l, (9.5.5) 

R= 71 -(u+)R, 
where the dot stands for the derivative with respect to time t, and £ is the contact rate, ys is the average death 
rate, 1/7 is the average infection period, and 6 is the average temporary immunity period. Let N(t) = S+1+R, 
with N(0) > 0, be the total population, where S is the number of susceptible people, R is the number of recovered 
individuals with immunity, and J is the number of infected people. Then 

N=A-pN—-al = lim sup N(t) < A/p. 


trroo 
The region 
Q= {($,1,R)eR. : S+I+R< A/p} 


is a positively invariant set to Eq. (9.5.5). Let Ro = GA/p/(u+a+y) be the basic reproductive number. We 
summarize the stability properties of the SIRS system in the following statement, adapted from [33]. 


Theorem 9.5: When Ro < 1, the disease-free equilibrium Po(A/j1,0,0) is globally stable in Q. If Ro > 1, the 
disease-free equilibrium Po is unstable, and there is a unique endemic equilibrium P*(S*, 1*, R*) that is globally 
stable in the interior of 2, where 


S* = a rae (1-3) 
; (at+p)(O+p) +7 Roy 


re oe Bat 
(a+ p)(O+ 4) +E (1 =) 


Straightforward calculations can show the existence of the disease-free equilibrium Pp and the endemic 

equilibrium P*. To prove the global stability of Po in 2 for Ro < 1, we choose the Lyapunov function J in Q. Then 
dl 

qe 7 BS — tetas 


< [4 -G+uta)] I= +n +a) -1)I<0. 
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Therefore, the disease-free equilibrium Pp is globally stable in Q for Rg < 1. 
To prove the global stability of P* in the interior of 0, we consider the following equivalent system of (9.5.5): 


P=1[B(N -I-R)]-(y¥+u +0), 
R= 7I—-(u+5)R, (9.5.6) 
N=A-pN-al. 

The positively invariant set of (9.5.6), corresponding to the positively invariant set Q of (9.5.5), is OQ’ = {(U, R, N) € 


R§o: 1+R<N<A/p}, and, when Ro > 1, Eq. (9.5.6) has a unique endemic equilibrium, P*(I*, R*, N*), corre- 
sponding to P*(S*, I*, R*), where 


A a(o + p) ( 1 )| 
Ne=S*4+I* +R =— |1 —- — —W |1-— ]]. 
L (a+ p)(6+p) +H Ro 
Hence, when Ro > 1, the system of equations (9.5.6) can be rewritten as 
I= pI[((N-N*)-—(I-I*)-(R-R*)]), 
R= y7(I-I*)-— (w+) (R—- R*), (9.5.7) 
N =-—p(N — N*)—a(I — I"). 


Consider the Lyapunov function 


a8 
2 


I 

VU, R,N) =ay (1-7 a nz) aides ew cd ae 4 (Non), 
which is a positive definite function in region 2’. The total derivative of V(J, R, N) along the solutions of (9.5.7) is 
given by 

dv 

dt 
Since V is negative definite, it follows from the Lyapunov theorem that the endemic equilibrium P* of (9.5.6) is 
globally stable in the interior 0’; that is, the unique endemic equilibrium P* of (9.5.5) is globally stable in the 
interior of 2. 


= -afy (I - I")? - oB(u+ 6) (R- R*)’ — By (N- N*y’. 


The previous examples show that finding an appropriate Lyapunov function can be a challenging problem. In 
systems coming from physics, often the total energy can be chosen as a Lyapunov function. More precisely, there 
are two important classes of systems for which Lyapunov functions are almost ready to be used. One of these classes 
we studied before in §9.4—conservative systems, and in particular, Hamiltonian ones. If the Hamiltonian H(x) is 
known (here x is a phase point of even dimension), then its gradient VH is perpendicular to the vector field of the 
Hamiltonian system. Therefore, we can choose H(x) — H (x*) as a Lyapunov function because the trajectories of 
the system lie on level surfaces of H. 

Every second order system with n degrees of freedom is equivalent to a 2n-dimensional first order system and 
therefore meets the even-dimensionality requirement for Hamiltonian systems. If the given system is conservative, it 
can be written as X = —VII(x), where II(x) is the potential energy. Then H(x, x) = 4 x? + I(x) is its Hamiltonian. 

Apart from conservative systems, there is another kind of system of the form x = kVG(x), where G: R" +> R 
is a continuously differentiable function and k is a constant. Such a system is called a gradient system. Suppose 
that the function G(x) has an isolated local minimum/maximum value at the point x = x*. Then this point will be 
a critical point of the given system. Its orbits follow the path of steepest descent/increase of G depending on the 
sign of k. In this case, G itself will be a Lyapunov function for the system at x*. Indeed, the total derivative of G 
becomes 

G(x) = VG (x) - (kVG(x)) = k|VG (x)|. 
If k < 0, then G(x) > G (x*) except at x = x*, and x = x* will be an asymptotically stable equilibrium solution. If 
k > 0 and x = x* is a local maximum, then —G(x) is a Lyapunov function. Note that the linearized system at any 
equilibrium has only real eigenvalues. Therefore, a gradient system can have no spiral sources, spiral sinks, or center: 
nondegenerate critical points of a planar analytic gradient system are either a saddle or a node. A two-dimensional 
system (9.5.2) is a gradient system if and only if 
Of _ ag 


By Be (9.5.8) 
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Example 9.5.6: (Example 9.4.1 revisited) The total energy for an ideal pendulum 

1 : F 

5 me?6? + mg (1 — wo cos0) = E(0, 0) 


is a natural candidate for a Lyapunov function. Taking out mé?, we get V(2,y) = E(zx,y)/(mé?), where x = @, 
& = y, a Lyapunov function: it is positive in the open domain 2 = {(#,y) : —7/2 <a < m/2,—-c0 < y < oo} except 
at the origin, where it is zero. Since its derivative with respect to the pendulum equation is identically zero, V is 
negative semidefinite. By Theorem 9.3, the origin is a stable critical point. 

The total energy of the undamped system becomes a Lyapunov function if damping is added. Indeed, for the 


damped pendulum equation 
L=Y, 
me = (9.5.9) 
y=-yy-w* sina, 


the total energy V(x, %) = 44? + w* (1 — cosz) satisfies the inequality 
V =au’ sing +a¢% =—-#7y <0. 


This shows that V(a,y) is a Lyapunov function; however, we expect the original to be an asymptotically stable 
point. So we need to find an appropriate strong Lyapunov function instead (see Problem 2). 


Example 9.5.7: (Example 9.2.6 revisited) The dynamics of the unforced system can be modeled by the 
Duffing equation 
&+ 64+ Be +azxr? =0, 


where the damping constant obeys 6 > 0. When there is no damping (6 = 0), the equation can be integrated as 
def 2 1 2, 1 4 
E(t) = 5 (#) + 3 Baur + i constant. 


Therefore, in this case, the Duffing equation is a Hamiltonian system. 
When 6 > 0, E(t) satisfies 


therefore, the trajectory x(t) moves on the surface of E(t) so that E(t) decreases until x(t) converges to one of 
the equilibria where « = x. For positive a, 8, and y, E(t) is a Lyapunov function and x* = (0,0) is globally 
asymptotically stable in this case. 


1. Construct a suitable Lyapunov function to determine the stability of the origin for the following systems of equations. 


1 = —x? + 32%, &y = —2x1 — 32123, 
(a) : 3 (e) . 2 
x Lo = —3xr2 — 2xj{ x2. 


2> —-%129. 
Ly = —ax} — 103, 1 = —3r3 + 103, 
(b) : 2 3 (f) . 2 3 
Lq = —2xj{ x2 — Xd. 2g = —Lj{L2 — £5. 
é, = af +2123, t= ya, 
(c) 4. 2 (gs) 4. 
2 = 2{%2+ 72. y=—-“r—yY. 
é, = 2a$ +2123, Z = 2cy— 2x", 
(d) 7 2 3 (h) 2) 5 
te = —2xj{%2+ £5. y= x? -y 


2. With some choice of a constant a, show that V(«,y) = $y? + 4 (azx4 yy)? — yw? cosa + yw? is a strong Lyapunov 
function for the damped pendulum equation (9.5.9), with y 4 1. Hint: Use Maclaurin series approximation for sin x 
and polar coordinates. 


3. Show that the system ¢ = y, y = —2x® has a stable critical point at the origin which is not asymptotically stable. 
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The following SIRS epidemic model with disease-induced death and standard incidence 


S=A-pS— BI S/N +56R, 

I= 6IS/N-(y+p+0)], 

R= I -(ut+4)R, 
is similar to model (9.5.5); the region 2 = {(5,7,R)€ RE : S+I+R< A/} is a positively invariant set for the above 
system of equations. Show that the disease-free equilibrium Po(A/, 0,0) is globally stable in Q when 8 < uwta+y. 


One of the milestones for the current renaissance in the field of neural networks was the associative model proposed 
by the American physicist John Hopfield in 1982. Hopfield considered a simplified model in which each neuron is 
represented by a linear circuit consisting of a resistor and capacitor, and is connected to the other neurons via nonlinear 
sigmoidal activation functions. An example of such a model gives the following system of ordinary differential equations: 


t=a(y)-2, y=ul(r)-y, 
where 
z1(z) = k arctan(az), zo(y) = k arctan(ay), 


with some positive constants k and a. To analyze the stability of this system, Hopfield suggested using a Lyapunov 
function of the form 
V(x, y) = —21(2) z2(y) — 2 [In (cos z1(a)) + In (cos z1(x))]. 


By plotting V(a,y), with k = 1.4 and a = 2, verify that the Lyapunov function is positive definite in a neighborhood 
of the origin. 


For the system of equations 
: . 3 
é=xn-ay-22°, Y= GY gy Y: 
modeling competition of species, find all equilibrium points and investigate their stabilities using the appropriate 
Lyapunov function of the form V(a, y) = az? + bay +cy?. 


Use Lyapunov’s direct method to determine the stability of the zero solution for the given equations of second order. 
(a) @+3a°%+42° =0; (b) &+(cosx)#+sinaz = 0. 
Show that a system x = f(x) is at the same time a Hamiltonian system and a gradient system if and only if the 


Hamiltonian H is a harmonic function (which satisfies the Laplace equation: V?H = 0). 


Show that the one-dimensional gradient system with G(a) = 2° sin(1/a) has « = 0 as a stable equilibrium, but « = 0 
is not a local minimum of G(z). 


Find a Hamiltonian for the system 


2 5 a cf 2 2 
t1=Y91, C2=Y2., Yr=—-T1—2%1%2, Yo=wQg—-L2- 7}. 


Find all critical points and determine their stability for the system with the Hamiltonian H(z, y) = 4 yt 4 xo? — + x. 


Find a Hamiltonian for the following systems of equations. 


& = —2y + 327, & = 10x +y+4+ 3a? —2 + 4y’, 
(a) : (c) : : 
a Cc 

y = —2x — bry. y = 2x — 10y — 3x” — Gary t+ y. 
(b) “z=5 sing cosy, (d) & = —a2 + 4y + 3y’, 

y =—5 cosa siny. Y= y — 2x — 2y’. 


Show that the function V(x, y) = 3a? + Ty? is a strong Lyapunov function for the linear system 
£=-“2+7y, y= —3n— y. 


Also show that V cannot generate a gradient system. 


9.6. Periodic Solutions 529 


9.6 Periodic Solutions 


The linearization procedure discussed in §9.2 tells us nothing about the solution to an autonomous vector equation 
except in a neighborhood of certain types of isolated singular points. Local properties, that is, properties holding 
in the neighborhood of points, can be analyzed by stability analysis or the power series method. However, in many 
practical problems, one needs information about the global behavior of solutions. These solutions may occur in 
mechanical, electrical, or other nonconservative systems in which some external source of energy compensates for 
the energy dissipated. 

The periodic phenomena occur in many applications; therefore, their modeling and analysis is an important task. 
We start with a conservative system modeled by the initial value problem 


G+fYy)=90,  y0)=yo, 90) = vo- (9.6.1) 


If the function f(y) is similar to an odd function, then the following theorem (see Problem 5 for proof) guarantees 
the existence of a periodic solution to the IVP (9.6.1). 


Theorem 9.6: Suppose that the function f(y) in Eq. (9.6.1) is continuous and y f(y) is positive for |y| < a, 
where a is some positive number. For initial points (yo, vo) sufficiently close to the origin, the solutions to the 
IVP (9.6.1) are nonintersecting closed loops traced clockwise around (0, 0). 


Example 9.6.1: (Example 9.2.6 revisited) The Duffing equation (9.2.9) on page 497 gives an example of 
conservative system (9.6.1) with f(y) = w? (y— y°/6). Taking a = V6, we sce that conditions of Theorem 9.6 are 
fulfilled, and the Duffing equation has a periodic solution, which is confirmed by Fig. 9.23 on page 518. 


While important, Eq. (9.6.1) is of limited application. We consider a more general second order differential 
equation or its equivalent system named after the French physicist Alfred-Marie Liénard®* 


ae (9.6.2) 


t+ fle)t+g(a)=0 or 


where F'(z) is an antiderivative of f(x) (so F’(x) = f(«)). The following theorem [31], proved by N. Levinson and 
O.K. Smith in 1942, guarantees an existence of a unique periodic solution to the equation (9.6.2). 


Theorem 9.7: [Levinson—Smith] Let F(a) and G(a) be antiderivatives of functions f(#) and g(x), respectively, 
in the Liénard equation (9.6.2) subject to F'(0) = G(0) = 0. The Liénard equation has a unique periodic solution 
whenever all of the following conditions hold: 


1. f(z) is even and continuous, and g(x) is an odd continuous function with g(a) > 0 for x > 0. 
2. F(a) <0 when 0 < # <aand F(a) > 0 when x > a, for some a > 0. 


3. F(a) + +00 as + +00, monotonically for x >a. 


4. G(a) 4 +00 as H+ 00. 


Next, we discuss the existence of periodic solutions of planar autonomous vector equations (9.1.6) 
x = f(x) 


that satisfy the condition x(t) = x(t+T) for some positive constant T, called the period. Any equilibrium solution is 
a periodic function with arbitrary period. In general, it is difficult to discover whether a given system of differential 
equations does or does not have periodic solutions—it is still an active area of mathematical research. Even if one 
can prove the existence of such a periodic solution, it is almost impossible, except in some exceptional cases, to find 
that solution explicitly. 


88Liénard (1869-1958) was a professor at the Ecole des Mines de Saint-Etienne and during 1908-1911 he was professor of electrical 
engineering at the Ecole des Mines de Paris. In World War I he served in the French Army. He is most well known for his invention of 
the Liénard—Wiechert electromagnetic potentials. 


530 Chapter 9. Qualitative Theory of Differential Equations 


x'=—4eyt+x4(1-x%2 —4y’2, y'=x+y*(1—x%2 —4*y%2) 


Lo * oe ee _ = Fae. ul 
\\) WAZ 
OD 

\ \ 


UD) YN) SSS | 
MEN (COR 
SSS ZiINS 
MINN 
Pee os AU Ne 


L L L L 
=L5 =1.0 -0.5 0.0 0.5 1.0 15 


o 


=15 -1.0 -0.5 0.0 os Lo 5 2.0 


Figure 9.27: Phase portrait in Example 9.6.2, plot- 
Figure 9.26: Nullclines in Example 9.6.2. ted with Mathematica. 


Definition 9.9: A limit cycle C is an isolated closed trajectory having the property that all other orbits in its 
neighborhood are spirals winding themselves onto C' for t + +co (a stable limit cycle) or t + —oo (an unstable 


limit cycle). All trajectories approach the limit cycle independent of the choice of initial conditions. 


Semistable limit cycles®? displaying a combination of both stable and unstable behaviors can also occur. We 
start with an illustrative example. 


Example 9.6.2: Consider a family of systems of ordinary differential equations 
& = —4ky + x (1-27 —4y’), y=ket+y (1-2? —4y’), (9.6.3) 
depending on a positive parameter k. By plotting z- and y-nullclines 4ky = x (1 — x? —4y?) and ka+y (1 — 2? — 4y?) 


= 0 (see Fig. 9.26), it is clear that the system has only one critical point—the origin (unstable). Equation (9.6.3) 
can be solved explicitly by introducing polar coordinates: x = 2r cos@, y =r sin@. Since tand = 2y/x, we get 


2y - d 2y LY — YX 
=_ =_— — = —_—_ A 
é = arctan ( r ) => 0 a (arctan ( 7 )) 2 eae (9.6.4) 


Then substituting the expressions given in Eq. (9.6.3) for and y, we obtain 


Solving this separable equation, we get 
0 = 2kt + @, where 69 = 6(0). 


This formula tells us that trajectories revolve clockwise around the origin if k < 0 and counterclockwise if k > 0. 
Since 4r? = x? + 4y?, we differentiate both sides to obtain 


ArPa=acé+4yy => f=r(1—4r’). 


Separating variables and integrating, we get 


r(t)=+ AWE 


~ 4/1 + 472(0) (e2 — 1) 
Thus, the typical solutions of Eq. (9.6.3) may be expressed in the form 
x(t) = 2r(t) cos (kt + 00) , y(t) = r(t) sin (kt + 0). 


89This term was introduced by Henri Poincaré. 
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A family of equations (9.6.3) gives an unusual example of systems that can be solved explicitly. Consider the modified 
system 


& = —dy (1 — 2? — 4y?)* + 2 (1 — 2? — 4y?)® — 4y7, 

: 2 3 

y =a (1-2? —4y’) +y (1-2? —4y’) + ay’ 
with slopes obtained from the original ones upon multiplication by (1 — a? — Ay?)* and addition of the vector 
y?(—4y, x). By plotting a phase portrait for the modified system, we cannot see any geometric difference from 
Fig. 9.27. However, in addition to a common stationary point at the origin, the modified system has equilibrium 


points at (1,0) and (—1,0) that aren’t shared by the simpler system. Since these additional critical points lie on the 
ellipse x? + 4y? = 1, this ellipse is not a limit cycle as in Eq. (9.6.3). 


It is often useful to be able to rule out the existence of a limit cycle in a region of the plane. We present a theorem, 
referred to as Bendixson’s first theorem or more commonly as Bendixson’s negative criterion. This theorem as well 
as its generalization, proved? by Dulac, sometimes enables us to establish the nonexistence of limit cycles for the 
basic system of autonomous equations 


&= f(z,y), y = g(z,y). (9.6.5) 


Theorem 9.8: [Bendixson’s negative criterion] If the expression 
ed (9.6.6) 


does not change its sign within a simply connected domain D of the phase plane, no periodic motions of 
Eq. (9.6.5) can exist in that domain. 


This negative criterion is a straightforward consequence of Green’s theorem in the plane. A simply connected 
domain is a region without holes. In other words, it is a region having the property that any closed curve or surface 
lying in the region can be shrunk continuously to a point without going outside of the region. Note that if fr + gy 
changes sign in the domain, then no conclusion can be made. 


Example 9.6.3: Consider a system (9.6.5) with 


f(z,y) =ytau(a?+4y?-1), g(x,y) =—-a+y (x? +4y? - 1), 
" af a 1 
OF 4g OE 2 2 
an 1(« + 4y 5) 


Since this expression does not change sign for r < 1/\/2, where r = \/x? + 4y?, no closed trajectory can exist inside 
the ellipse 2? + 4y? = 1. The quantity af + ae also does not change sign outside the radial distance r = 1/2. 
However, we cannot apply Bendixson’s negative criterion because the domain is no longer simply connected—the 
region r < 1/./2 must be excluded to keep the sign of fy + gy unchanged. It can be verified that there exists a limit 


cycle at r = 1 but Bendixson’s theorem should not be applied. 


Example 9.6.4: (Pendulum) Consider the nonhomogeneous pendulum differential equation 
6+-70 +w* sind =H, 


where 7 > 0 and y are constants, and @ is an angular variable. This differential equation is a model for an unbalanced 
rotor or pendulum with viscous damping @ and external torque yw. The equivalent system of the first order equations 


t=6=y, y = p—w* sind — yy (9.6.7) 


has no periodic solutions (see Fig. 9.29), and has no rest points. Since the expression f, +g, = —y < 0, where f = y 
and g = w— yy —w’sin@, does not change its sign, the system has no periodic solution according to Bendixson’s 
theorem. 


°0Henri Dulac (1870-1955) was a French mathematician. He proved Theorem 9.9 in 1933. 
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Figure 9.28: Phase portrait and unstable limit cycle Figure 9.29: Phase portrait in Example 9.6.4, plot- 
in Example 9.6.3. ted with Mathematica. 


Theorem 9.9: [Dulac’s Criterion] Let 2 be a simply connected region of the phase plane. If there exists a 
continuously differentiable function (x, y) such that 


div (6) =F [6(ey) Faw) + 5 llewalew)l, £=(f9) 


is of constant sign in 2 and is not identically zero on any subregion of Q, then the dynamical system (9.6.5) has 
no closed orbits wholly contained in 2. 


Dulac’s criterion suffers the same problem as when finding Lyapunov functions in that it is often difficult to find 
a suitable function ¢(a,y). The case ¢(x,y) = 1 represents the negative criterion of Bendixson. A function ¢(2, y) 
that satisfies the condition of the above theorem is called the Dulac function to the system. The existence of the 
Dulac function can be used to estimate the number of limit cycles of system (9.6.5) in some regions. Namely, system 
(9.6.5) has at most p — 1 limit cycles in a p-connected region 2 if a Dulac function exists. 


Example 9.6.5: Show that the system 


b=y, yo-e—-yte’ty? 


has no closed orbits anywhere. 
Solution. First, we apply Bendixson’s negative criterion: 


Oy oO Lp! aj?) — | 
a oy yar by?) = 1+ 2y. 


Although this shows that there is no closed orbit contained in either half-plane y < 4, or y > 4, it does not rule out 
the existence of a closed orbit in the whole plane since there may be such an orbit which crosses the line y = 4 


3: 
Now let us try the Dulac function ¢(2, y) = e®*. 
F fe] +S fet (-e-y to? +9)] =e (a+ 2)y-I 
Ox Oy ; 
Choosing a = —2 reduces the expression to —e~?”, which is negative everywhere. Hence, there are no closed orbits. 


Another theorem due to Poincaré and Bendixson®! gives the necessary and sufficient conditions for the existence 
of a limit cycle. Unfortunately, the theorem is often difficult to apply because it requires a preliminary knowledge 
°lTvar Otto Bendixson (1861-1935) was a Swedish mathematician. Being the Professor of Pure Mathematics at the Royal Institute 


of Technology, he was intrigued by the complicated behavior of the integral curves in the neighborhood of singular points. Bendixson 
substantially improved an earlier result of Poincaré in 1901. 
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Figure 9.30: Phase portrait in Example 9.6.5, plot- 


Figure 9.31: Phase portrait in Example 9.6.7, plot- 
ted with Mathematica. 


ted with Mathematica. 


of the nature of orbits. The Poincaré—Bendixson theorem involves the concept of a half-trajectory, which is the set 


of points traced by an orbit starting at a particular position, usually identified by t = 0. This representative point 
divides the orbit into two half-trajectories (for t > 0 and for t < 0). 


Theorem 9.10: [Poincaré—Bendixson] Let the rate functions f(x,y) and g(a, y) in Eq. (9.6.5) be continuously 
differentiable, and let (x(t), y(t)) be the parametric equation of a half-trajectory [T which remains inside the 


finite domain 2 for t + +00 without approaching any singularity. Then only two cases are possible: either I is 
itself a closed trajectory or [ approaches such a trajectory. 


Example 9.6.6: (Example 9.6.2 revisited) Recall that the system (9.6.3) can be rewritten in polar form as 
p=r(1—4r?), 6=2k>0. 


Then dr/d@ = r (1 — 4r?) /(2k) with dr/d@ > 0 for r < 1/2 and dr/d@ > 0 for r > 1/2. If we choose an annular 
domain (donut shaped) R of inner radius r = 1/4 and outer radius R = 1, the trajectories must cross the inner 


ellipse from the region r < 1/4 to the region r > 1/4 because dr/d@ > 0. On the other hand, the orbits must cross 
the outer ellipse r = 1 towards the region r < 1 since dr/d6 < 0. 


Example 9.6.7: Show that the system of equations 


C=LX-y-@wy’ —-xw-awy’, yoetyto® —xy-y? (9.6.8) 
has a limit cycle. 


Solution. We are going to find the Lyapunov function V (a, y) and a positive function f(x,y) > 0 such that the 
given system (9.6.8) has the form: 


IVI, HSB: (9.6.9) 
We look for functions of the form f = f(r?) and V = g(r?) + h(x), where r? = x? + y? and f, g, and h are 
polynomials. Substituting these functions into Eq. (9.6.8), we get 
=yaeglar pe dar) =—tag lef fa, 
+a? +y(1—r*) =209' +h’ — 2yfd’. 


Since y cannot be written as a polynomial in r? and z, the terms multiplying y above must match. Thus: g’ = 1/2 
and f =r? —1. The equations then become 


(a +2°)\(1—r?) =(x+h’')(1—17) and Can, 
This yields the final answer: 


1 1 
f=r-1, V= ea QP aay"). (9.6.10) 
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Now, as a consequence of the form of the equations in (9.6.9), the derivative of V along the solutions is 


. d 
v= GV my) =4V,+yV, =—(Ve+V,). 
Clearly, in any region where V > 0, the function V is a Lyapunov function, and the level curves of V provide trapping 
boundaries: the solutions cross these curves in only one direction (V decreasing). So the conditions of Theorem 9.10 
are fulfilled. 

Next we find and classify the critical points for Eq. (9.6.8). From Eq. (9.6.9), we find that at a critical point 


Vy Vel Jl} 0 
Ve Vale) 
Hence the determinant of the coefficient matrix must vanish, and V, = V, = 0. From Eq. (9.6.10), we conclude that 
there is only one critical point: the origin, which is an unstable spiral. 
Note that the level curves of V are a set of nested closed curves, all containing the origin. The level curve 


V(a,y) = V* > 0 contains the disk of radius r2 = /1+V* — 1 centered at the origin. Thus, the function f is 
positive on any level curve where V* > 0. 


(a) (b) 


Figure 9.32: Example 9.6.8 (a) phase trajectories of Rayleigh’s equation, both approaching the limit cycle; (b) a 
solution of Rayleigh’s equation, plotted with Maple. 


Example 9.6.8: The British mathematical physicist Lord Rayleigh (John William Strutt, 1842-1919) introduced 
in 1877 an equation of the form 
mé + kx = a& — b(«)° 


(with nonlinear velocity damping) to model the oscillations of a clarinet reed. With y = «, we get the autonomous 
system 


y=, y = (—kax + ay — by®) /m. 
Next, we show the existence of a limit cycle by plotting trajectories using the following Maple script: 
with(DEtools): with(plots): m:=2: k:=1: a:=1: b:=1: 
deqi:= diff(x(t),t)=y; deq2:= m*diff (y(t) ,t)=-k*xtk*y-b*y73 ; 


DEplot ([deq1,deq2] , [x(t) ,y(t)],t=0..75,x=-3..3,y=-3..3, 
[[x(0)=0.01, y(0)=0]], stepsize=0.1,linecolor=blue, arrows=none) ; 
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Example 9.6.9: (Example 9.5.3 revisited) Consider the van der Pol equation 
jt+e(y—-l)yty=0. (9.6.11) 
Multiplying it by the Dulac function”? ¢(a, y) = (a? + y? — 1)~1/?, we obtain 
a a € (a2 —1)” 
5~ y P(x, y) + = [€(1 — 2*)y — 2] $(x,y) = -—~_—_,,. 
ar [ | (x? 4 y? = ie 


Since this expression does not change sign in any domain not containing the unit circle 2? + y? = 1, the van der Pol 
equation has at most one limit cycle (similar to Example 9.6.6). 


Figure 9.33: Example 9.6.9, phase portraits of the van der Pol equation for (a) « = 2 and (b) for « = 0.5, plotted 
with Mathematica. 


Figure 9.34: Example 9.6.9, solutions of the van der Pol equation for (a) « = 2 and (b) for € = 0.5, plotted with 
Mathematica. 


9.6.1 Equations with Periodic Coefficients 


A fundamental engineering problem is to determine the response of a physical system to an applied force. Consider 
a Duffing oscillator depending on the amplitude F' chosen for the forcing periodic term 


&+ 2y4¢ + ax + Bx® = F cos(wt), 


with damping coefficient y and the driving frequency w. In a typical introductory classical mechanics or electrical 
circuit course, students first solve a linear approximation (when 6 = 0) by assuming that the transient solution has 
died away and seek the steady state periodic solutions vibrating at the same frequency w as that of the oscillatory 
driving force. A nonlinear system may respond in many additional ways that are not possible for the linear system. 
For instance, a solution may “blow up” or exhibit chaotic behavior. Since we are looking for a periodic response, it 
can be shown that if a Duffing oscillator has a periodic solution, its period is an integer multiple of w. 


°? This function was discovered by Leonid Cherkas (1937-2011), a famous Russian mathematician. 
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Figure 9.35: Example 9.6.10 (a) a solution trajectory, and (b) input (in blue) and output, plotted with Mathematica. 


In 1950, Jose Luis Massera®* showed an example of a linear system of differential equations 


= f(t)e+[f® —wly—f®z2+ fH, 
g(t) tw] x + g(t) y — g(t) z+ g(t), 
h(t) + w] w+ [A(t) — w] y — h(t) z+ h(t), 


where f, g, and h are continuous T-periodic functions, that admits a periodic solution 
x=acos(wt+b), y=asin(wt+b), z=alcos(wt + b) + sin(wt + b) + 1] 


(here a and 0 are arbitrary constants) with period T; = 27/w, which, generally speaking, is not a rational multiple 
of T. This example illustrates a counter intuitive phenomenon that multi-dimensional systems subject to a periodic 
input may have a periodic response with different periods. This is something we never observe in one- or two- 
dimensional ordinary differential equations: if there is a bounded solution of the differential equation and the 
solutions are continuous for all following times, then there is a T-periodic solution. Systems of equations that admit 
solutions, called exceptional solutions, with periods that are not rational multiples of the periodic input are called 
strongly irregular, and oscillations modeled by this system are called asynchronous. In particular, all constant 
solutions are exceptional solutions. 

It was observed in practice that there exist strongly irregular mechanical and electrical systems. We present one 
such example. 


Example 9.6.10: Consider two ideal pendula of equal masses m and the same length @ connected by an elastic 
string (see figure on the opening page 341 of Chapter 6 and Example 6.1.4 on page 349). Upon linearization of 
Eq. (6.1.8), page 349, for small angles of inclinations 6; and 62, the two pendula system can be modeled by two 
coupled equations: 

by +070, — a?m-1é-7k(t) (03% — On) = 0 (7 =¢/ k=1,2), 


where g is acceleration due to gravity and a is the distance from the pivot to the point where the spring is attached. 
Here the elastic coefficient k(£) is assumed to be a T-periodic function. By plotting solutions, Fig. 9.35, we see that 
the system has a periodic response with an asynchronous period compared with k(t). 


1. When solving planar differential equations, it is helpful to use polar coordinates x = ar cos@, y = br sin@, where a 
and b are some positive numbers. Using the chain rule, show that 


rh= Set Sy, abr’@ =ay— yt. 

2. As an example of an analytically derived limit cycle, consider the set of coupled nonlinear ordinary differential equations 
rz 2 2 ae y 2 2 

(l-2°-4y°), g=e+—=—(1-2° -4y’). (9.6.12) 


/ 72 + 4y? / x2 + 4y? 


To solve this system, introduce the plane polar coordinates x = 2rcos@ and y = r sin@. Then, after multiplying the first 


&=—4y+ 


1d 
and the second equations by x and 4y, respectively, and adding, we obtain 5 a (a? + dy”) = (a? + Ay?) */? (1 ae dy”), 


which is an autonomous equation with respect to r: 2 = 1—4r?. Find a solution of Eq. (9.6.12). 


93 José Luis Massera (1915-2002) was an Uruguayan mathematician who researched the stability of differential equations. 
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. When parameter € in the van der Pol equation %— € (1 cig | &+ax = 0 is large (€ > 1), it is convenient to use Liénard’s 


transformation y = x — 23/3 — @/e. This allows us to rewrite it as the system of first order equations 


3 
bae(e- Sy), g2e, (9.6.13) 


€ 


which can be regarded as a special case of the FitzHugh—-Nagumo model (also known as Bonhoeffer—van der Pol model). 
By plotting phase portraits for different values of the parameter ¢, observe that the system has a stable limit cycle. 


. Show that the following nonlinear systems have no periodic solutions by applying the Bendixson negative criterion 9.8 


(page 531). 
(a) #=2y?+y° — 32, y = 5a? — 4y; (b) &=4y—22, y = 3a —4y’; 
(c) gay? — eet y= 08 — yd +0", (d) #=4y— 20°, y = —5a — 3y°; 
(e) = 22° —32°%y+y", y = 2y? — 32y?+2°; (f) ¢=y—2, y= 2° —2y’. 


5. Prove Theorem 9.6 on page 529. 


6. The pursuit problem of A.S. Hathaway (1920). A playful dog initially at the center O (origin) of a large pond 


10. 


swims straight for a duck, which is swimming at constant speed in a circle of radius a centered on O and taunting the 
dog. The dog is swimming k times as fast as the duck. 


(a) Show that the path of the pursuing dog is determined by the coupled nonlinear equations 
d¢/dé = (a cos ¢)/p — 1, dp/dé = a sin¢ — ka, 


where @ is the polar angle of the duck’s position, p is the distance between the instantaneous positions of the dog 
and the duck, and ¢ is the angle formed by tangent lines to the paths of the duck (circle) and the dog (pursuit). 


(b) The duck is safe for any k < 1! For k = 3/4, numerically show that the dog never reaches the duck, but instead 
traces out a path which asymptotically approaches a circle of radius 3a/4 about the origin. By choosing other 
starting positions for the dog, show by plotting the pursuit trajectories that the circle is a stable limit cycle. Take 
a = 1 and start the duck out at c = 1, y = 0, swimming counterclockwise. The value of k dictates the size of the 
limit cycle. 


. Multiple nested limit cycles. Given the pair of equations depending on a positive parameter k: 


b= Ry tet (VPFRP), g=etut (VETRP), 


with f (Vr + iy?) = kr sin(2/kr), where k?r? = 2? + k?y?. Show analytically that all of the ellipses x? + k?y? = 


1/(nr)?, n = 1,2,3,..., are stable limit cycles. 


. By carrying out an exact analytic solution, show that the system of equations 


& = —9y (x? + 9y? +1) +2 (274+9y?-1), y=ar(x? + 9y? +1) +y (2? + 9y? -1) 


has a limit cycle and determine whether it is stable, unstable, or semistable. Confirm your analysis by plotting 
representative trajectories in the phase plane. 


. Wasley Krogdahl (1919-2009) from the University of Kentucky proposed in 1955 a model to explain the pulsation of 


variable stars of the Cepheid type. He modified van der Pol’s equation by adding some extra terms: 


z b(1 x”) & a(t st) atte ax” + Laa® =0, 


where a and 0 are positive constants. Taking a = 1/12, b = 1, plot the phase portrait and show that a stable limit cycle 
exists. Also examine how the shape of the limit cycle changes with deferring values of a and b. 


Consider the Duffing equation with variable damping 


&— (169-27) &-2.252+2° =0. 


(a) Apply Bendixson’s negative criterion to this problem and state what conclusion you would come to regarding the 
existence of a limit cycle. 


(b) Find and identify the stationary points of this system. Confirm the nature of two of these stationary points by 
producing a phase plane portrait for the initial values x = 0.75, ¢ = 0 and x = —0.75, ¢ = 0. Take interval 
t = 0..30. 
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Summary for Chapter 9 


(c) With the same time interval as in (b), let the initial phase plane coordinates be x = 0.1, ¢ = 0. What conclusion 
can you draw from this numerical result? How do you reconcile your answer with (a)? 


Find the equilibrium points and the limit cycles of the system 7 = r cos(r/2), 0= 1, written in polar coordinates. 


By plotting the phase portrait, show that the system 
&=2a-—y—2", g=a2+25y-y° 


has a unique globally attracting limit cycle. Also, prove that the origin is an unstable spiral point. 


Zhukovskii’s model of a glider. Imagine a glider operating in a vertical plane. Let v be the speed of glider and 
0 be the angle flight path makes with the horizontal. In the absence of drag (friction), the dimensionless equations of 
motion are 

6 =v— (cos8) /v, v= —sin0. 


By plotting the phase portrait for this system, do you observe periodic solutions? How many different families of 
periodic solutions exist? Find an equation for the separatrix. 


The Roéssler system of three nonlinear ordinary differential equations, 


f=-y-2z, y=ut+ay, 2=b+2(x-0), 


was originally studied by a German biochemist Otto Rossler (born in 1940) in the late 1970s. Consider the Rossler 
system with a = 6 = 0.2. Confirm that three-dimensional periodic solutions exist for c = 2.5, c = 3.5, and c = 4.5. 
Identify the periodicity in each case. Are these periodic solutions limit cycles? 


The following model due to Réssler produces a three-dimensional limit cycle for « = 0.1 and a = 1/4, b = 1/16, which 
is a distortion in the z direction of the limit cycle, which is the ellipse ax? + by? = 1 rotated by 1/4. 


ét=a(1 ax by’) ts Pao ax by’) +2, g=(1—27)\(n@+2—-1)-ez. 


Confirm that this model yields a three-dimensional limit cycle of the indicated shape. 


Prove that the system « = z(a+ ba+cy), y= y(at+ Bx +-yy) has no limit cycles. Hint: Find a Dulac function of the 
form «”y*. This statement is attributed to the Russian mathematician Nikolai N. Bautin (1908-1993). 


Prove that the following Dirichlet boundary value problem has a solution. 


(A. K. Demenchuk) Verify that the equation 
#4+(sint—1)#+h'é+k? (sint—1)e¢=—0, kAOandk¥1, 


has a two-parametric family of asynchronous periodic solutions x(t) = ci cos kt + cg sin kt. 


(A. K. Demenchuk) Verify that the system 


“£=y cost+2z—cost+2, y=sint(y—1), 2=sin2t(y—1)—-2, 


has a two-parametric family of asynchronous periodic solutions a(t) = ci V2 sin V2t 
+ coV2 cos V2t, y(t) = 1, z=c1 cos J2t — co sin V2t — 1. 


Summary for Chapter 9 


I. 


3. 


A solution x(t) of a system of differential equations x = f (t,x) can be interpreted as a curve in n-dimensional space. 
It is called an integral curve, a trajectory, or an orbit of the system. When the variable t is associated with time, 
we can call a solution x(t) the state of the system at time t. 


We refer to a constant solution x(t) = x* of a system 
x(t) =Ax(t), Aisannxn matrix, 


as an equilibrium if x = dx(t)/dt = 0. Such a constant solution is also called a critical or stationary point of the 
system. 


Critical points can be stable, asymptotically stable or unstable (see Definitions 9.3 and 9.4) 


Review Questions for Chapter 9 539 


4. 


10. 


11. 


12. 


13. 


14. 


A picture that shows its critical points together with collection of typical solution curves in the xy-plane is called a 
phase portrait or phase diagram. 


A linearization of an autonomous vector equation x = f(x) near an isolated critical point x = x* is a linear system 
x = Jx, with the Jacobian matrix 


J= pray), = 2h 


3 Oxy | ee’ 45.9 = 1,232500: 
The Grobman-Hartman theorem states that as long as Df(x*) is hyperbolic, meaning that none of its eigenvalues are 
purely imaginary, the solutions of x = f(x) may be mapped to solutions of x = Jx by a 1-to-1 and continuous function. 


There are only two situations in which the long-term behavior of solutions near a critical point of the nonlinear system 
and its linearization can differ. One occurs when the equilibrium solution of the linearized system is a center. The 
other is when the linearized system has zero eigenvalue. 


A remarkable variety of population models are known and used to describe specific interactions between species. We 
mention the two most popular models: the competition one 


. 2 
1 = 71241 — Ary biz122, 
om : (9.3.1) 
U2 = 72%2 — AQXa — ber122, 
and predator-prey equations (or Lotka-Volterra model) 
b=a2(r—by), y=y(-wt+ 8x). (9.3.8) 


A system of equations x = f(x) is called the Hamiltonian if there exists a real-valued function that is constant along 
any solution of the system. 


Theorem 9.3 (Lyapunov) If x* is an isolated critical point for the differential equation x = f(x) and there exists a 
Lyapunov function V for the system at x*, then x” is stable. If, in addition, V is a strong Lyapunov function, then x* 
is asymptotically stable. 


Theorem 9.4 (Lyapunov) Let x* be an isolated critical point of the autonomous system x = f(x). Suppose that there 
exists a continuously differentiable function V(x) such that V(x*) = 0 and that in every neighborhood of x* there is 
at least one point at which V is positive. If there exists a region R 3 x* such that the derivative of V with respect to 
the system V(x) = VV(x) - f(x) is positive definite on R, then x = x* is an unstable equilibrium solution. 


A limit cycle C is an isolated closed trajectory having the property that all other orbits in its neighborhood are spirals 
winding themselves onto C' for t + +o (a stable limit cycle) or t + —oo (an unstable limit cycle). 


Theorem 9.8 (Bendixson’s negative criterion) If the expression 
#0 9.6.6 
y ( ) 


does not change its sign within a simply connected domain 2 of the phase plane, no periodic motions can exist in Q. 


Theorem 9.10 (Poincaré—Bendixson) Let the rate functions f(x,y) and g(x,y) in Eq. (9.6.5) be continuously dif 
ferentiable, and let ¢(t) = (x(t), y(t)) be the parametric equation of a half-trajectory which remains inside the finite 
domain 2 for t ++ +00 without approaching any singularity. Then only two cases are possible: either I is itself a closed 
trajectory or I’ approaches such a trajectory. 


Review Questions for Chapter 9 


Section 9.2 of Chapter 9 (Review) 


1. 


Show that the system ¢ = e* TY 


system near this point. 


y, Yy = xy —~ has only one critical point (—1,1). Find the linearization of the 


. Each system of nonlinear differential equations has a single stationary point. Apply Theorem 9.1 to classify this critical 


point as to type and stability. 


(ej eee (b) ee —f, 


y =a — 16y?. youty’—4y+3. 


540 Review Questions 


L=2—-y —7, Lr=2+y, 
(c) 4.» (f) 9 2 
y=u —Ay. y = dy— 3ay+y. 


(a) oe + 2y — 2Qay+y’, (g) foes 


y= y? + 2Qy + ay. y = 307 +y?-1. 


& = 3x4 6y +27 + 32y, t=x-2y, 
(e) - 9 2 (h) _ 2 
y=syty tayty’. y=r-yty —2. 
3. Consider the three systems 
x = 2sin 2x + 3y, z= 4x+3cosy— 3, «= 4a — 3y, 
(a) 2, a 2. (os 
y=4a+y*; y=Asinz+y"*; y = 2sin 2a. 


All three have an equilibrium point at (0,0). Which two systems have phase portrait with the same “local picture” 
near the origin? 
In Problems 4 through 6, each system depends on the parameter e. In each exercise, 
(a) find all critical points; 


(b) determine all values of € at which a bifurcation occurs. 
4. ¢=a(e-—y), y= y(2+ 32). 5. g=ex—y’, y=lta—2y. 


6. The van der Pol equation % + € (a? - 1) z+2=0. 
Section 9.3 of Chapter 9 (Review) 
Systems of equations in Problems 1 and 2 can be interpreted as describing the interaction of two species with populations 
a(t) and y(t). In each of these problems, perform the following steps. 
(a) Find the critical points. 
(b) For each stationary point find the corresponding linearization. Then find the eigenvalues of the linear system; classify 
each critical point as to type, and determine whether it is asymptotically stable or unstable. 
(c) Plot the phase portrait of the nonlinear system. 
(d) Determine the limiting behavior of x(t) and y(t) as t > oo, and interpret the results in terms of the populations of the 
two species. 


1. Equations modeling competitions of species: 


ay J2=20-F- 4), ieee 
© [prateg oa ©) W=y-4- 4); 

é=a(1-$-4), , fae eee 
©) (yosuas ©) \ 5-0-2 - 4); 
oy (2=20-$-9), é om ee 
oa aay @ y=y(1- 2-4). 


2. Predator-prey vector equations: 


ee p= 268- 
ie? x(l—<Zy), ay" 3 ( y); 
y = 4y (a — 3); y=4 

3. In each exercise from Problem 1, determine the response of the system to constant-yield harvesting of the x-species. 
4. Bees and flowering plants are famous for benefitting each other. In the following model 

paa(l+5-3))  d=v(Q2-$+4) 

Sige ff eat =e (92 a 
oe ( eg gt Oe Be tas 


determine all equilibrium solutions. 
Section 9.4 of Chapter 9 (Review) 


1. Show that the Hamiltonian system & = 2y en ty y= —24 e® +¥” has a stable equilibrium at the origin. Then show 
that a linearization technique from §9.2 fails to lead to this conclusion. 


2. In each system, find the potential function II(a) and the energy function E(x,v). Select E so that E(0,0) = 0. 


Review Questions for Chapter 9 


(a) #+ 327 —6r+5=0; 
(b) +2 cosx =0; 
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(d) #+2¢% -—a°+2° =0; 
(e) #+2x/(1+27)? =0; 


(c) ¢+2°/(1+2)? =0; (f) #+2re" —1=0. 


3. Show that a Hamiltonian system has no attractors and no repellers. 
4. Find Hamiltonians of the following systems. 
&=ar(y+1), 
(a) 9. (c) 
y= —-y(+y/2); 
&=ax(xe¥ —cosy), 


& = sec x (|x| < 7/2), 
y=—ysece tanz; 


& = 2ay — 32°, 


(b) (d) 


y =siny — 2x e"; y = 907 y — y? — 42. 


Section 9.5 of Chapter 9 (Review) 


1. Construct a suitable Lyapunov function of the form V(z,y) = ax” + by”, where a and b are positive constants to be 
determined. Then identify the stability of the critical point at the origin. 


(a) ¢=y—-2?,y=-5e?°-ys (h) ¢=2° 4297y? — 32°, y= —304-y’; 
(b) #=6ay*— 2°, g = —9y°; Gi) #=5y?— 29, y= —Bay’; 
(c) #=3y? —2?, y= —32y’; (Gj) #=5ay? — 323, y = —4a?y — Ty’; 
(d) #=2ay?-2*, g=-y*; (k) #=2°-y*, y= ay? + 4a7y + 24°; 
(e) = -2e°, y= 2Qx?y—y’: (1) #=32? — 2y3, y = Qary? + 6x y; 
(f) g=2° Be — 32°, Gai) & = 3y* + 3ay? — 4x3, 

y = —32* — 9°; y = Say — 3ry — 4y*; 


é = ay + 22y3 — 2°, 


6x3 y? — 2y°. 


& = 3y — 4x? — 102zy”, 


(g) (n) 


y = —30 — 9y*® — 2x7y; y = —224 


2. Consider the Hamiltonian system « = H,(«,y), y = —H2(a,y), where H(2,y) = x“y” e~°*—”, and constants p, 8, 1, 
and b are positive. 


(a) Show that the Hamiltonian system has the same solution trajectories as the Lotka—Volterra system from Example 
9.4.4, page 519. 


(b) Show that H(zx,y) has a strict maximum at the equilibrium (4/8,r/b), making it unsuitable as a Lyapunov 
function for the system. 


3. Consider the nonlinear system of equations 
é=ytasx (427 +y"), y=—at+ay (427 +7), 
containing a real parameter a. 


(a) Show that the only equilibrium point is the origin regardless of the value of a. 


(b) Show that a Lyapunov function can be chosen in the form V(a,y) = ax? + by”, with some positive constants a 
and b. Then show that the origin is asymptotically stable if a < 0. 


(c) Use V(a,y) to show that the origin is unstable if a > 0. 
(d) What can you say about stability if a = 0? 


4. Construct a suitable Lyapunov function to determine the stability of the origin for the following systems of equations. 


& = —Ty? &=a-—y—-2? 
| ie (a) ( ee 
y 


y = 3a — 4y’; =at+y-y"; 
b=y, & = 3y—2°, 
b e 
cP) ae (e) een 


£L=y— 22, 
© { 4 ; 


y= 2e-—y- 2X"; 


& = 6y? — 72°, 
3 ee er 
y = —3x° — dy”. 


5. Use Lyapunov’s direct method to prove instability of the origin for the following systems of equations. 
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Review Questions 


One Oe aaatan 
y = 4a y — 2y?; youtdsyta?. 


Let f(x,y) be a continuously differentiable function in some domain 2 containing the origin. Show that the system 


t=ytaf(ry), y=—e+yf(r,y) 
is asymptotically stable when f(x,y) < 0 in 2 and unstable when f(x,y) > 0 in Q. 


Find a gradient system x = VG, where the function G is given. What is a corresponding Lyapunov function for this 
gradient system? 


(a) G(z,y) = 2° — ay’; (b) G(a,y) =a? +y? — (2* +y*) — 5a7y?. 
Show that the nonlinear equations ¢ = 7x? —5xy?, y = 8y° — 5a’y, form a gradient system. 


Consider the Hamiltonian system @ = Hy, y = —Hz and the gradient system ¢ = Hz, y= Hy, where H(z,y) is 
the same function in each case. What can you say about the relationship between the two phase portraits of these two 
systems? 

Using the appropriate Lyapunov function, show that the origin is asymptotically stable for the van der Pol equation 
&+a+e(é— be?) =0. 


Prove that V(z,y) = 2? + a7y? + y", (a,y) € R’, is a strong Lyapunov function for the system 


é = 1-324 3a? + Qy? — 2? — Qay’, 


y=y—2wytary—y 
at the critical point (1,0). 
Consider the Liénard equation # + f(y) y+ g9(y) =0 with f(u) > 0 and ug(u) > 0 for u #0, where f(u) and g(w) are 


continuous functions. Rewrite the Liénard equation as a system of first order differential equations and show that the 
origin is a stable critical point. Hint: Use a Lyapunov function of the form V(x, y) = 2 i g(s)ds+y?. 


Consider the damped pendulum system « = y, y = —w* sinx — yy. 


(a) Use the Lyapunov function V = 4w? (1 — cos(# — 2nm)) + 2y? + 2y(x — 2nm)y + 7 (4 — 2n7)? to show that this 
system is asymptotically stable at every critical point (27n,0), n = 0,+1,.... 


(b) Use the Lyapunov function V = [1 — cos (a — 2ka — 7)|+ysin (a — 2k — 7) to show that this system is unstable 
at every critical point (2k7 + 7,0), k =0,+1,.... 


Determine whether the following equations are gradient systems and, in case they are, find the gradient function G(z, y). 


(a) & = 2x — 3x? + By’, (c) r = 8rd —@ sec? r, 
a c ; 
yaa —Qyt+a? — 3y?. 6 = 4t? — tanr. 
(b) &=cosz siny, (d) & = 4a? — 2y, 
y =sin& cosy. y = 4y? — 2a 


Show that the origin is an unstable stationary point of # — <?sign(#) + # = 0. 


Using a Lyapunov function of the form V(a,y,z) = ax? + by? + cz”, show that (0,0,0) is an asymptotically stable 
equilibrium point of the systems given. 


g=yte—2’, = —2Qytyz—2°, 

rane 2 \ 2 3 _ ey 
(a) yo=—a-—ayte—y’, (b) yr=u-rz-y", 

x 2 2 5 . 3 

Z>-yz-y z-—2z te zZ=n2y-—z. 


Consider the differential equation (9.4.1) on page 514, 7 + f(y) = 0, where f(0) = 0, f(y) > 0 for0 < y < k, and 
f(y) < 0 for —k < y < 0. Introducing the velocity variable v = y, we rewrite Eq. (9.4.1) as a system of two differential 
equations for which the origin is a critical point. 


1 y 
Show that the total energy function V(y,v) = a vw +f f(s) ds is positive definite, so V(y,v) is a Lyapunov function. 


0 
Use this conclusion to determine stability of the critical point (0,0). 


Section 9.6 of Chapter 9 (Review) 


1. 


2. 


Prove that the equation ¥ + y sint/ (3+ sint) =0 does not have a fundamental set of periodic solutions. Does it have 
a nonzero periodic solution? 

In each of the following problems, an autonomous system is expressed in polar coordinates. Determine all periodic 
solutions, all limit cycles, and the stability characteristic of all periodic solutions. 
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11. 


12. 


13. 


14. 


(a) fF=r(1—-7r), 6= : (d) f=r(l—r)(4-7r), 6=1; 
(b) *=r(1—r)’, 6=r; (e) f=r(1—r)’, 6=-1; 
(c) F=r(l—-r)*, 6=r; (f) *=sin(rr), @=—-1 
Find periodic solutions to the system and determine their stability types: 
& = 20 —y— 2x (x? +y"), y=xt2y—2y (2? +y7), z=—kz, k>0. 


Living cells obtain energy by breaking down sugar, a process known™ as glycolysis. In yeast cells, this process proceeds 


in an oscillatory manner with a period of a few minutes. A model proposed by Selkov in 1968 to describe the oscillations 
is 

b=-c+oyte’y, y=B-ay—a’y. 
Here x and y are the normalized concentrations of adenosine diphosphate (ADS) and fructose-6-phosphate (F6P), and 
a and £ are positive constants. 


(a) Show that the nonlinear system has an equilibrium solution « = 8, y = 8/(a + 6”). Show that the stationary 
point is an unstable focal or nodal point if (B? + a)? < B?—a. 


(b) To apply the Poincaré—Bendixson theorem 9.10, page 533, choose a domain ( in the form of a slanted rectangle 
on the upper right corner having a slope of —1. By calculating the dot product @- 7 on each boundary of the 
domain, where v = (4, ¥) is the direction of the tangent line to a trajectory, determine a domain of the indicated 
shape, such that all trajectories cross the boundaries from the outside to the inside. Is there a limit cycle inside 
Q? 


(c) Confirm that the system with a = 0.05, 8 = 0.5 has a limit cycle by plotting the phase portrait. 


Find conditions on a smooth function f(x) so that a differential equation ¢ = f(x) — ¢€f’(x)«, where ¢ is a parameter, 
has a limit cycle. Hint: Use a Liénard transformation y = ef(x) +4“, y = f(z). 


By plotting a phase portrait, show that the system 


= 3y+a(1—22?-y°), y = 2x 


has a stable limit cycle. 


Draw the phase portrait of the system to observe a stable limit cycle. 


é = 5y + 3x (1-27 — 2y’), y = —2e. 
Determine limy-,.. x(t), where x(t) denotes the solution of the initial value problem 


#+e+2x+22°=0, x2(0)=1, «(0) =0. 


Show that the differential equation % + 2 (a? + x? 1) &+a=0 has a unique stable limit cycle. 


By plotting the phase portrait, show that the system 


has periodic solutions, but no limit cycles. 


Find several values of parameter k such that the initial value problem has a periodic solution. 

é=05-a4+y", y=0.5yt+ key +y’, x(0) = 0.9, y(0) = 0.5. 
Draw the phase portrait for the system # = «7/2 — 2%. Is the solution with the initial conditions (0) = 3/4 and ¢ = 0 
periodic? 


Draw the phase portrait of the Hamiltonian system #+ 1.022 — «x? = 0. Give an explicit formula for the Hamiltonian 
and use it to justify the features of the phase portrait. 


For the system 
&=yt+a" —2y’, y = —2x — 2xry, 


find a Hamiltonian and prove that the system has a limit cycle or periodic solution. 


°4The model was first proposed by the famous Russian scientist Evgenii Selkov in the paper “On the Mechanism of Single-Frequency 
Self-Oscillations in Glycolysis. I. A Simple Kinetic Model,” Eur. J. Biochem. 4(1), 79-86, 1968. 
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Review Questions 


For positive constants a and 6b, show that the system 
b= be—aytbaf(r)/r,  y=be+ay+ay f(r)/r, 
where x = ar cos@, y = br sin@, a2b?r? = b?x? + ay”, and tan6 = (ay)/(bx), has periodic solutions corresponding to 
the values of r such that r+ f(r) = 0. Also determine the direction of rotation for the orbits of this system. 
Show that the given system has only constant periodic solutions in some neighborhood of the origin. 
& = 4a — 5y + 307, 

(a) 4. 32 (c) 
y = 38y° + 2° y — 2; 
& = 2y+2° — 32, 
y= Tx —Qy+y?; 


& = 6x + 5y— ay’, 
y= 23 —Ty— 27y; 
& = 5x — 3y — Qry’, 
(b) (dq) y 2 
y = 6x + 3y — 3a~y. 


Show that the given system has a nonconstant periodic solution. 


&b=yta? —y? +407y3, &=2Qyta?—2Qy?427y?, 

(a) — A. (c) ee | 3,2, 
y= —2a — Qey — 2wy’; y 2x —2ey+ary + 2°y’*; 
E=y, & = 2y, 

b d 

(») y=—x—(x*+y*-1)y; @) y = —20° + 3y In (a? + y* +1/4). 


Show that each of the given Liénard equations has a unique nonconstant periodic solution. 
(a) #+27 (a? —-9)¢+4 42° =0; (c) #+ (3a7—4) 4422 =0; 
(b) #+(3a7-cosx)t+a+sinz=0; (d) #+ (52*-16)%+6a° =0. 
Find a continuous periodic solution of the perturbed harmonic oscillator in each of the following systems: 


(a) nonlinear weakly damped van der Pol equation (c is a positive constant) 


x4 e (x? 1a bw a eax =0; 
(b) modified van der Pol equation (€ is a positive parameter) 


é+e(27+4?-1l)¢t+a2=0. 


Find numerical values of € and w? for which the weakly damped van der Pol equation % + € (a? 1) &+wae =0 has 
periodic solutions. 


The Morse potential, named after American physicist Philip M. Morse (1903-1985), is a convenient model for the 


potential energy of a diatomic molecule. The corresponding model reads y = De (c Bay Pe) ge =), where D, is 
the dissociation energy, re is the equilibrium bond distance, y is the distance between the atoms, and the parameter a 
controls the “width” of the potential. By plotting a phase portrait, show that the equation has periodic solutions. 
The Rosen—Morse (1932) differential equation” 


” 


+ + 6 tanh(axr) + 4| y =0, 


Qa 
cosh? (ax) 
where a, (, y, and a are positive parameters, has been used in molecular physics and quantum chemistry. By plotting 
a phase portrait for some numerical values of parameters, observe that it has periodic solutions. 

(N. P. Erugin) Verify that the system 


é=yt (2? +y? —1)sinut, y=-2, 
has an asynchronous solution z = — cost, y = sint. 


It is convenient to reformulate the SIRS model in terms « = S/N, y = 1/N, and z = R/N, which are the fractions of 
the susceptibles, infectives, and removeds, respectively: 


i= [8-7 +048)-(6-a)y— B24 2] 


rNz 


zZ=yy—(6+b)z4 K + ayz, w= [r (1-2) ay] w. 


Show that the above system has a unique positive equilibrium Pi(ya, z4, Na) in the interior of Q if and only if 


B rp ( ei ) 
Ro = ———_ > 1, d Rg = —— (14 — ] > 1, 
0 yFatb—r saad (PEs Gaal aT 


1 6+ pt 1 y ( =) 
9 a ee ee S(io—) |) —— —, Near (i-—.). 
- ( oe eemeer = ( in) yro+ ‘ Ra 


where 


°5Nathan Rosen (1909-2005) was an American-Israeli physicist who worked with Albert Einstein and Boris Podolsky. 
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Fourier partial sum approximation (left) of —z/2 with 45 terms versus 


Cesaro partial sum approximation with the same number of terms. 


Orthogonal Expansions 


The objective of this chapter is to present the topic of orthogonal expansions of functions with respect to a set of 
orthogonal functions. This topic is an essential part of the method of separation of variables, used to solve some linear 
partial differential equations (Chapter 11). The set of orthogonal functions is usually determined by solving the so 
called Sturm-—Liouville problem that deals with ordinary (or partial) differential equations containing a parameter 
subject to some auxiliary conditions. As an application, we give an introduction to the fascinating example of such 
an orthogonal expansion—Fourier series. 


10.1 Sturm—Liouville Problems 


Many physical applications lead to linear differential equations that are subject to some auxiliary conditions that 
could be either boundary or boundedness conditions rather than initial conditions. To every such differential equation 
can be assigned a linear operator acting in some vector space of functions. Moreover, there are many instances where 
a differential equation contains a parameter, which is typically denoted by X. 


Definition 10.1: Let L: V > V bea linear operator acting in a vector space V with domain W CV. A real 
or complex number 4 is called the eigenvalue of the operator L if there exists a nonzero (also called nontrivial) 
element y from W such that 

Ly = dy. (10.1.1) 


The corresponding solution y of the equation (10.1.1) is called the eigenfunction (or eigenvector) of the operator 
L. The problem of determination of eigenvalues and eigenfunctions is called the Sturm—Liouville problem. 
A linear operator is called positive/nonnegative if all its eigenvalues are positive/nonnegative. 


In other words, a Sturm—Liouville problem requires finding such values of parameter for which the given 
problem (10.1.1) has a nontrivial (not identically zero) solution, and then find these solutions. Our objective is to 
solve some Sturm-—Liouville problems for linear differential operators acting in a space of functions. What makes 
this problem special is the presence of a (real or complex) parameter 1. The theory of such equations originated 
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with the pioneer work of Sturm from 1829 to 1836 and was then followed by the short but significant joint paper of 
Sturm and Liouville in 1837 on second order linear ordinary differential equations with an eigenvalue®® parameter. 

Note that Eq. (10.1.1) resembles an eigenvalue problem Ax = \x for a square matrix A and a column vector x 
that we discussed in §7.3. While a Sturm—Liouville problem embraces a matrix equation, it is usually referred to as 
an infinitely dimensional vector space of functions. 

Let us start with a motivated example involving a nonnegative differential operator Ly = —y’’ of the second 
order: 

—y" (x) = Ay(2), -0 <@< om. 

This homogeneous differential equation always has the solution y(x) = 0, which is referred to as the trivial solution. 
The identically zero solution is rarely of interest. However, the given equation has a bounded nontrivial solution for 


any positive A: 
y(x) = Ci cos (2v2) + Co sin (xv) : 


where C; and C, are arbitrary constants. When A = 0, the equation y’’ = 0 has a constant nontrivial solution. For 
negative A, the given equation has only unbounded exponential solutions that are disregarded. Since the eigenfunction 
y(a) exists for every \ > 0, the corresponding differential operator is nonnegative. Clearly, an eigenfunction is not 
unique and can be multiplied by an arbitrary nonzero constant. The set of all eigenvalues is usually referred to as a 
spectrum (plural spectra). In our simple case, we say that the differential operator L has a continuous nonnegative 
spectrum. 

Now we consider the same differential equation on the finite interval: 


y (x) + Ay(x) = 0, O0<ar<f, (10.1.2) 


where £ is some positive real number and 4) is a (real or complex) parameter. Among many possible boundary 
conditions, we begin our journey with simple homogeneous conditions of the Dirichlet type: 


y0)=0, = y (2) = 0. (10.1.3) 


A linear operator corresponding to the given problem (10.1.2), (10.1.3) is L[D] = —D? = —d?/dz? acting in the space 
of functions defined on the finite interval [0, ¢] that vanish at the end points « = 0 and z = @. 

To solve the Sturm—Liouville problem (10.1.2), (10.1.3), we need to consider separately three cases depending on 
the sign of A because the form of the solution of Eq. (10.1.2) is different in each of these cases. It will be shown in 
the next section that the Sturm—Liouville problem for a self-adjoint operator (Ly = —y” is one of them) does not 
have a complex eigenvalue. 


1. If A < 0, then the general solution of the differential equation (10.1.2) is 


? 


y(x) = Cy e*¥—* + Cpe *¥™ 
for some constants C1, C2. Satisfying the boundary conditions (10.1.3), we get 
QtCr=. ev "4060" =o 


Since the determinant of the corresponding system of algebraic equations 
1 1 x <x 
det =e 'V-A_ eV = _9 gi h(¢v=A) 40 
7 lew 0 C € sin : 


the given problem has only a trivial (identically zero) solution. 


2. If \ = 0, the general solution of the differential equation (10.1.2) becomes y(x) = C1 +C 2x. From the boundary 
conditions (10.1.3), it follows that 


Cy +C2:-0=C, =0, Cy t+ Col=Col=0. 


Hence, we don’t have a nontrivial solution. 


96 Jacques Charles Francois Sturm (1803-1855), a French mathematician of German ancestry, was known for his work in differential 
equations, projective geometry, optics, and mechanics. He made the first accurate measurements of the speed of sound in water in 1826. 
Joseph Liouville (1809-1882) was a French mathematician who, besides his academic achievements, was very talented in organizational 
matters. The definition of positiveness was introduced by the German-American mathematician Kurt Otto Friedrichs (1901-1982). He 
was the co-founder of the Courant Institute at New York University and recipient of the National Medal of Science. 
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3. If X > 0, then the general solution of the differential equation (10.1.2) is 
y(x) = Ci cos (xv) + Cy sin (xv) : 
Satisfying the boundary conditions (10.1.3), we get 
Ci+Cz-0=C,=0, Cz sin (¢vX) =0. 
Assuming that C2 4 0 (otherwise, we would have a trivial solution), we obtain the transcendent equation 
sin (ev>) =0, 
which has infinite many discrete solutions (called eigenvalues) 


iS (=y", a oe ee 


To these eigenvalues correspond eigenfunctions (nontrivial solutions): 
NTL 


yn(o) = sin (“*), 1, De Byes 


Positive and negative values of n which are equal in magnitude correspond to the same eigenfunctions up to a 
multiplicative constant. Thus we choose only positive indices for n to label eigenfunctions and eigenvalues. Other 
numbering schemes such as n = —1,2,—3,4,...also work. Therefore, the Sturm—Liouville problem (10.1.2), (10.1.3) 


has a positive discrete spectrum {A,,} (n = 1,2,...) to which correspond eigenfunctions sin (24) up to an arbitrary 
multiplicative constant. | 


Now we turn our attention to Neumann boundary conditions 
y'(0)=0, = y'(@) =0. (10.1.4) 


To solve the corresponding Sturm—Liouville problem (10.1.2), (10.1.4), we need again to consider three cases de- 
pending on the sign of A. When A < 0, we have only a trivial solution. For \ = 0, we substitute the general solution 
y(x) = Cy + Cy into the Neumann conditions to obtain 


Cz =0 


because its derivative y’ = Cz must vanish in order to satisfy the boundary conditions (10.1.4). Therefore, A = 0 is 
an eigenvalue to which corresponds a constant eigenfunction y = Cj; it is convenient to choose C; = 1. 


For > 0, the general solution of Eq. (10.1.2) is a linear combination of periodic functions y(a) = C, cos (xv) + 


C2 sin (zv9), which upon substitution into the Neumann boundary conditions (10.1.4) yields 
y/(0)=CoVA=0, —y'(02) = CiVXsin (ev>) =0. 


From the former, we get C2 = 0 because A > 0. Solving the transcendent equation sin (evr) = 0, we obtain a 


discrete set of eigenvalues 
nT \ 2 
— (=) he oes 


£ 
Note that we include n = 0 to embrace the case X = 0. To these eigenvalues correspond the eigenfunctions 
Yn(x) = cos (=). Oo 
If we have a Dirichlet condition at one end and a Neumann condition at the other end, 
y(0)=0, = y"(€) = 9, (10.1.5) 


then these mixed boundary conditions are called of the third kind. This leads to the Sturm—Liouville problem that 
consists of the differential equation (10.1.2) with a parameter subject to the boundary conditions of the third kind 
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(10.1.5). Similarly to the previous discussion, it can be shown that the Sturm—Liouville problem (10.1.2), (10.1.5) 
has only a trivial solution for A < 0. Assuming » > 0, we substitute the general solution y(a) = C1 cos (xv) + 


C2 sin (xv) into the boundary conditions (10.1.5) to obtain 


y(0)=C,=0, yy" (2) = CoV cos (ev>) asf 


To avoid a trivial solution, we set C2 # 0 and get the eigenvalues from the equation 


; _ _ (n+ 2n) : S 
cos (eV) = 0 => a = () ; n=0,1,2,.... 


The corresponding eigenfunctions become 


1+2 
pos = 0,1, 2s 2 
2 
P 
¥y | y 0 
x=0 el 
P 
r= got 
v x 


Figure 10.1: Example 10.1.1. 


Example 10.1.1: Let us consider a rod of length @; one end of it x = £@ is fixed, while another one x = 0 is free. A 
stretching force P (with units in newtons) is applied to the free end x = 0 along the axis of the rod. It is known 
that when the load P is small, the form of the rod is stable; however, there exists a critical value Py (known as the 
Euler load) of the force that the form of the rod becomes unstable when P > Po, and the rod bends (or buckles). 

We consider the beginning of this bending; in other words, we assume that it is slightly different from its 
equilibrium position along a straight line. Then the equation (credited to Euler) of the bent axis of the rod y = y(z) 
becomes 

Py=-—EIy" (0<a< 2), 


where I is the area moment of inertia of the cross-section about an axis through the centroid perpendicular to 
the xy-plane (it has dimensions of length to the fourth power), and E is Young’s modulus (or elastic modulus, 
which measures the stiffness of an elastic material and has dimensions of force per length squared). If the rod is 
homogeneous of constant cross-section, then EJ is a constant with units N-m?. Setting \ = P/(EI), we obtain the 
boundary value problem 

-y"=dy, —-y/(0) =0, (2) =0. 
Similar Sturm—Liouville problem has been considered previously. The critical loads P, = EI X,, correspond to the 


eigenvalues A, = (ae), and the eigenfunctions y,, (a) = cos rOn uz, n= 1,2,..., define the equilibrium positions 
of the rod. 
If the given rod is nonhomogeneous, then EJ is a function of x. If we set p(x) = (EI)~', we get the Sturm— 


Liouville problem 


-y"=Pp(z)y, y'(0)=0, y(2) =0, 


which cannot be solved using elementary functions in general. 
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Example 10.1.2: Consider an elastic column of length @; one of its end is clamped, but the other one is simply 
supported. Let y(a) be the deflection of the column at point x from its equilibrium position. 

The bending moment is M = Py — Hx = —ETy", where H is the horizontal force of reaction, and P is a load. 
After differentiation twice, we get 


(EIy")"=—Py", — y(0) =y"(0) =0, y(Q=y'(0 =0. 
If flexural rigidity EJ is a constant, we set k? = P/(EI) (with units m~?) and get the Sturm—Liouville problem 
y+ yl” =0, — y(0)=y"(0) = 0, (@) = y'(0) =O. 
The general solution of the differential equation y7Y) + k? y” = 0 is 
y=a+be+ccoskxa+d sinkz, 


with some constants a, b, c, and d. The boundary conditions dictate that a = c = 0, and the eigenvalues k, 
(n = 1,2,...) are roots of the transcendent equation 


sin(ké) = ké cos(ké). 


The eigenfunctions y,,(2) = sink,x — xk, cosk,,€ correspond to these roots. 
When k = 0, the general solution is y = a + bx + cx? + dx?. The boundary conditions are satisfied only when 
a=b=c=d=0. Hence, \ = 0 is not an eigenvalue. 


y 
P 
—_—— 
r=0 r= 


Figure 10.2: Example 10.1.2. 


The conditions (10.1.5) constitute a particular case of the boundary conditions of the third kind: 
aoy(0) — Boy’(0)=0, — ary(2) + Bry/(€) = 0 (10.1.6) 


with some specified values ag, a1, 89, and 8,. The periodic endpoint condition y(0) = y(¢) may be imposed in some 
Sturm-Liouville problems instead of traditional boundary conditions. Equation (10.1.2) can be generalized to 


< (a) | — g(x) y(x) + Ap(x) y(a) = 0, 0<a4<, (10.1.7) 


where p(x) > 0 has a continuous derivative, and g(a), and p(x) > 0 are continuous functions on the finite interval 
(0, ]. The function p(a) is called the “weight” or “density” function. By introducing the derivative operator 
D = d/dz, Eq. (10.1.7) can be rewritten in the operator form L[y] = Ap y, where 


L = L|x,D] = —D (p(x) D) + q(x), D=d/dz, (10.1.8) 


is the linear self-adjoint differential operator. Such differential equations are typical in many applications and are 
usually subject to boundary conditions of the third kind (10.1.6). The corresponding Sturm—Liouville problem 
(10.1.7), (10.1.6) is much harder to solve and analyze. We illustrate it in the following example. 


Example 10.1.3: Solve the Sturm—Liouville problem 
y'+rAy=0 + (0<2<2), (10.1.9) 


y(0)-y'(0)=0, (2) = 0. (10.1.10) 
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We disregard negative values of A because the problem (10.1.9), (10.1.10) has only a trivial solution when A < 0. 
For \ = 0, the general solution of Eq. (10.1.9) is a linear function 


y(x) = Ci + Cox => y (x) = Co. 
The two boundary conditions require that 
y(0)—y'(0)=0=Ci—-Co, = (2) =0 = Cr. 


This leads to Cy = Cz = 0, and we get the identically zero (trivial) solution. 
If A > 0, the general solution of Eq. (10.1.9) is a linear combination of the trigonometric functions 


y(x) = Cy sin (xv>) + C2 cos (xv) ; 


where, as usual, we choose a positive root VA > 0. The boundary condition at x = 0 requires C2 — VAC, = 0 or 
Cz = C, VX. From another boundary condition at x = 2, we get 


CL [sin (2v) + VX cos (2v)] = 0. 


For the Sturm-—Liouville problem (10.1.9), (10.1.10) to have a nontrivial solution, we must have C, 4 0, and p = VX 
must be a positive root of the transcendent equation 


sin (2) + ps cos (244) = 0 (u = V2). 


Since the sine and cosine functions cannot annihilate the same point simultaneously, we may assume that both of 
them are not zero: sin (241) 4 0 and cos (2m) 4 0. Dividing the previous equation by cos (2/4), we get 


tant=—t/2, where t=2V). (10.1.11) 


This equation does not have an analytic solution expressed through elementary functions; however, it can be solved 
numerically. The roots of Eq. (10.1.11) can also be found approximately by sketching the graphs f(t) = tant and 
g(t) = —t/2 fort > 0. From Fig. 10.3, it follows that the straight line g(t) = —t/2 intersects the graph of the tangent 
at infinitely many discrete points t,, n = 1,2,.... The first three positive solutions of the equation tant + t/2 = 0 
are ty 2.28893, tg ~ 5.08699, and t3 ~ 8.09616, to which correspond eigenvalues \; = (t;/2)? ~ 1.3098, Az = 
(t2/2)? ~ 6.46935, and \3 = (t3/2)? ~ 16.387, respectively. The other roots are given with reasonable accuracy by 


ty + (n— 1), n =3,4,.... 


For instance, 77/2 ~ 10.9956 gives a good approximation to t4 11.1727 with correct 2 decimal places. Hence, the 

m(2n —1)\? 
4 

grows. Say for n = 15, we have tis © 45.5969 and 2977/2 = 45.5531. 


eigenvalues are An = (tn/2)? © (n = 3,4,...), where precision of this estimation becomes better as n 


Finally, it should be noted that, generally speaking, a Sturm—Liouville problem may have complex eigenvalues. 


Example 10.1.4: Consider the following boundary value problem from Example 10.1.2: 


yO +k y”=0, (0) =y'(0) =y"(0) =0, (2) =0. 


As previously, it can be shown that & = 0 is not an eigenvalue. Nontrivial solutions exist when & is a root of the 
transcendent equation sin k@ = ké, which has only complex roots. 


In each of Problems 1 through 6, either solve the given boundary value problem or else show that it has no solution. 
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moe 


—rfa 
ry 
Figure 10.3: Graphical solution of tant = —t/2. 
1. y’ +4y=427, y(0) =0, y/(x) =0. 4. y"+y=2sinz, y(0)=0, y(r) =0. 
2. y”+9y=8cosz, y/(0)=0, y(m) =0. 5. y +4y=42, y'(0) =0, y(r) =0. 
3. y’+y=3sin2z, y(0)=0, y(r) =0. 6. y” + 2y’ + y =cos3z2, y'(0) = 0, y’(1) =0. 


In each of Problems 7 through 10, find the eigenvalues and eigenfunctions of the given boundary value problem. Assume 
that all eigenvalues are real. 


7. y” —4y' + r(Ay = 0, y(0 
8. y” + 2y’ + Ay =0, y(0 
9. y” + 4r\y = 0, y(-1)=0, y(1) = 0. 
10. y” +2y' + Ay =0, y'(0)=0, y(2)=0 
In each of the following two problems, determine the real eigenvalues and corresponding eigenfunctions (if any). 
1. y+ y+2A(y’+y)=0 (O<2<1), y(0)=0, y’(1)=0. 
12. xy” —rA(zy'-y)=0 (l<2<4), y1)=0, y(4)+y'(4) =0. 


In each of Problems 13 through 15, assume that all eigenvalues are real. 


a) Determine the form of the eigenfunctions and find the equation for nonzero eigenvalues. 
b) Determine whether = 0 is an eigenvalue. 
c 


(c) Find approximate values for Ai and 2, the nonzero eigenvalues of smallest absolute value. 


( 
( 
(d) Estimate An for large values of n. 

13. y"+dAy=0, = -y(0) = 9,» -y(1) +. 2y(I) = 
14. y"+r4y=0, —-y(0) — 2y'(0) = 0, y= 
15. y” +Ay =0, y(0) — 2y’(0) =0, y(1) + 2y'(1) =0. 

16. Solve the Sturm—Liouville problem y’’ — 4y’ + Ay = 0, y'(0)=0, y’(1) =0. 

17. Solve the Sturm—Liouville problem y” + Ay=0, y(0)— hy’(0) =0, y(€) + hy’(€) = 0, where h, ¢ > 0. 
18. Solve the Sturm—Liouville problem 


y" — by’ +(9+A)y=0, — -y/(0) = 0, y(1) = 0. 


19. Solve the Sturm-—Liouville problem 
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20. A quantum particle of mass m in a one-dimensional box of length ¢ is modeled by the Schrodinger equation subject to 
the boundary conditions (fi = 1.054571800... x 107*+m? kg/s is the reduced Planck constant) 


Find the eigenvalues and eigenfunctions of this problem. 


21. Consider the Sturm—Liouville problem with a positive parameter a 
y"+Ny=0, — y'(0)—ay(0)=0, y/(1) =0. 
(a) Show that for all values of a there is an infinite sequence of eigenvalues and corresponding even eigenfunctions. 
(b) Show that independently of a, the eigenvalues An ++ 7(1+ 2n)/2 as nH oo. 
(c) Show that A = 0 is an eigenvalue only if a = 0. 


22. For each of the following boundary conditions, find the smallest real eigenvalue and determine the corresponding 
eigenfunction of the Sturm—Liouville problem for the buckled column equation y“ + k?y” = 0 (0 <a < £), subject 
to the boundary conditions 


(a) yO) =y'0)=0, yO =y"(4) = 
(b) y(0) =y'"(0) =0,  y(@) =y'(€) = 0; 
(c) yO) =y'(0) =0, y() =y'( =0. 


23. Consider the Sturm—Liouville problem assuming that all eigenvalues are real 


y" —4y'+(4+rA)y=0, = (0) = 0, -y(r) = 0. 


(a) By making the Bernoulli substitution y = uv, determine the function u(«) from the condition that the differential 
equation for u(x) has no v’ term. 


(b) Solve the boundary value problem for v and thereby find the eigenvalues and eigenfunctions of the original problem. 


24. Consider the Sturm—Liouville problem: 


y’ +rAy=0, = -y(0) = 0, 3y(1) - y'(1) = 0. 


(a) Find the determinantal equation satisfied by the positive eigenvalues. 

(b) Show that there is an infinite sequence of such eigenvalues. 

(c) Find the first two eigenvalues, and then show that An © [(2n + 1)7/2]* for large n. 
(d) Find the determinantal equation satisfied by the negative eigenvalues. 

(e) Show that there is exactly one negative eigenvalue and find its value. 


25. Determine the real eigenvalues and the corresponding eigenfunctions in the boundary problem 
wy” —d(ay'—y)=0, (1) =0, (4) = y"'(4). 
26. Consider the Sturm—Liouville problem with a positive parameter a 
y + 2d04y = 0, y'(0)=0, ay(1)+y/(1) =0. 


a) Show that for all values of a > 0 there is an infinite sequence of positive eigenvalues. 


(a) 
(b) Show that all (real) eigenvalues are positive. 
(c) 


(d) Show that the eigenvalues \, = 2/2, where pin & nw — a/(nm) for large n. 


Show that \ = 0 is an eigenvalue only if a = 0. 


In each of Problems 27 through 30, convert the given problem into a corresponding boundary value problem for the Priifer 
variables. Assume that R(x) is not zero at the end points. 


27. y' +3y+Ay=0, y(0) = y(1) =0. 29. y’ —Ty + 8Ay=0, y(0) = y(2) = 0. 
28. y’ —4y + 5Ay=0, y(0) = y(1) = 0. 30. y’ — 2ry+(34+2A)y=0, y(0) =y'(1) =0. 
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10.2 Orthogonal Expansions 


In this section we continue analyzing the Sturm-—Liouville problems and their applications. However, before pre- 
senting this material, we must first develop certain properties of a set of functions; therefore, we start with some 
definitions. 


Definition 10.2: Let f(x) and g(a) be two real-valued or complex-valued functions defined over an interval 
|a, b| and p(x) be a positive function over the same interval. The inner product or scalar product of these 
functions with weight p, denoted by (f |g) or simply by (f,g), is a complex number 


b 
(fla) = (fa) = / F (ae) g(oe) p(x) de, 


where f = u — ju is the complex conjugate of f = u+jv. If f and g are real-valued functions, then their scalar 
product is a real number 


(f,9) = 


If p(x) = 1, then 


The bra-ket notation (f |g) for the inner product was introduced in quantum mechanics in 1939 by Paul Dirac 


(1902-1984). This definition is a natural generalization of the finite dimensional dot product of two n-vectors 
n 


(u,v) =UW-v= S- Ur Uk, Where u = (u1,.--.,Un), V = (U1,---,Un), as the dimension n of the vector space, 
k=1 

which is the number of components involved, becomes infinitely large. From the definition of the inner product, it 

follows that 


(f,9) = (9, f). (10.2.1) 


Definition 10.3: A linear operator L is called self-adjoint if for every pair of elements u and v from the 
domain of L, we have 
(Lu, v) = (u, Lv). 


Definition 10.4: The positive square root of the definite integral (if it exists) 


b b 
/ |f(a)|? p(x) da in the complex case, / f(x) p(x) dx in the real case, 


is called the norm of the function f(«#) (with weight p > 0). It is denoted by ||f||2 or simply || f||, that is, 


b 
WiP=(A= f If (2)|? p(x) de. (10.2.2) 


A function f that has unit norm || f|| = 1 is said to be normalized. 


If the norm of f is zero, then the integral of the nonnegative function | f(a)|? over the interval |a,b| must vanish. 
This means that f(a) is almost everywhere zero and it may differ from zero over any range of zero length. It is 
convenient to speak of a such function as a trivial function. In particular, if f is continuous everywhere in an interval 
\a, b|, and has a zero norm over that interval, then f must vanish everywhere in |a,b|. Functions with finite norm 


(10.2.2) are called square integrable. 


554 Chapter 10. Orthogonal Expansions 


Definition 10.5: For a real-valued function f(x) defined on interval |a, b|, its root mean square value (r.m.s.) 


= | b 
f=1me f= — f teen. 


is 


Squaring both sides, we obtain the mean square value (7) = (b— a)“ |FII?. 


For oscillating quantities, caused by alternating currents and electromotive forces, we are most interested in their 
averages. For instance, a current I(t) amperes, produced by an electromotive force of E(t) volts across a resister of 
R ohms, generates heat at a rate 


0.24 I°R = 0.24 E?/R (cal/sec). 


The number of calories generated during any time interval a < t < b may be expressed in terms of the root mean 
square values for that interval as 


0.24 (7) R(b—a) =0.24||I(t)|2R = 0.24 (Z) (b — a)/R = 0.24||E(t)|[?/RB. 


Definition 10.6: Let p(x) be a positive function over the interval |a,b|. We say that two functions f(x) and 
g(x) are orthogonal on the interval |a, | (with weight p) if 


b 
(f,9) = / p(a) f(«)g(a) de = 0. 


The next definition uses the Kronecker delta®’ notation. 


Definition 10.7: Let ¢1(x), ¢2(x), ...,¢n(x), ... be some set of functions over the interval |a,b|. We call 
such a set of functions an orthogonal set if 


b 
i | F(a) d4(a) ple)dx =0, tj. 


This set is called an orthonormal set if 


b 
Baye / $i (2) $4 (@) plat) de = 8 


where 6;; is the Kronecker delta. 


Example 10.2.1: The set of linearly independent functions ¢,,(”) = x” with odd or even powers n is an orthogonal 
set on any symmetrical interval (—@, 2) with the weight p(a) = x. Calculations show that 


L a r=t 
1 
kon k+n+1 _ k+n+2 _ 
.,¢n) = dz = Ae a —§ 
(ok, on) [ uv Lax [ x AT ae oa 


since k + n+ 2 is an even integer. In general, an infinitely differentiable function can be extended in a Taylor series 


oo F(R) 
about a point zo as f(x) = » ! ee) 


k=0 


(a — ao)*, which may contain all powers of (a — a). 


°7Leopold Kronecker (1823-1891), a German mathematician (of Jewish descent) at Berlin University, made important contributions 
to algebra, group theory, and number theory. 
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Consider a self-adjoint differential operator (10.1.8), page 549, which we denote for simplicity by L. Upon 
multiplication of L[u](a) by a function v(x) and integration by parts within an interval [0, é], we get 


[ Pinar [ (tl ioun) dx 


Thus, we obtain the so called Lagrange identity (for brevity, the independent variable is dropped in the left-hand 
side) 


a=l 


L 
| {L[u] v — wL[v]} dx = —p(x) [u'(x) v(x) — u(x) v(x)| 


10.2.3 
Ls ( ) 


Theorem 10.1: All eigenvalues of the self-adjoint operator are real numbers. 


Let be an eigenvalue of the self-adjoint operator L, and y be the corresponding eigenfunction. 
Then from the identity Ly = Ay, it follows that (Ly,y) = (Ay,y) = Aly, y). Since (y,y) = |ly|/? > 0, and 
(Ly, y) = (y, Ly) = (Ly, y) is a real number (because (Ly, y) is equal to its complex conjugate), then 


, — Lay) 


(y,y) 


is also a real number. 


Theorem 10.2: Eigenfunctions y,(x) and ym(x) of the Sturm-—Liouville problem: 


(py) —ay+Apy =0, — aoy(0) — Boy’(0) = 0, ary(£) + Biy'(Q) =0, (10.2.4) 


corresponding to distinct eigenvalues \,, and X,,, are orthogonal: 
é 
[Ce )um(w)um(0) dx = 0. 
0 


Let L = L/x,D] = —D (p(#)D)+¢(x)I be the differential operator of the given equation, where D = d/dz 
and I is the identity operator. For two eigenfunctions y,,(x) and ym(x), we have Llyn] = Anpyn and L[ym] = Ampym- 
Using Lagrange’s identity (10.2.3), we get 


L £ 
| Cintas xes,) | prea 
0 0 


= —p(2) [vh(2) val) — un(2) vm) | 


From the boundary conditions (10.1.6), it follows that the latter is zero and we have 


£ 
ii | POT ROT One 


Since An ~ Am, we conclude that the eigenfunctions y, and y, are orthogonal. 


Example 10.2.2: Let us consider a set of linearly independent complex-valued functions: 


On(2) age = (eiv7)" n=0,, 1, 2,24; 


where w is a real positive number, z is an arbitrary real number from a finite interval, and j is the unit vector in the 
positive vertical direction on the complex plane so that j? = —1. Note that the complex variable z = ei” has unit 
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length independently of x and w. Each of these complex-valued functions ¢,,(x) of a real variable x repeats itself 
after passing an interval of length T = 27/w. It is convenient to denote T = 2¢ and choose the basic interval [—Z, ¢]. 
Now we show that these functions are orthogonal on the interval [—@, ¢], where ¢ = £: 


e £ i 
= ; p ; 0, fk : 
($4, n) = / elhwe ginwe qa — / el(n—k)we ga — uke en (10.2.5) 
~£ 2 2é, ifk=n. 
Indeed, taking the antiderivative, we get 
1 —- 1 
/ el(n—k)war dz = el(n—k)war =. (nm _ glee) : 
2 J(n — kw ae I(n—k)w 


Since wf = 7, the right-hand side is zero when n # k. For k = n, the identity is obviously true. 
Using Euler’s formulas 


: . : 1 : : 
e? =cosO+jsin#, cos@==(e? +e), sind = 5 (ei? — e9*) , (10.2.6) 


Nl rR 


we obtain from Eq. (10.2.5) the orthogonality relationships for trigonometric functions 


T 27 
/ cos(k@) sin(n@) dé = ‘ cos(k@) sin(n@) dé = 0; (10.2.7) 
—T (0) 
= on 2n, ifk=n=0, 
/ cos(k@) cos(n@) dé = | cos(k#) cos(n0)d@= <7, ifk=n>0, (10.2.8) 
™ 0, otherwise; 


7, ifk=n>0, 


; (10.2.9) 
0, otherwise. 


a sin(k@) sin(né) dé = [ sin(k@) sin(né) dé = 
0 


=o. 


The above formulas tell us that the average of the product of two sines, of two cosines, or of a sine and a cosine, of 
commensurable but numerically unequal frequencies, taken over any interval of length 27 is zero. 


Let { dp(x)}, k = 1,2,..., be a set of linearly independent orthogonal functions on an interval |a,b|, which 
means that none of the functions { ¢;(x) } can be expressed as a linear combination of the other functions. In many 
applications, these functions are eigenfunctions of some linear differential operator. For any square integrable (with 
weight ) function f(x) defined in the interval |a,b|, we may calculate the scalar product of f(a) with each function 
Ok (x): . 

(f, Ox) 1 
Ck del? > Veal? [ f(x) d(x) p(w) da, k=1,2,.... (10.2.10) 
These coefficients are called the Fourier constants (or coefficients) of f(a) with respect to the set of orthogonal 
functions {¢(x)}),,5, and the weight function p(x). With this picturesque terminology, the representation of a square 
integrable function f(x) as 


f(x) =>— ce bx (a) (10.2.11) 
k>1 
can be interpreted as an expansion of the given function in terms of the set of orthogonal functions. 

The set of functions {¢x(7)},51 is analogous to a set of n mutually orthogonal vectors in n-vector space, and 
we may think of the numbers ci, C2, ..., Cx, ...as the scalar components of f(x) relative to this basis. We can 
identify f(x) with the infinite vector c = (ci1,c2,...). This correspondence along with uniqueness of the Fourier 
series representation establishes a one-to-one mapping between a certain set of functions (which usually includes 
square integrable functions) and a discrete but infinite set of sequences. 

We may try to find a finite approximation of the given real-valued square integrable function f(a) as a linear 
combination of functions {x (x) } 51: 


n 


f(v)* So ange(e) = (a<a<d). (10.2.12) 


k=1 
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It is convenient to assume that the set of functions is normalized, that is, ||¢,(a)|| = 1, k = 1,2,.... We want to 
determine the coefficients a; in such a way that the norm of the difference between the function f(z) and its partial 
sum (10.2.12) over the interval (a,b) is as small as possible: 


‘ in 2 
-| [109-3 aout p(x) de. 


k=1 


n 2 


f(a) — S> ax bx (a) 


k=1 


A 


The approximation to be obtained, over the interval (a,b), is thus the best possible in the “least squares” sense. 


Now, we reduce brackets to obtain 
: 2 
ee oto) p(x) de 


A= [Per (@dr-2 0m fH ) be (a Jotayae+ f 2, 
= [ Pe (dr —2 9° accaldn? + > away bx (x) 3 (x) p(x) dx 


k,j=l 
2) eS ag =f? -— > &+ >) (ce—ax)? (Ide l| = 1). 
k=1 k=1 k=1 k=1 


It is clear that, since f and its Fourier constant cy, are fixed, A takes a minimum value when the coefficients a, are 
chosen such that 
Ak =CrK (k =1,2,...). 


Therefore, the best approximation (10.2.12) in the mean square sense is obtained when a, is taken as the Fourier 
constant of f(a) relative to ¢,(a#) over the interval (a, b). 
Since A is a positive number, we have the relation 


n 


b 
II? = f #0) ole) az > > elleul?, (10.2.13) 


k=1 


known as Bessel’s inequality. The geometric meaning of inequality (10.2.13) is that the orthogonal projection of 
a function f on the linear span of the elements, {¢,(a)}, k = 1,2,...,n, has a norm which does not exceed the norm 
of f. 

Suppose now that a dimension n of the finite orthogonal set {¢,(x)}, k = 1,2,...,n, is increased without a limit. 
The positive series in the right-hand side of Eq. (10.2.13) must increase with n because it is the sum of positive 
numbers. Since the series cannot become greater than the fixed number ||f||?, we conclude that the series }> c? 
always converges to some positive number less than or equal to r fe pde=|[7 |. 

However, there is no assurance that the limit to which this series converges will actually coincide with this 
integral. If this is the case for every square integrated function f, we say that the set®® of functions ¢1(x), ¢2(2), 


., n(x), ..., is complete (with respect to mean square convergence). Hence, for a complete set of functions, we 
have 
b n 7 
im : fre - a Ch oto) dx = 0. (10.2.14) 


Generally speaking, we cannot guarantee that the Fourier series 


n co 


dim, S- Cr Og (2) = 7 Ck Pk (L) 


k=1 k=1 


converges pointwise to f(a) for every x € (a,b). We know only that the mean square error in (a,b) goes to zero, 
and we say accordingly that if Eq. (10.2.14) is valid, then the series converges in the mean square sense to f(z). 
This is an essentially different type of convergence compared to a pointwise convergence because an infinite series 
may converge at every point but diverge in the mean square sense. And vice versa, a series may converge in the 


°8The term “complete” was introduced in 1910 by the Russian mathematician Vladimir Andreevich Steklov (1864-1926), a student of 
Aleksander Lyapunov. 
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mean square sense to a function that differs from a pointwise limit in a discrete number of points. However, if f(z) 
and all functions ¢,(x) are continuous over the interval (a,b) and the series }>,, crx x(x) converges uniformly in the 
interval, then the Fourier series (10.2.14) pointwise converges everywhere in (a,b). 


Theorem 10.3: Every function from the domain of a self-adjoint differential operator can be expanded into 
uniformly convergent Fourier series (10.2.14) over the set of its eigenfunctions. 


The proof of this theorem is based on reduction of the problem to an integral equation. Then using the Hilbert— 
Schmidt theorem, a function is expanded into series over eigenfunctions. See details in a course on integral equations, 
for instance, [40]. Oo 

Recall that the domain of a linear differential operator of order n consists of all functions that have continuous 
derivatives up to the order n and satisfy the boundary conditions that generate the linear operator L. Therefore, 
Theorem 10.3 refers to such functions. In particular, if L is an operator (10.1.7) of the second order and f is a 
function having two continuous derivatives subject to the corresponding boundary conditions (10.1.6), then the 
Fourier series converges uniformly to f(z). 

Finally, we turn our attention to the Sturm—Liouville problems (10.1.7), (10.1.6) generated by the self-adjoint 
differential operator L[x,D] = —D (p(x) D) + q(x) on a finite interval |0, ¢| subject to traditional boundary conditions 
(10.1.6). This problem is usually referred to as a regular Sturm—Liouville problem when p(a) > 0 and p(z) > 0. 


Theorem 10.4: Any regular Sturm—Liouville problem has an infinite sequence of real eigenvalues Ap < A1 < 
A2 <-++ with limp, An = 00. The eigenfunctions ¢,(#) are uniquely determined up to a constant factor. 


Next statement assures us that any square integrable function can be expanded into Fourier series over eigen- 
functions generated by a regular Sturm—Liouville problem and this series converges in mean square sense. 


Theorem 10.5: The set of eigenfunctions of any regular Sturm—Liouville problem is complete in the space of 
square integrable continuous functions, on the interval 0 < x < £ relative to the wight function p(z). 


1. This problem consists of two parts. In the first one, you are asked to show that pointwise convergence does not apply 
to a mean square convergence. In the second part, you are asked to show that a sequence may converge pointwise, but 
diverge in the mean square sense. 


(a) Let fn(z) = 2” be a sequence of functions on the unit interval [0,1], which converges pointwise to f(x) = 0 on 
the semi-open interval [0,1) and to 1 at « = 1. Show that the sequence f,,(#) converges in the mean square sense 
to 0 on (0, 1]. 

(b) Consider a sequence of functions s,(x) = ne~”* on the unit interval 0 << 1. Show that sn(#) > 0 as n > oo for 
each x in (0, 1]. On the other hand, show that 


1 
mean error = | s>(a) dx ae e 7”) + OO. 
0 


2. Find the square norm for each of the following functions on the interval —2 < x < 2: 
(a) 4; (b) cos - (c) cosh z; (d) 22; (e) x. 


3. Find the square norm for each function of the previous problem on the intervals 0 < 7 < 2 and -2<2<0. 
4. Find the norm and the root mean square value for each of the following functions on the interval indicated 
(a) cost; 0<t< 7/2; (b) 1/t, 2<t<6; (c) t*, O<t<1. 
5. Find the norm and the root mean square value of the product cos x sin on the interval 0 < « < m/2 and —7/2 <a < 
mw /2. 
6. What multiple of sin x is closest in the least square sense to sin® z on the interval (0,7)? 


22, if0<a<1, 
3-a, ifl<a4<3; 


7. Find the norm and the root mean square value of the function f(x) = over the interval 


0<2<3. 
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10.3 Fourier Series 


How can a string vibrate with a number of different frequencies at the same time? The answer for this question 
was given by Jean Baptiste Joseph Fourier®? (1768-1830), who made a claim in 1807 that a periodic wave can be 
decomposed into a (usually infinite) sum of sines and cosines. 


10.3.1 Music as Motivation 


People have enjoyed music since their appearance on the earth—it is one of the oldest pleasurable human activities. 
Reproducing music and transferring it from generation to generation and from one nation to another requires a 
special universal language. This language is not our objective; however, we try to explain music using mathematics. 
The partnership between mathematics and music traces back at least 2500 years when the ancient Greeks established 
a connection between the two. The Pythagoreans tried to explain the pleasing harmonics of some sounds and the 
distracting effects of others—you can observe this phenomena by running a long chalk bar over a black board. We 
use music as a motivating example for clarifying the topic in this section. 

The sounds we hear arise from vibrations in air pressure. Humans don’t hear all sounds. For example, we can’t 
hear the sound a dog whistle makes, but dogs can hear that sound. Marine animals can often hear sounds in much 
larger frequency range than humans. What sound vibrations we can hear depends on the frequency and intensity 
of the air oscillations and one’s individual hearing sensitivity. Frequency is the rate of repetition of a regular event. 
The number of cycles of a wave per second is expressed in units of hertz!°? (Hz). Intensity is the average amount 
of sound power (sound energy per unit time) transmitted through a unit area in a specified direction; therefore, 
the unit of intensity is watts per square meter. The sound intensity that scientists measure is not the same as 
loudness. Loudness describes how people perceive sound. Humans can hear sounds at frequencies from about 20 Hz 
to 20,000Hz, though we hear sounds best at around 3,000 to 4,000Hz, where human speech is centered. Scientists 
often specify sound intensity as a ratio, in decibels!®! (written as dB), which is defined as 10 times the logarithm of 
the ratio of the intensity of a sound wave to a reference intensity. 

When we detect sounds, or noise, our body is changing the energy in sound waves into nerve impulses which the 
brain then interprets. Such transmission occurs in human ears where sound waves cause the eardrum to vibrate. 
The vibrations pass through 3 connected bones in the middle ear, which causes fluid motion in the inner ear. Moving 
fluid bends thousands of delicate hair-like cells which convert the vibrations into nerve impulses that are carried to 
the brain by the auditory nerve; however, how the brain converts these electrical signals into what we “hear” is still 
unknown. 


Figure 10.4: The sound of a clarinet. 


Sounds consist of vibrations of the air caused by its compression and decompression. Air is a gas containing 
atoms or molecules that can move freely in an arbitrary direction. The average velocity of air molecules at room 
temperature, under normal conditions, is around 450-500 meters per second. The mean free paths of air molecules 
before they collide with other air molecules is about 6 x 107° meters. So air consists of a large number of molecules 
in close proximity that collide with each other on average 10!° times per second, which is perceived as air pressure. 


°° Joseph Fourier was born in Auxerre (France), the ninth child of a master tailor. The name “Fourier” is a variant on the word 
fourrier, which means in a military sense a quartermaster and in a figurative connotation a precursor or anticipator of ideas. More often 
than not during his life Fourier was to have his name spelled in that way. He took active role in French revolution, and more than once 
his life was in danger. In 1798, Fourier accompanied Napoleon on his expedition to Egypt, and was Prefect (Governor) of Grenoble twice. 

100The word ”hertz” is named for a German physicist Heinrich Rudolf Hertz (1857-1894), who was the first to conclusively prove the 
existence of electromagnetic waves. 

101The unit for intensity is the bel, named in honor of the eminent British scientist Alexander Graham Bell (1847-1922), the inventor 
of the telephone. 
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When an object vibrates, it causes waves of increased and decreased pressure in the air that travel at about 340 
meters per second. Figure 10.4 represents variation of air pressure on the vertical scale produced by a clarinet with 
time along the horizontal axis. The greater the variation in the vertical direction, the louder the sound. 

A musical tone (pl. tonoi) is a steady periodic sound, which is often used in conjunction with pitch. A simple 
tone, or pure tone, has a sinusoidal waveform. Such a tone is identified by its frequency and magnitude, while its 
corresponding pitch is a subjective psychoacoustical attribute of sound; however, tone is commonly used by musicians 
as a synonym for pitch. From a mathematical point of view, a tone is a sine (or cosine) function sin(27vt), where v 


is the frequency of the pitch. In theory, it is often called the mode!” (from the Latin modus, which means measure, 


standard, or size). 
Ch Dt Ft Gi At 
Re Mi Fa Sol La 


Do Si 


To represent a pitch class and duration of a musical sound, musicians use special symbols, called notes. Many 
years ago, it was discovered that doubling the frequency of a note results in a higher note that, in some sense, people 
perceive to be the same as the original. In musical terminology, the note with double the frequency of another note 
lies one octave higher and is called an overtone. Likewise, descending by an octave—halving the frequency—produces 
a lower note that is one octave below the original note. All such notes are called the pitch class. 

One can easily recognize notes of the same pitch class on a classical 88-key piano that has seven octaves accom- 
panied by a minor third. A 12-key pattern with seven white keys and five black keys repeats up and down the piano. 
The lowest note marked Ap corresponds to a frequency of 27.5Hz. The highest note is Cg with a frequency of about 
4186.01Hz. The interval between two successive piano keys is called a semitone. Therefore, an octave includes 12 
semitones. A sharp { raises a note by a semitone or half-step, and a flat » lowers it by the same amount. In modern 
tuning a half-step has a frequency ratio of \/2 = 2'/!?, approximately 1.059. The accidentals are written after the 
note name: so, for example, Ff represents F-sharp, which is the same as Gb, G-flat. In European terminology, they 
are called Fa-diesis or Sol-bemolle. 

In traditional music theory within the English-speaking world, pitch classes are typically represented by the 
first seven letters of the Latin alphabet (A, B, C, D, E, F, and G). Many countries in Europe and most in Latin 
America, however, use the naming convention Do-Re-Mi-Fa-Sol-La-Si-Do. The eighth note, or octave, is given the 
same name as the first, but has double its frequency. These names follow the original names reputedly given!®* by 
Guido d’Arezzo. 

“Concert pitch” is the universal pitch to which all instruments in a concert setting are tuned, so that they all 
produce the same frequency corresponding to the middle C or Do (although technically, the A or La above middle 
C is used as the compass, and has a frequency of 440Hz). Middle C’s frequency is 220 x 2'/4 = 261.626Hz. The 
middle C, which is designated C4, in scientific pitch notation, has key number n = 40. On a piano keyboard, the 
frequency of the n-th note is 2("~49)/12 x 440 Hz. 

Overtones are often referred to as harmonics and are higher in frequency. Therefore, they can be heard distinctly 
from other tones played at the same time. Tones of the lowest frequency of periodic waveforms, referred to as 
fundamental frequencies or tones, generate harmonics. Pythagoreans also discovered that there exist other pairs 
of notes (called chords) that sound pleasant to human ears. One of them, called the fifth, is a musical interval 
encompassing five staff positions, and the perfect fifth (often abbreviated P5) is a fifth spanning seven semitones, for 
instance, C and G (or D and A). If the first note in the fifth has frequency f, then the next note has the frequency 
approximately of 3f/2. This note together with the note of frequency 2f (one octave above) constitute the fourth; 


102 A Roman philosopher Anicius Boethius (480-524) used the term modus to translate the Greek tonos, or key, which is the source of 
our word “tone.” 

103 Guido of Arezzo (also Guido Aretinus, Guido da Arezzo, Guido Monaco, or Guido d’Arezzo) (991/992-after 1033) was a music 
theorist of the Medieval era. 
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they are also consonant. For example, G and the C located an octave up is a fourth. Its frequency ratio is 4/3 
because 3 : 4 = 2. Therefore, the octave is divided into a fifth comprised by the fourth. 

Not every chord results in as pleasant a harmony as an octave, which has a simple frequency ratio of 2, or a fifth 
(with a ratio about 3/2). Some note pairs make us cringe. As an example, let us look at a chord to find out why 


these notes sound good together: 


C  261.626Hz 
E  329.628Hz 
G 391.995Hz 


The ratio of E to C is about 5/4. This means that every 5th wave of E matches up with every 4th wave of C. 
The ratio of G to E is about 5/4 as well. The ratio of G to C is about 3/2. Since every note’s frequency matches up 
well with every other note’s frequencies (at regular intervals), they all sound good together! 


Vi Vy 


Figure 10.5: The chord C-E-G. Figure 10.6: The chord C-Ct. 


Now consider two chromatically adjacent notes C and Ci (of frequency 277.183Hz), which make up the smallest 
musical interval, known as a half-tone interval. Played alone, they produce an unpleasant, clashing sound (see 
Fig. 10.6). 

These examples show that musical sounds or notes can be interpreted as a combination of pure tones of sine 
wavefronts. In reality, any actual sound is not exactly periodic because periodic functions have no starting point 
and no end point. Nevertheless, within some time interval, almost every musical sound exhibits periodic movements. 
The next question to address is whether any sound can be decomposed into a linear combination of pure tones? The 
affirmative answer was given by Joseph Fourier, and is presented in §10.3.3. 


10.3.2 Sturm—Liouville Periodic Problem 


To understand sound waves and their wavefronts, we have to create mental pictures of them because they cannot 
be seen. As we saw previously, a sound can be modeled by a composition of sine and cosine functions. What 
makes these trigonometric functions so special? Could we decompose sound waves using other families of periodic 
functions? The answer lies in the differential equation for a simple harmonic motion 


when a particle of mass m is subject to an elastic force moving it toward its equilibrium position y = 0. Here 
y= d?y/ dt? is the acceleration of the particle and k is the constant of proportionality. 
Recall that a function defined for all real values of its argument is said to be periodic if there exists a positive 
number 7’ > 0 such that 
f(t+T) = f(t) for every value of t. (10.3.1) 


It is obvious that if, for some T, Eq. (10.3.1) is valid, then it is true for 2T, 37, and so on. The smallest positive 
value of T for which Eq. (10.3.1) holds is called the fundamental period or simply the period. 

In the case of the human ear, the harmonic motion equation my +k y = 0 may be taken as a close approximation 
to the equation of motion of a particular point on the basilar membrane, or anywhere else along the chain of 
transmission between the outside air and the cochlea, which separates out sound into various frequency components. 
Of course, harmonic motion is an approximation that does not take into account damping, nonlinearity of the 
restoring force, or the membrane equation, which is actually a partial differential equation. Nevertheless, since 
harmonic motion gives a reasonable approximation to actual phenomena, we can start with it. 


562 Chapter 10. Orthogonal Expansions 


Setting \ = k/m, we consider the Sturm—Liouville problem with the following periodic conditions: 
y+ Ay =0, y(t) = y(t + 20) = y(t + 2ne), 


where T = 2¢ is the period and n is any integer. Obviously, \ = 0 is an eigenvalue, to which corresponds the constant 
eigenfunction. When A < 0, the general solution of the equation y + Ay = 0 is a linear combination of exponential 
functions that are not periodic. Therefore, negative \ cannot be an eigenvalue. Assuming that \ > 0, we find the 
general solution of the equation y” + Ay = 0 to be 


y(t) = a cos (tv) +6 sin (tv) =c sin (« V+ ¢) ’ 


where a, b or c, y are arbitrary constants (c = Va? + 6? is called the peak amplitude and the y phase). This 
function is periodic with period 2¢ if VX = 2nn/(20), which leads to the sequence of eigenvalues Ay, = (n/0)” and 
corresponding eigenfunctions 


t 
Yn(t) = Gn COS (=) + by sin (=). na ee 


If n is a positive integer, w = (27) /(20) = 7/£ is a positive number, and a», by, are real constants, the eigenfunction 


Gn Ccos(nwt) + by sin(nwt) = ./a2 + b2 sin (nwt + Yn) 


represents an oscillation of frequency v = nw/(27). The period of each oscillation is the reciprocal of the frequency, 
v—| =27/(nw). Hence, in a time interval of length tr = 27/w, the function y(t) completes n oscillations. 


10.3.3. Fourier Series 


In 1822, the French mathematician/physicist /engineer/politician Joseph Fourier published a book (“Théorie analy- 
tique de la chaleur,” which is translated as “The analytical theory of heat”) in which he summarized his research, 
including an astonishing discovery that virtually every periodic function could be represented by an infinite series of 
elementary trigonometric functions: sines and cosines. Fourier’s claim was so remarkable and counter intuitive that 
most of the leading mathematicians of that time did not believe him. Now we enjoy applications of his discovery: 
Fourier analysis lies at the heart of signal processing, including audio, speech, images, videos, seismic data, radio 
transmissions, and so on. 


Figure 10.7: The graph of the sum of three eigenfunctions. 


It is obvious that the sum of two sinusoidal functions with the same frequency, ap, cos(nwa) + by sin(nwa) and 
A, cos(nwx) + B,, sin(nwa), is again an eigenfunction with the same frequency. On the other hand, the sum of two 
or more sinusoidal eigenfunctions with different frequencies is a periodic function, but it is not an eigenfunction. For 
example, the sum of three eigenfunctions 


1 1 
sinz + = sin2x — — cos3z 
2 6 
is a periodic function, which graph has little resemblance to a sinusoidal function (see Fig. 10.7). It is natural to 
ask an opposite question: is it possible to represent an arbitrary periodic function as a linear combination of a finite 
or infinite number of such oscillations? For a wide set of functions, an affirmative answer for this question will be 
given in the next section. Now we start with a finite sum of eigenfunctions: 


N 
Sn(a) = S- (ay, cos(nwax) + by sin(nwx)) , w= 


n=0 
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that defines a trigonometric polynomial (of order N) that repeats itself after each interval of length 7 = 27/w = 
2¢, that is, Sv(a +7) = Sy(x). Therefore, 7 is the fundamental period for S'y(a) independent of the number of 
terms, N. Sometimes, it is convenient to introduce a new variable t = wx = 72/@, so the function Sy (a) = Sy (té/7) 
becomes a finite sum of simple harmonics with the standard period 27 in variable t. 

Suppose that we are given a periodic function f(a) of period T = 2¢, which we approximate with a trigonometric 
polynomial Syj(a). As the number N of terms in the finite sum Sy(x) increases without bounds, the limit function, 
if it exists, defines the infinite series. But it may be neither differentiable nor continuous. The limit is called the 


Fourier series of f(x): 
f(z) ~ > F » ap COS (=) + by sin (=) . (10.3.2) 


The first coefficient ag/2 in this sum is written in this specific form for convenience that will be explained shortly. 
When we do not know whether the infinite sum (10.3.2) converges or diverges, or we are not sure whether it is equal 
to the given function at all points or not, we prefer to use the symbol “~” instead of “=.” This series (10.3.2) can 
be simplified further for a function F(t) = f (t@/7) with the fundamental period 27: 


ail: 


F(t) ~ > + (a, cos kt + b;, sin kt) . (10.3.3) 


Il 


1 


To find the values of coefficients in these series (10.3.2) and (10.3.3), we first assume that they converge uniformly 
to the given functions. Then the symbol “~” in Eqs. (10.3.2) and (10.3.3) should be replaced with an equal sign. 
Multiplying both sides of the relation (10.3.3) by sinmt for some integer m, and integrating the results over an 
interval (—7,7), we get 


/ F(t) sinmt dt = > if sin mt dt + > i (az, cos kt + b, sin kt) sin mt dt. 
ae k=1°"7 


TT 


Here we interchanged the order of integration and summation, which is justified for uniformly convergent series. 
All integrals in the right-hand side are zeroes because of orthogonality formulas (10.2.7)-(10.2.9) except k = m. 
Repeating this procedure for cos mt (involving multiplication by cos mt and integration), we finally obtain 
1 TT 
dh = — F(t) cos(kt)dt, k=0,1,2,..., 


TT 


== f F(t) sin(kt)dt, k=1,2,3,.... 


Tv 


(10.3.4) 


The coefficients ax, by (K = 1,2,...) are twice averages of F(t) times the trigonometric functions over the interval 
[—7, a]. Similarly, for a periodic function f(x) of period T = 2, we have 


9) T/2 ak 
a= | fe) 00s (7) ae, k= 0,1, 2,-2.5 


-T/2 


2 7 . Qhrx 
n= =f f(a) sin ( Tr ) ae, Ka 1, 2:35.52: 


(10.3.5) 


The formulas (10.3.4) and (10.3.5) were discovered by L. Euler in the second part of the eighteenth century, and 
independently by J. Fourier at the beginning of nineteenth century. Therefore, they are usually referred to as the 
Euler—Fourier formulas. The numbers ao, aj, ..., 61, b2, ... are the Fourier coefficients of f(x) on [—@, 4] with 
: ‘ . kra . (kre . 
respect to the set of trigonometric functions {eo (=) ,sin (=) \ . Now it is clear why we have chosen the 
k>0 

free coefficient (ao/2 is the average value of the corresponding function) in such a form—it simplifies and unifies the 
formulas (10.3.4) and (10.3.5). It is possible to show (see Problem 11) that for a periodic function f(a), the Fourier 
coefficients can be evaluated over any interval of length T = 2¢. 

Since the trigonometric functions in expansion (10.3.2) are periodic, the series (10.3.2) thereby defines a periodic 
function with period T = 2¢. If originally we are given a function f(x) on an interval of length T, it is convenient to 
expand f(a) from this interval by making it periodic with the period T = 2, that is, we set f(x +7) = f(x). With 
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this approach, we expect that the trigonometric series (10.3.2) represents extended periodically function for all real 
x. There is nothing special that initially a function f(a) is required to be defined in the symmetrical interval (—2, ¢) 
of length T = 2@. An arbitrary interval (a,b) of length T = b — a can be used instead. This will affect the limits of 
integrations in Eq. (10.3.5) but not the values of coefficients. The Fourier series (10.3.2) defines a periodic extension 
of the given function from the finite interval. The following examples clarify the topic. 


Example 10.3.1: Let us consider the function f(¢) = —t/2 on the interval (—7, 7). Expanding it periodically with 
the period 27 (so ¢ = 7), we obtain a sawtooth function (see Fig. 10.8). Other sawtooth functions are discussed in 
Exercise 2, page 568. From the Euler—Fourier formulas (10.3.4), it follows 


1 /" -t |? 
=i 2 4 t=r 
oe i kt dt = ! (tk sin kt + cos kt) = =0 
a 3 cos oe sin cos _ =0. 
Similarly, 
by = ~[ a3 kt dt = : (kt kt — sin kt) a 
——= 3 sin = >Re cos sin _ 
_ km coskt—sinkn — (—1)* 
— ker —  k O 
Therefore, 
bx l=) 
=> i sin kt, —7<t<7 


Figure 10.8: Example 10.3.1, graph of the function f(t) = —t/2 extended periodically from the interval (—7, 7). 


The series converges to 0 at the points of discontinuity (t = 2na+7, n =0,+1,+2,...). The graph of its partial 
sum, ae at sin kt with N = 45 terms, is presented in the figure on the front page 545 of Chapter 10. 


If we consider the same function f(t) = —t/2 on the interval [0,27], we will get the following Fourier series: 
t =a 
-5 =-3+> psinkt,  0<t<2n. 


k=1 


To estimate the quality of partial sum approximation (see Fig. 10.9), we calculate the mean square errors with 


N =15 and N = 50 terms: 
OT ea. : 


which gives Ays % 0.202613 and Aso & 0.0622077. 


Example 10.3.2: Now we find the Fourier series for the piecewise continuous function on the interval | — 2, 2]: 


@< —2, —2<2<0, 
a aa 1, O0<a4<2. 
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Figure 10.9: Example 10.3.1. (a) Fourier approximation of the periodic extension of the function f(t) = —t/2 defined 
initially on the interval (0,27) with N = 15 terms and (b) the same approximation with N = 50 terms. 


Using the Euler—Fourier formulas (10.3.5) with @ = 2, we obtain 


a i ca 
ag=- av—> | xdx = 2, 
2 Jo 2 J_»9 


ie kra if kra 2 
an} fo) se- $f am (Jae atotn—n, 


i kra a i krax 1 
be = = i -= i — ; 
5 | sin (= 5 ) da 5 I, xsin (= 5 ) dx = = (cos(k7) + 1) 


Recall that cos(kr) = (—1)*, for integer values of k, so cos(k7) — 1 = (—1)* —1 = —2 for odd k, and 0 for all 
other values of k. Similarly, cos(km) + 1 = (—1)* +1 = 2 for even k, and 0 otherwise. Thus, we can simplify the 


expressions for the Fourier coefficients: 


0, if k is even, — if k = 2n is even, 
Qk — by — 
k272? 


> 


if k = 2n — 1 is odd; 0, if k is odd. 
Substituting these coefficients into the Fourier series and separating even and odd indices, we get 
(2n — 1)ra 
g(x) )=14 = sin( (nx) “Lo yy os (Sete ‘ 
n21 
These sums can be evaluated explicitly: 
a —2<2<0, 
14+452, 0<2<2; 


2n—1)r2\ —it, —2<2<0, 

(St OK eK<2. 
This allows us to represent g(a) as g(x) = gi(a) + go(x). The partial sums 1 + y 1 ae Sin(nr2) converge to gi (x) 
at every point where the function is continuous (except points 2k7, k = 0,+1, +2, . rr The series for g2(x) defines 
a continuous function on the whole line. The partial sums can be plotted using the following Maple commands: 


N:=15: £N:= x 1+sum((n*Pi)(-*sin(n*Pi*x) ,n=1..N) - 
4*Pi(—?) *sum((2*n-1)(-2)kcos ((2*n-1) *Pi*x/2) ,n=1..N) 


plot (£N(x) ,x=—2..2, color=blue) 
We choose N = 15, the number of terms in the partial sums, for simplicity. The graph of the corresponding 


truncated Fourier series for the function g(x) is presented in Fig. 10.10(a). 
Example 10.3.3: Let us consider a continuous function on the interval (—1,3): 


x, -l<e<l, 
h(x) = 1, L<g¢< 2, 
—“2+3 2<2<3. 
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Figure 10.10: Example 10.3.2. (a) Fourier approximation of the periodic extension of the function f(a) and (b) the 
corresponding Cesaro approximation, plotted with Maple. 


Figure 10.11: Example 10.3.3, graph of the function h(z). 


If we extend it periodically h(x) = h(a + 4), then its extension becomes a discontinuous function on the whole line 
(—oo, 00), see Fig. 10.11. Since the given interval has length T = 4 = 2¢, we use the Euler—Fourier formulas (10.3.5) 
with ¢ = 2: 


a i a a i 1.1.3 
== x] 4 = = = scien eens 
ao yf eats f de+5 f x) da Ot+st7 ri 


i I ge a i 
an=5 | 1 cos “ae + 5 | ee CR 
2 f 6 , ) 


1 2 2 2 
= ep (2 cosnin — nr sin St 2 cos 2 ) 
n2 7? 2 oe a 
1 2 3 
ina gf asin ar 5 | 1 sin" ae +5 f (-2) sin ae 
= ey (4 sin SE — 2 sin 3 — nn cos SE) 
n212 2 2 2 


Substituting these values into Eq. (10.3.2), we obtain the Fourier series for the given function: 


3 1 - _ nT 3nTt nT 
h(a) = tyes (2-1 — NT aoa —2 cos ) cos (S2) 
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Figure 10.12: Example 10.3.3; Fourier approximation with N = 20 terms. 


Figure 10.13: Problem 2(a), graph of the sawtooth function. 


Figure 10.14: Problem 2(b), graph of the sawtooth function. 


Figure 10.15: Problem 3, graph of the function f(z). 


Figure 10.16: Problem 3, graph of the function g(x), plotted with pstricks. 
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Figure 10.17: Problem 5, graph of the square wave, plotted with pstricks. 


NANI NE ™.. 


Figure 10.18: Problem 6, graph of | sin z|, plotted with pstricks. 


ae and Euler’s formula (10.2.6), show the following relations: 


1. Using the geometric series sum )7/_, i 


a cos 9 — cos(2n +1) _ sin(n+1)@ sinné 
2k) = ————— = a 10.3. 
» sey) 2sin 0 sin 0 ; Mose) 
k=1 
é _ sin(2n+1)@—sin@ — cos(n+1)6 sinné 
k=1 
ey sin(2k0) = — cos mn ens YO) sin =_ (10.3.8) 
> (-1)* cos(2k0) = —; cos (a+ Orr?) sin ae (10.3.9) 


k=1 
2. Prove the following Fourier series for sawtooth functions: 


(a) Euler (1755): 


k=1 
(b) 
ely? 2 
1+t=1-2)> i sin kt, —1T<t<7; 
k=1 
(c) Euler (1755): 
2a (—1)* kra 
2=— ke sin | —— ], -—a<r<a; 
i a 
(d) 
2Qa+ta a a y (—1)* Qhra 
=--— sin : -a/2<a<a/2. 
4 4 2 SS k a 


3. Consider two continuous ramp functions 


ee) al en ee 


2+2, —2<a2<0, 
f(x) = 
2-2, O0<2<2; 


aes —2<2<0, 


By extending these functions periodically: f(a) = f(a +4) and g(x) = g(x + 4), find their Fourier series. 
4. Compute the Fourier series for another ramp function: f(x) = |z|, -a <a <a. 


5. By differentiating the function in the previous problem, find the Fourier series for the square wave function 


(x) 1, -l<2 <0, 
Wr) = 
-1, O<a<l. 


In particular, choose ¢ = 2. 
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6. Find the Fourier series for the periodic function f(t) = |sint| (the absolute value of sin t). 


7. Find the Fourier series on the interval [—7, 7] of each of the following functions 
(a) sin? t — cos* ¢; (b) cos*t — sin‘ t; (c) sin 2t — cos” t. 
8. Find the Fourier series of the following intermittent function: 


i. Haigecea 
wey= {i if O<a<8. 


9. Find the Fourier series for a periodic function of period 6, which is zero in the interval —3 < x < 0, and in the interval 
0<2 <3 is equal to 
(a) 2; (b) 2”; (c) sina; (d)e” (0<a< 83). 


10. Find the Fourier series which represent the function of period 27, defined in the interval —a7 < t < 7 as equal to 
(a) e’; (b) t sint; (c) t cost. 


11. Suppose that f(a) is an integrable periodic function with period T. Show that for any 0 <a<T, 


i f(a) da = f(a) da. 


12. A function is periodic, of period 8. Find the Fourier series which represent it, if it is defined in the interval 0 < x < 8 
by 

5, if 0<a< 2; 

2, if 2<a<A4; 

—5, if d<2<6; 

-2, if 6<a<8. 


f(x) = 


13. Expand the function into Fourier series assuming that it is periodic with period 4: 


1, if -—2<2<0, (b) 0, if -—2<2<0, G —-a, if -2<2<0, 
c 
a, if 0<a2<2; a, if O0O<a@< 2; De, if 0<4<2. 


14. Expand the function into Fourier series assuming that it is periodic with period 6: 


0, if —-3<a<-1, 
f(w#)=43, if -l<2<1, 
0, if l<a4<3. 


15. Expand the function into Fourier series assuming that it is periodic with period 4: 


2+ 3, if -—2<2<0, 
f(z) = 
3-24, if 0<4<2. 


16. Expand the function f(x) = x°/3 into Fourier series assuming that it is periodic with period 4: f(a +4) = f(z). 


In Problems 17 through 20, compute the Fourier series for the given function on the interval [—2, 2]. 


3 —2 
—x2°, —2<2<0, e”, —2<2<0, 
ne= {5 O0<a2 <2; A i O0<ar<2; 
z+2, —-2<2<0, —a“?, -2<a2<0, 
18. = 20. = 
F(x) fe 0<2< 2; F() ei O0O<a<2. 


a 20, 
21. Find the Fourier series of the 3¢-periodic function f(x) = ee peeannes 


2. 
3-2, W<ar< 38. 


22. Find the Fourier series for the function 


In particular, choose ¢ = 2. 
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10.4 Convergence of Fourier Series 


The classical subject of Fourier series is about approximating periodic functions by sines and cosines. This topic 
became a part of harmonic analysis, which includes representations of functions as infinite series of eigenfunctions 
corresponding to a Sturm—Liouville problem. The sines and cosines are the “basic” periodic functions in terms of 
which all other functions are expressed. In chemical terminology, the sines and cosines are the atoms; the other 
functions are the molecules. However, this language is not strictly speaking appropriate because it does not reflect 
the existence of other atoms, other eigenfunctions, that can serve as the “basic” functions. 

The process of understanding is always facilitated if more complicated structures are known to be synthesized 
from simpler ones. In mathematics, we don’t usually get a full decomposition into the simpler things, but an 
approximation. For example, real numbers are approximated by rationals. Another example gives the Taylor series 
expansion of a function, which provides an approximation by a polynomial. Of course, for a function to have a 
Taylor series, it must (among other things) be infinitely differentiable in some interval, and this is a very restrictive 
condition. Sines and cosines serve as much more versatile “prime elements” than powers of x because they can be 
used to approximate nondifferentiable functions. 

In any realistic situation, of course, we can only compute expansions involving a finite number of terms anyway. 
We need some sort of assurance that the partial Fourier sums 


N 
ao NTx _ nner 
Sn (x) = ps + ) (an cos —— +b, sin a) (10.4.1) 
n=1 


converge to the given function as N — oo in some sense. Using completeness of the set of trigonometric functions 


NTL . NTL . : : : 
cos ——, sin —— in the space of square integrable functions, we obtain our first result. 
n>0 


Theorem 10.6: Let f(x) be a square integrable function on the interval |—Z, |, that is, || f ||? = iis) [f (a)? da < 
oo. Then the Fourier series (10.3.2), page 563, converges to f(x) in the mean square sense. 


However, the mean square convergence tells us nothing about pointwise convergence. Its study is both deep 
and subtle. Since the Fourier coefficients (10.3.5) are obtained by averaging the given function with trigonometric 
functions, we cannot recover the function from its Fourier coefficients without some further smoothness conditions. 
The most obvious problem is that if two functions differ at a discrete number of points, then their Fourier coefficients 
will be identical. 


Definition 10.8: An absolutely integrable function f(z) is said to be piecewise monotone or satisfy Dirich- 
let’s conditions on the finite interval [a, }] if: 
1. f(x) has at most finitely many points of discontinuity in the close interval [a, b], which can be broken into 
finitely many subintervals such that f(x) is either nondecreasing or nonincreasing on each subinterval; 


. the right limits of the function f(a) exist at every point zo from [a,b) (which we denote as f(a + 0) or 
simply f(z¢)) : f(zo + 0) =limgsa,40 f(x) if a < ro <b; 


. the left limits exist at every point xo from the interval (a,6] : f(wo — 0) = f(vo) = limg-42,-0 f(z) if 
a<2 <b. 


Note that a piecewise monotone function is bounded everywhere on the interval because it has left and right 
limits at every point: the function is either continuous or has finite jumps (in discrete number of points within the 
closed interval). So tan(x) is not a piecewise continuous function because it has infinite jumps at points ka + 7/2, 
k = 0,41,+2,.... Also the function x sin(1/a) does not satisfy Dirichlet’s conditions because it has infinite many 
local maximum and minimum points around x = 0. 

Other examples of piecewise monotone functions are given in the previous section. For instance, every sawtooth 
function (see Example 10.3.1 and Problem 2) is a piecewise monotone function but not continuous. Their derivatives 
are identically 1 except a discrete number of points where they are undefined. 

Sufficient conditions for the pointwise convergence of the Fourier series were discovered by P. Dirichlet!°4 in 1829. 
He proved that the Fourier series of a piecewise smooth integrable function converges at each point x to its average 
of the limiting values. 


104Peter Gustav Lejeune Dirichlet (1805-1859) was a German mathematician of French origin, a student of Poisson and Fourier. 
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Theorem 10.7: [Dirichlet] Suppose that f(x) is a piecewise monotone (satisfies Dirichlet’s conditions) periodic 
real-valued function with the period 2¢. Then the function f is represented by a convergent Fourier series 


= nTx 
a : by sin), 10.4.2 
+ 2th (a cos ——— + by sin —— ? ( ) 
whose coefficients are given by 

fe nit 

i f(t) cos —— dt, i. = O, 1, Bo oo08 (10.4.3) 
L J_¢ £ 
ees NTE 
-£ 


The Fourier series (10.4.2) converges to f(z) at all points where f is continuous and to 


5 ile +0) + f(x—0)| = lim 5 [f(a +e) + f(a —€)| (10.4.5) 


at all points where f is discontinuous. This convergence is uniform on any closed interval that does not contain 
a discontinuity of f. 


We are going to outline the main idea of the Dirichlet theorem, while the complete proof can be found 
elsewhere, for example, in [27]. 

Equation (10.4.2) means that at every point x in the interval [—2, ¢], the function f(a) is the limit of the partial 
sums 


f(z) = lim Sy(z), (10.4.6) 

N-0co 
where the trigonometric polynomial Sy(x) is given in Eq. (10.4.1). We have used t for the dummy variable of 
integration in Eqs. (10.4.3), (10.4.4) to avoid confusion with the fixed x for which we are considering the sum of 


the series equals f(a). Since x is a constant for the variable of integration, we substitute the expressions (10.4.3), 
(10.4.4) of the Fourier coefficients into the ae term of the sum (10.4.2): 


NTxL TX nit . nme , nit 
An COS —— + by au. =; on cos TT + sin —— sin —— } dt 


L L 
nm -_ 
— a) = bs 
veo 7 d 


Then the partial sum (10.4.1) can be expressed as 


9=7f 


We now introduce a new variable of integration t = x + u and get 
N 
1 fe }4 NTU 
S =- >= + cos —— 
N(2) => I, 2 2d 2 


To find a simpler expression for the finite sum sy(u) = st S cos — we multiply both sides by 2 sin $4. This 


n=1 


ie 
feta al ential ae, 


yields 
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Since the sum is telescopic, all the terms except the one before the last appear twice, with opposite signs, and so 
cancel out. This leads to 


ur ; 1\ um sin [ sin [(N +35) F] un 
2sin 7 sN(u uw) =sin | (+5) | = Sn(a <5 on f(wt+u)du. 


The next step involves representation of the partial sum S,j(x) in a symmetric form: 


naz f = ee zac een eC eee (10.4.7) 


and finding the limit as N — oo. To achieve this, the interval = ae. (0, 2] can be broken into two subintervals, 
one of length 5 > 0, [0,4], and the rest [6, ¢], where 0 < sin2$ < sin 4. Since the limit of the integral over the 
interval [6, 4] is zero, we get 


Tm 


Sul a=2f sin oe | [f(a +u)+ f(x —u)] du + o(1), 


where o(1) approaches zero as N - oo. Using the monotonic property of the function f(a) on [0,6], we apply the 
mean value theorem to obtain the desired formula (10.4.5). wi 

Note that the Fourier series (10.4.2) converges to the function that provides a periodic expansion with period 
T = 20 of the given function f(x) defined initially on the interval of length T. How many terms are needed to achieve 
a good approximation depends on the order of convergence of the general term to zero. The faster the general term 
decreases, the more accurate approximation partial sums provide. If a periodic function f(a) is continuous, with a 
piecewise continuous first derivative, integration by parts and periodicity yield 


1 e 
n bn <— ‘ : 
Janls [onl <== fh UP G@lde 


If f(z) has finite jumps, then its Fourier coefficients will decrease as 1/n as n —> co. When f(x) has more continuous 
derivatives, we may iterate this procedure and obtain 


an 1 
lanl, [bal < Sf P(e) dz, k= 1,2)... (10.4.8) 


The estimate (10.4.8) can be used in a comparison test to show that the Fourier series is absolutely dominated by 
a p-series of the form Dest n—?, which converges for p > 1. A condition that f’ is a piecewise continuous function 
is sufficient: it is known that for any discrete number of points, there exists a continuous function for which its 
Fourier series diverges at this set of points. In 1903, Lebesgue!”° proved that the Fourier coefficients a, and bp, of a 
Lebesgue- or Riemann-integrable function approach 0 as n — oo. Andrew Kolmogorov gave an example in 1929 of 
an integrable function whose Fourier series diverges at every point. For uniform convergence of the Fourier series, 
we need to impose an additional condition on the function. 


Theorem 10.8: Let f(x) be a continuous real-valued function on (—oo, co) and periodic with a period of 2¢. 
If f is piecewise monotonic on [—, ¢], then the Fourier series (10.4.2) converges uniformly to f(x) on [—@, ¢] and 
hence on any closed interval. That is, for every « > 0, there exists an integer No = No(e) that depends on ¢ 
such that 


[Su (x) — f(a 


> + 5 (an Gog) by, sin mt) — f(a) 


for all N > No, and all x € (—oo, 00). 


Theorem 10.9: Ifa function f(x) satisfies the Dirichlet conditions on the interval |—@, ¢], then its Fourier series 
converges to f(x) at every point x where f(a) has a derivative. 


105Henri Léon Lebesgue (1875-1941) was a French mathematician most famous for his theory of integration. 
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In many practical questions, we deal with piecewise continuous functions that are not differentiable. For example, 
music noise, which resembles Brownian motion, is an example of everywhere continuous functions that are not 
differentiable. Sawtooth functions and square waves are typical examples of functions that occur in music synthesis. 
In 1899, Lipdt Fejér!°° proved that any continuous function on a closed interval is the Cesaro!®” limit of its Fourier 
partial sums (10.4.1) on that interval: 


f(z) = lim Cy(ax), where Cy(az) = So(x) + Si(x) +--+ Sn(x) 


10.4.9 


Cesaro’s approximation is highly recommended because it speeds convergence and reduces problems caused by the 
Gibbs phenomenon (see §10.4.2). There is known another method that accelerates convergence and eliminates most 
of unwanted Gibbs oscillations (see §10.4.2) in partial sums. The method of o-factors represents a function as 
f(x) = limy-+o0 $n (x), where 


£/N 


N- 
Oe NTL , n\_N 
sn(t) => >> (an cos = ly, sin —— ? ) sine (+) ay ae Sn (a +t) dt, (10.4.10) 


where Sy(x) is the Fourier N-th partial sum (10.4.2) and the multiple sinc(n/N) = 4 sin (4¢) is called the 
Lanczos!°° o-factor. However, the question of convergence of the Fourier series is not the same as the question of 
whether the function f(x) can be reconstructed from its Fourier coefficients (10.4.3) and (10.4.4). This question 
is of great practical interest when a sound must be recovered from its Fourier coefficients transmitted through the 


Internet or by cellular phones. 


Example 10.4.1: (Example 10.3.1 revisited) The function f(x) = —a/2, expanded periodically from the 
interval (—7,7), is piecewise smooth but not continuous (see its graph in Fig. 10.8 on page 564). Therefore, the 
Dirichlet theorem is applicable for this function, but Theorem 10.8 is not. Indeed, the Fourier series 


n 7 _4)\k _4\k 
Sn(2) => 4 1 sinka  —+ s(a) => 1 sin ka 


k=1 k>1 


converges at every point, but it does not converge uniformly. According to the Weierstrass M-test, a series }~ koi tk (x) 

converges uniformly if the general term can be estimated by a constant |f,(x)| < Mn for all x in the domain and the 

(-1)* 
R 


series )> z>1 Mn converges. By estimating the general term sin ka < ts we see that the M-test does not work 


because it leads to the harmonic series }7),, k~', which diverges. However, the series S(x) converges uniformly 
(but very slowly) in every finite closed interval not containing a discrete set of points nm (n = 0,+1,+2,...) of 
discontinuity. The infinite sum S(a) defines a periodic extension of the function f(a”) = —a/2 outside the interval 
(—7,7), where S(a) = —a/2. 

Now we calculate the Cesaro partial sum 


and o-factor partial sum 


N tlh N (—1)* ; N N (1) a/N 
sn(@) = ee ; sink(a + t)dt = = », ; i sin k(a + t) dt 
N 
(-1)* N . (kr \., 
=5- i Gq sn W sin kx. 


106 ipdét Fejér (1880-1959) was a Hungarian mathematician who was born as Leopold Weiss in Jewish family (Weiss in German means 
“white” while the Hungarian for white is “feher”). As the chair of mathematics at the University of Budapest, Fejér led a highly successful 
Hungarian school of analysis being the thesis adviser of great mathematicians such as John von Neumann, Paul Erdés, George Pélya, 
Marcel Riesz, Gabor Szegé, and Pal Turan. 

107 Rrnesto Cesaro (1859-1906) was an Italian mathematician. 

108 Cornelius Lanczos (1893-1974) was a Hungarian mathematician and physicist. 
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By plotting two finite sums, the Cesaro Cio(x) with 10 terms and Fourier partial sum $45(a), we see that Cjo(2) 
gives a smoother approximation. The graphs of these partial sums are presented on the opening page 545 of the 
chapter. Their mean square errors are 


T 2 

Ke = 7 s+ Sw(a) dx => As ~~ 0.569643, Ags ~ 0.0690432; 
T 2 

Acn = i 5 + Cu(2)| dx => As ~ 0.80802, Ajo = 0.127738. 
T 2 

Ane ] s+ sv(2)| dx => As 0.955757, Aro ~ 0.476326. 


ili 
4, O0<aKd, 
woy= {1 = ben < 30. 

Qo” «626? i _—~ , 


0 30 
Calculating the Fourier coefficients according to Eqs. (10.4.3), (10.4.4), we obtain 


3e 
~~ a por a (5 - wi) a=. 


1 9 2nt Q2nrx 3 2nt 2Qnrx 
eae ge 2 1 : - 3 femauaio ° 
5 = S- crear) (cos 3 ) cos 30 + S- a sin 3 sin 30 


n2>1 


converges uniformly to f(a) on the interval [0,3¢] because its general term decreases as O (n~*). The sum defines 
a continuous extension of f(x) on the whole line. 


Example 10.4.3: Consider a continuously differentiable function on the interval (—7/4,7/4) that does not satisfy 
the Dirichlet conditions: f(x) = 2° sin(1/a) because it has infinitely many local maxima and minima in a neigh- 
borhood of the origin. The graph of f(a) along with its continuous derivative f’(a2) = 3x? sin(1/ax) — x cos(1/z) is 
presented in Fig. 10.19. Looking at Fig. 10.20, we are not convinced that its partial sum Fourier approximation 


N 
> + », a, cos(4ka), 


fn(x) = 


where 
4 m/4 
ak = =| ax sin(1/x) cos(4ka) da, k=0,1,2,..., 


WT Jon/4 
converges pointwise (the first 10 values of coefficients a, are given in Fig. 10.20). Nevertheless, the Fourier series for 
the function f(z) converges pointwise despite it does not satisfy (sufficient) conditions of the Dirichlet theorem. Also, 
the mean square error of its approximation tends to zero because f(a) is a square integrable function (Theorem 10.6, 
page 570). For instance, with N = 30 terms, the error is about Ago © 5.7 x 1077. 
10.4.1 Complex Fourier Series 


Sometimes it is convenient to express the Fourier series in complex form using the complex exponentials 


F k k 
elkrz/l — cog (=) +j sin (=) ; k=0, 


— 


(10.4.11) 
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———_ x43 xSin[1/x] 


———_.-. 3x2 Sin[1/x] — x Cos [1/x] 


Figure 10.19: Example 10.4.3, graphs of the function f(2) = 2° sin(1/x) along with its derivative, plotted with 
Mathematica. 
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Figure 10.20: Example 10.4.3, the graph of the difference between the function f(x) = x° sin(1/x) and its Fourier 
partial sum approximation with N = 30 terms. 


Here j is the unit vector in the positive vertical direction on the complex plane, so that j2 = —1. From the formula 
(10.4.11), called the Euler formula, it follows that 


fas (=) -_ ; (ere? dhe uy = Reibrt/l cin (=) _ = (cltrave _ ean) — & eikra/t. 
J 


The general term in the Fourier series (10.3.2), page 563, which we denote by 


def k . (k 
tr = ax COS (=) + by sin (=) 


_ ak (ern 4 as 4 Oe (are: - esl) 
2 2j 


can be rewritten in the following form: 


_ ak — jb 


‘haa ap + jb 
fr = ~S o (4 Roe 


—jkra/e _ QE eikra/t or en ikne/t 


where +b 
a and A_K wg ATE oo, fork > 1. 
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If we write the constant term as fo = ao = ao/2, we obtain an elegant expansion, 


oe >) oer, (10.4.12) 


k=—0o 


called the complex Fourier series. The coefficients in the complex series (10.4.12) are expressed through a compact 
formula: 


dof" 
a=s / f(x) eShre/e da, k=0,41,+2,... (10.4.13) 
= 


— ap—jbe 1 ° kr _ dual kra 
Qk = 5 = = fF 162) [cos ( =) j sin ( ? ) ax 


a jinn fe 
=— IRAE 
oT, I, f(ax)e dx. 


because 


If f(a) is a real-valued function, then the coefficients of the Fourier series (10.3.2) are expressed through the coeffi- 
cients of the complex Fourier series (10.4.12) as 


Gry = An + aK = 2Raz and be = j(ax — We) = —2 Sag, 
where Rez = Rz = Re(a+jb) = ais the real part of the complex number z = a+jb and Im z = Sz = Im(a+jb) =b 


is its imaginary part. Since the expression w = e)™*/ is a complex number of unit length, the complex Fourier series 
(10.4.12) can be considered as a Laurent series!?? on the unit circle: 


co . k foe) ; 
f(a)~ So ax Cae = wv, wee", (10.4.14) 
k=—co k=—0o 


The Dirichlet theorem (page 571) shows that the sequence of trigonometric functions j cos “, sin is com- 
g g e f Sn>0 


plete in the space of piecewise smooth functions, and for such a function we have Parseval’s identity!!° 


L co 2 [oe] lee) 
WF? = [ \t@Pac =u S- lol? =z (B+ yok + doa] (10.4.15) 
_ n=1 


k=—0o n=1 
where Fourier coefficients a, and ax, by are defined by equations (10.4.13) and (10.3.5) on page 563, respectively. 


Example 10.4.4: The function f(#) = 1+ a becomes a sawtooth function when extended periodically from the 
interval (—1,1). Its complex Fourier coefficients are 


1 jsin(kt) | j_; 
oe 1 —ikme gy — 2 PN Seika k=+1,+2,.... 
Ok a +a)e x ia, oe 4, 


Since sin(k7) = 0 for any integer k, the coefficients are simplified to 
J e-ikm = (coska —j sink) = = fay"; 
For k = 0, we have 


1 
ao 1 
as Ss ix +a) dx 


Hence, the complex Fourier series for the function f(#) = 1+ on the interval (—1, 1) becomes (compare with the 
series in Problem 2(b), page 568, from §10.3) 


foe) . -1 a 
tt+e=14+ 02-1 4 ~ (-1)' 


k=1 k=—oo 


= j k [ jkra —jkra = 1 Hy 
= oe (<1F [eo — ek ]=1- a= (—1)* sin(kra). 


109Pierre Alphonse Laurent (1813-1854) was a French mathematician best known as the discoverer of the Laurent series. 
1M0Tt is named after French mathematician Marc-Antoine Parseval (1755-1836), who discovered it in 1799. 


10.4. Convergence of Fourier Series 577 


Since the given function f(x) = 1+ 2 is odd, its complex Fourier series becomes a sine Fourier series (see §10.5). 
One can plot partial sums, using, for instance, Maple: 

N:=10: f£N:=x-> 1 — 2*sum((—1)**(k*Pi)(-Dxsin(k*Pi*x) ,k=1..N); 

plot (£N(x) ,x=—-1..1); 


Example 10.4.5: (Example 10.3.2 revisited) Using formulas (10.4.13), we obtain complex Fourier coefficients 
of the function from Example 10.3.2: 


a ia a oa 
au=5/ (-ayerrde +f ese]? dee 


4 2 0 
et i ee ee 
k2 7? ken = Oke | Qkr 
= a ns Ns potido.: 
2k 72 =f) k2n2 Dkr om | he) ) ( ee 
because ; 
ek — cos(kn) +4 sin(km) = (—1)*, k= 0,41,42,.... 


Note that ap = 1. 


Example 10.4.6: (Pulse streams) One of the practical applications in analogue synthesis where Fourier series 
play an important role is a stream of square pulses, which is defined by the function 


1, when0<t<p/2, 
f(t} =40, when p/2<t<T—p/2, 
1, when T-p/2<t<T. 


Here p is some number between 0 and T, and f(t +7) = f(t). The Fourier coefficients in complex series (10.4.12) 


—8 0 T-8 ToT+8 


ws 


Figure 10.21: Example 10.4.6: Pulse stream. 


become real numbers, given by 


Dp i Pe ~2kaj/T 1 . (krp 
=. _=— mM dt = — — See ose? othe 
ao ad Qk fe ka sin ; p22, 


By increasing T while keeping p constant, the shape of the spectrum stays the same, but it is vertically scaled down 
in proportion, so as to keep the energy density along the horizontal axis constant. 


10.4.2 The Gibbs Phenomenon 


Many functions encountered in electrical engineering and in the theory of synthesized sound are intermittent but 
not continuous. These functions include waveforms such as the square wave or sawtooth function. Other examples 
are given in §10.3.3. A piecewise continuous function cannot be represented by a Taylor series globally because it 
is comprised by distinct smooth functions. Each of these branches may have a Taylor series expansion, but they 
cannot be united into a single series. Therefore, the sequence of polynomials 1 hes is not an appropriate set for 
expansions in the space of not smooth functions. 

On the other hand, we saw previously that many intermittent functions can be expanded into a single Fourier 
series. Since basic trigonometric functions (sine and cosine) are infinitely differentiable, one may expect a problem 
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representing a discontinuous function by a Fourier series. The American mathematician Josiah Willard Gibbs (1839- 
1903) observed in 1898 that near points of discontinuity, the N-th partial sums (10.4.1) of the Fourier series may 
overshoot/undershoot the jump by approximately 9%, regardless of the number of terms, N. This observation is 
referred to as the Gibbs phenomenon, which was first noticed and analyzed by the English mathematician Henry 
Wilbraham (1825-1883) in 1848. The term “Gibbs phenomenon” was introduced by the American mathematician 
Maxime Bocher in 1906. The history of its discovery can be found in [21]. 

To understand the Gibbs phenomenon, we consider an example, say, the sawtooth function $(0) = 6/2 on the 
interval (—7,7). It has the following Fourier expansion (see Example 10.3.1, page 564): 


0 (—1)*+1 : 
(0) =5= S- —— sin(k6), (8 + 27) = (8). (10.4.16) 


k>1 


Figure 10.22: The graph of the periodically extended function g from the interval (—7, 7). 


This function satisfies conditions of Dirichlet’s theorem 10.7, page 571; therefore, its Fourier series converges to 
~(8) at every point 0 € (—7,7). Let 


(=) 
Sr(@) = ye —— sin kO 
k=1 
be the partial sum of the corresponding Fourier series. Although this function converges pointwise to 4(@) at every 
point, the convergence is not uniform. This means that for a given ¢ > 0, there exists N such that 


|(8) — Sn(9)| <e 


for all n > N. If this number N can be chosen independently of the point 6, we get a uniform convergence, which 
leads to a continuous limiting function. In our case, the convergence is not uniform and our limiting function is 
discontinuous. At the points of discontinuity 0 = (2n+ 1)7, n =0,+1,+2,..., the series (10.4.16) converges to zero 
because all of the terms are zero. The peak of the overshoot /undershoot gets closer and closer to the discontinuity 
while convergence holds for every particular value of 0. 

To demonstrate such an overshoot /undershoot, we differentiate S,,(0) to find its local maxima and minima. We 
consider only the interval —7 < 6 < m because S,,(27 — 0) = —S,,(@). Using the geometric series sum and Euler’s 
formula (10.4.11), we have (according to Eq. (10.3.9), page 568) 


1 0 0 
cog WT + 8) nm + nd 


ik (0+ 2 2 
(—1)¥ coskO =RY  edh(@+m) — ae 
nae! » cos(0/2) 


I 
Me 


S;,(9) 


Since the function in the denominator, cos (0/2), is positive throughout the interval —7 < 0 < 7, we need to consider 


27+1 il 0 
only functions in the numerator. Hence, the zeroes of S},(@) occur at 0 = Gite — a where cos eins) =0 
n 
27 0 
and at 0 = eam a where sin a =0, jy =0+1,+2,.... To apply the second derivative test, we evaluate 


nm 
S!'(0) at these points: 


Sn a ~ r) = (-1) : - ~ ese (S20) sin a zi mm) 


n+1 


2 


pees 


sR 
ie) 
SQ 
= 
| 
= 
oer 
II 
| 
wl s 
fo) 
© 
ot 
SIS 
NY 
& 
| 
oO 
= 


At these points, the partial sum S,,(0) attains either a local minimum or local maximum. These maxima and 


minima alternate. For instance, the first local minimum value of S,,(0) near the point 6 = —7 happens at 6 = as es 
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because $7’ (= - n) > 0. However, at the next point 6 = 201 the function S;, (9) attains a maximum because 
n 


Fold (= - r) <0. 
n+1 
The value of S;,,(0) at the first minimum is 


1 ney aoe ogee kr i sin 
n _— — SS —__ — k =_— SS ed 
(= n) = k o n+l a 5ar kr 


k=1 


because sin (9 — km) = sin@ coskm — cos @ sin ka = (—1)* sin@ based on identities coskm = (—1)*, sinka = 0. The 
expression in the right-hand side is the Riemann sum for the negative of the integral (denoted by SinIntegral in 
Mathematica) 

™ sint 


Si(z) = i, = dt © 1.851937051982468, 
0 


where Si(z) = i Sint dt is the sine integral. Since the exact value ¢(—m) = —1/2 + —1.570796327..., the Fourier 
sum undershoots it by a factor of 1.1789797. Of course, the size of the discontinuity is not > but 7; hence, as a 
proportion of the size of the discontinuity, it is half of 1.851937... /1.570796327... = 1.1789797 ... or about 8.9490%. 
After the series undershoots, it returns to overshoot, then undershoot again, and so on, each time with a smaller 
value than before. As n increases, more terms are added to the partial sum and the ripples increase in frequency 
and decrease in amplitude at every point. Correspondingly, both the highest peak (the overshoot) and the lowest 
peak (the undershoot) narrow and move to 6 = 7, the point of discontinuity. 

In general, if a function f(z) has a finite jump discontinuity at the point xo, the vertical span extending from 
the top of the overshoot to the bottom of the undershoot has length 


Si(rr) |f (ao + 0) — f (xo — 0)| & 1.1789797444721672 |f (xo + 0) — f(xo — 0)]. (10.4.17) 


For the sawtooth function $(0) = 0/2, we see that (a — 0) — (7 +0) =7. Therefore, Eq. (10.4.17) predicts an 
overshoot having approximate height 1.85. This prediction is confirmed in the figure on front page 545, where the 
graph shows that partial sums have a negative peak value of about —1.85 and a positive peak value of about 1.85 
near the point x = 0. 

The same function, considered on the interval (0, 27), has the different Fourier series 


0 =$=2-y sin kev. 


2. 22 
k>1 


The figure at the right clearly shows the Gibbs phenomenon at the point 9 = 0, where partial sums with N = 50 terms 
undershoot (0+) = 0 by approximately 0.25 and overshoot ¢(0—) = 7 by about the same value. When N -> co, 
these undershoots/overshoots increase in magnitude and approach approximately —0.281141/3.42273, respectively. 


Therefore, formula (10.4.17) allows one to predict the max- = 


imum/minimum error in the Fourier pointwise approxima- 
tion. 

The Gibbs phenomenon is a good example to illustrate 
the distinction between pointwise convergence and uniform 
convergence. For pointwise convergence of a sequence of 
functions S,, (@) to a function ¢ (6), it is required that for 
each value of 0, the values S;, (9) must converge to ¢ (0). 
For uniform convergence, it is required that the distance 
between S;, (0) and ¢(@) is bounded by a quantity which 
depends on n and not on 6, and which approaches zero as 


n tends to infinity. [ 
-0.28 


580 Chapter 10. Orthogonal Expansions 


1. Prove the formula 
sin (m + 3) 0 


sin (5 0) 


Dm(0) = S> ei”? 142 Se = 


n=—m n=1 
The function Dm(6) is called the Dirichlet kernel. 
2. Prove the following properties of the Dirichlet kernel: 


(a) Dm(0)=2m+1; (b) [fo Paleyae = x; (c) "Deas 25 Oe 4a): 


3. Prove the formula 


The function Km(@) is called the Fejér kernel. 
4. Find the complex Fourier series for the function (2 = e}*) ae (2 - ee) lal <7. 
l/h, for —h/2<a<h/2, 


5. Find the complex Fourier series for the function h(a) = 
0, elsewhere, 


on the interval (—3h, 3h), where h > 0. 


1, -l<2x<0O, 


6. Consider the 2-periodic function f(x) = 
az, O<a<l. 


(a) Find its Fourier series. Observe the Gibbs phenomenon for the corresponding Fourier series: what is the highest 
peak you expect near the point of discontinuity x = 0? 


(b) Calculate Cesaro partial sum C'9(#) and o-factor partial sum s19(x) with 10 terms. By plotting these sums, do 
you observe the Gibbs phenomenon? 


7. Consider the function of period 12 defined by the following equation on the interval —6 < x < 6: 


0, -6<2<¢-3, 

z+3, -3<2<0, 
f(z) = 

3-2, 0<2#<3, 

0, 3<2<6. 


Find its Fourier series. By differentiating, determine the Fourier series for its first derivative f’(x). Which of these two 
Fourier series for f(x) and f’(x) converges uniformly? 


8. Does the function f(a) = 2? sin(1/a) satisfy the conditions of the Dirichlet theorem? 
9. Acceleration of convergence. Consider the series 


Se) = To sin ka, 0<a<z7, 


k>2 


which converges rather slowly. Using the identity po- = ¢ + RRP? we can represent S(a#) as the sum S(x) = 
f(z) + 9(x), where 


fle) =z sinke, ale) = 0 gay sink. 


k>2 k>2 


Find the explicit formula for f(a). Note that g(x) is a continuous function because its Fourier series has the general 
term that decreases as k~* when k > oo. 


In each of Problems 10 through 13, assume that the given function is periodically extended outside the original interval (0,7). 
(a) Find the Fourier series for the extended function. 

(b) 

(c) What is the highest peak value predicted near x = x for the partial Fourier sum? 

(d) 


Calculate the least square error for partial sums with N = 10 and N = 20 terms. 


Sketch the graph of the function to which the series converges on the interval (—7, 27). 


10.4. 


10. 
11. 


14. 


15. 


16. 
17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 
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f(z) = 2? — 2; 12. f(a) = cos*(x); 
f(x) = cos(x/2); 13. f(x) = sinha. 
Prove the formulas: 


 jxt _ sin(nt/2) jon4iye/2 j(2e—1jt _ Sin(nt) int 
= t#2 =o t : 
» e sin(t/2) e€ (t 4 2mrn), » e€ int) e€ (t A mn) 


Express explicitly the Cesaro partial sum 


So(x) + Si(z) +--+ + Sw(a) 
N+1 
similar to Eq. (10.4.7). 
Find the complex Fourier series for the function (3 - e}*) me (3 a ee) |ja| <7. 
Find the complex form of the Fourier series for f(a) =e” on the interval (—7, 7). 
0, -l<a<0, 
l-a, O<a<l. 


Find its Fourier series. Observe the Gibbs phenomenon for the corresponding Fourier series: what is the highest peak 
you expect near the point of discontinuity « = 0? 


Consider the 2-periodic function f(x) = 


Find the Fourier series of a square pulse function on the interval (0, 27): 
s(x) =1 for |2| <7, s(z)=0 for a < |a| < 27. 


Assuming that the function f(x) = 2? 


Does it converge uniformly? 


is extended periodically from the interval (0,2), expand it into the Fourier series. 


Assuming that the function g(x) = 2? 


series. Does it converge uniformly? 


is extended periodically from the interval (—2,2), expand it into the Fourier 


Find the Cesaro partial sums for each of the functions f(x) and g(x) in two previous exercises. Which of these partial 
sums gives better approximation compared to the Fourier partial sums? 


“e, =2<2<0, 

1, O<a2<2. 

Find its Fourier series. Observe the Gibbs phenomenon for the corresponding Fourier series: what are the highest peaks 
you expect near the points of discontinuity « = 0 and x = 2? Then find and plot the Cesaro partial sums; do you 
observe the Gibbs phenomenon? 


Consider the 4-periodic function f(x) = 


Find the Fourier series for the function and determine the maximum peak its partial sums are approached, 
e+1, -2<2<0, 
r)= x+4)= f(x). 
f(@) i pence, f@t4 =F) 
Acceleration of Convergence. In this problem, we show how it is possible to improve the speed of convergence of 


a Fourier series. Consider the series 


S(x) “Spa cos (=) sin kx, 0<a<7, 


k>1 


which converges rather slowly. Using the identity wo = ¢- ESL we can represent S(z) as the sum S(x) = 


f(x) — g(x), where 


f= yr: ea (=) sinkx, (2) =~ aeT Ges (=) i 


k>1 k>1 


Find the explicit formula for f(a). Note that g(x) is a continuous function. 


In each exercise, expand the given function into complex Fourier series (10.4.12), page 576, on the interval | — 7, z|. 


26. 


f(t) =e", a #0; 27. f(t) =coshat, a 4 0; 28. f(t) = (2—cost)7'; 29. f(t) =4+t. 


582 Chapter 10. Orthogonal Expansions 


10.5 Even and Odd Functions 


Recall that f is called an even function if its domain contains the point —x whenever it contains the point x, and 
if 

f(-2) = f(a) 
for each x in the domain of f. Similarly, f is said to be an odd function if its domain contains the point —x 
whenever it contains the point x, and if 


f(-«) = —-f(z) 
for each x in the domain of f. Any function f(x) can be decomposed uniquely as a sum of its even part and its odd 
part: 
f(z) + f(-—«# x) — f(-—a« 
pia) = f@4ICA) , H)~ Feo) 


Geometrically, an even function is characterized by the property that its graph to the left of the ordinate is a mirror 
image of that to the right of the vertical axis. The graph of the odd function to the left of the ordinate is obtained 
from that on the right by a single rotation of 7 radians about an axis through the origin perpendicular to the 
coordinate plane. 

Any function that is a linear combination of monomials x? with even (odd) powers p is an even (odd) function. 


. . . . . 2k . . . 
Since the Maclaurin series for cosine function cosx = Dezo(—1)* Ger contains only even powers, it is an even 


function. Similarly, the sine function sing = Deso(—1)* Sea is an oie of an odd function. A sum or 


difference of two or more even functions is an even function; for instance, 2? + 1 — cos is an even function. On the 
other hand, a product of two even or odd functions is an even function, while the product of an even and an odd 
function is an odd function. For instance, sin? x and cos? x are both even functions, while 2 sina cosa = sin(2z) is 
an odd function. 

If we average an even function over a symmetrical interval —a < x < a, we get the same result that we would 
obtain for the interval —a < x < 0 or for the interval 0 < x <a: 


[ stoyav=— fateyax= f ofa), 


for an even function g(a) = g(—a). If we average an odd function over the interval —a < x < a, we get the negative 
of the average for the interval 0 < x < a, and the average for the interval —a < x < 0. For any odd function f(z) 
and any positive number a, we have 


0 a a 
f(e)az=— f f(a) da => f(x) dz = 
= 0 —a 
For example, if g(a) is even and periodic with period 2@, then the product of sin(kaa/@) and g(a) is odd for any 
integer k. Thus, the Fourier coefficients b, in Eq. (10.4.4), page 571, are all zeroes. Similarly, if f(a) is odd and 
periodic with period 2¢, then cos(kra/) f(a) is odd, and so the Fourier coefficients a; in Eq. (10.4.3), page 571, are 
all zeroes. 

Therefore, when an odd function is represented by a Fourier series, its expansion will be a sine Fourier series, so 
all coefficients of cosine terms are zeroes: ag = 0 and a, = 0 for all n; hence, 


k 
f(x) = S- by sin a (10.5.1) 
n>1 
where 
=F af f(a ) sin So = 7 Of, f(x ) sin a, oe 1, 2y2226 (10.5.2) 


We refer to this series (10.5.1) as the Fourier sine series. It can be considered a series for the function f(x) with 
a domain of the interval [0, ¢], which is extended in an odd manner to the interval [—, 0] (that is, f(—«) = —f(«)). 
Similarly, if a function g(x) is an even function on an interval [—@, ¢], then its Fourier series contains only cosine 
functions. Therefore, such a series is called the Fourier cosine series (all coefficients b,, in Eq. (10.4.4) are zeroes): 


ae > Qn, COS —_ (10.5.3) 


n21 
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where 


£ L 


A function on an interval [0, 2] can be extended in either an odd way or an even way on the interval [—@,0]. As a 
result, the same function can have two different Fourier series representation on the interval [0,¢] with respect to 
cosine and sine functions, depending on its extension. 


2 a 
— if g(a) cos de,  n=0,1,2,.... (10.5.4) 
0 


0 £ 2e 3€ 


Figure 10.23: Graph of the function f(x) from Example 10.5.1. 


Example 10.5.1: Consider the function defined on the interval [0,32], 2 > 0, by the equation 


Figure 10.24: Partial Fourier sum approximation (10.4.1) of the function f(x) with N = 5 terms. 


Figure 10.25: Graph of the even expansion of f(a) from Example 10.5.1. 


Its cosine Fourier series is 
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SF FT 


Figure 10.26: Partial cosine Fourier sum approximation of the function f(a) with N = 5 terms. 


0 £ 2¢ Ae 


Figure 10.27: Graph of the odd expansion of f(a) from Example 10.5.1. 


Figure 10.28: Example 10.5.1: Partial sine Fourier sum approximation of the function f(x) with N = 5 terms. 


Example 10.5.2: Consider the function f(x) = x? on the interval [0,2]. First, we extend it in an even way (see 
Fig. 10.29(a)), which leads to the Fourier cosine series 


2 ao NTL 
Lo = ot ) Qn, COS a 
n2>1 


where 


2 
8 
a= f x dx = 3, 


‘ 2 NTL 1 
Qn = | 2&* cos— da = —> cos(n7) = => 
0 2 n 


This yields the following cosine series: 


For N > 0, its partial sum approximations are 


N 

4 16 —1 

2 ~ Cn(t)=3+>) ( i 
n=1 


n 


cos (=) . (10.5.5) 


ow we extend the function x* into a negative semi-axis in an odd way (see Fig. 10. , which leads to the 
N d the f i = gati i-axis i dd Fig. 10.29(b hich lead h 


sine Fourier series oe 
a = Sb, sin (*) (0< a < 2), 


no>1 
where 
2 
2. (NTx _ 16 a 8 in 
n= f x sin(“*)a = 3 (-1)" - 1] -=— (9) 
Therefore, 
—1)" 2k+1 
ay 1) nme 32 1 (2k + ate 
1 n 2 m3 (2k +1) 2 
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(a) | (b) — 


Figure 10.29: Extensions of the function x? to the negative semi-axis: (a) evenly, (b) oddly, and (c) in periodic 
way. 


Figure 10.30: Example 10.5.2. Partial sum approximations with N = 15 terms of (a) Sy(a) and (b) Cy(z). 


and its N-th partial sum becomes 


1 _ (2+ 1)ra 


x? ~ Sn(a = 


where we use only odd indices: n = 2k +1 in the latter sum. This series is comprised of two series that converge 
at different speeds: the latter converges uniformly to a continuous function because its general term decays as 
(2k + 1)~%. However, the former converges to a piecewise continuous function. Therefore, the trigonometric sine 
polynomials Sjv(2) converge to x? on interval (0,2) much slower than Cy(a), the cosine series. Such behavior 
of partial sums Syy(a) and Cy(zx) is predicted: the even expansion of x? is a continuous function, while its odd 
expansion is discontinuous. 
For the periodic extension (see Fig. 10.29(c)) with half period ¢ = 1, we have the general Fourier series 
—_ a 
x + Sol [A, cos(nrx) + B, sin(nrx)] , 
n21 


0 1 2 8 
Ay = [ @+2)Pde+ [ vde= | ede——, 
-1 0 0 3 


where 


f 1 
4 
Ags fe + 2)? cos(nma) dx +f x cos(nmx) dx = nen’ 
0 1 4 
By, = / (x + 2)? sin(nma) dx +f a? sin(naa) dx = ——. 
bas : nT 


This leads to 


4 4 1 41 
2 _ 
x ae ee 72 608 ( nx) ==> 7 ot n(nmz) (0< a < 2). 
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To estimate the accuracy of these truncated series, let us summarize the calculations of the N-th partial sums for 
different values of N at « = 2 and the corresponding mean square errors, Ag, Ag, and Ap, for each approximation, 
in the following table: 


Fourier True 
series series series value 
; 


2.74809 |} 3.84572 | 0.000753343 |} 1.96143 | 0.154325 
2.44911 |] 3.92094 | 0.000101564 || 1.98023 | 0.0790706 
0.0645216 || 3. d 7m 1.99597 | 0.0161307 
0.0064813 || 3.99838 | 8.7 x 10~'° |} 1.99959 | 0.0016207 


Here Ag(N =f |x? — Sn (a thy dz, Ac(N =f |x? — Cn (x ? dx, and Ar(N =f |x? — Frv( x) |" da are mean 
square errors = each approximation. At a point 7 = 1 al the given nee x? is ae we have 


1.12562 | 0.993457 1.00183 
0.936559 | 1.00183 1.00048 
0.987269 | 1.00008 1.00002 


The Fourier series partial sums (except the cosine series because an even extension of x? is a continuous function) 
demonstrate the Gibbs phenomenon near the points of discontinuity « = 0 and x = 2, which is clearly seen from 
Figures 10.30(a) and 10.31. 


Figure 10.31: Partial Fourier sum approximation, with N = 15 terms, of the function x? extended periodically with 
period T = 2. 


As previous examples show, a function defined on a finite interval [0, @] may have three different Fourier series 
expansions depending on how the function is extended outside the given interval. Its selection is usually dictated by 
the reason of such series that may have different rates of convergence. 

Sometimes we can make another observation when Fourier coefficients with even indices or odd indices are zeroes, 
so we need a definition. 


Definition 10.9: A periodic function with period T that satisfies the relation 


t(e+5 


is called an odd-harmonic function or half-period antisymmetric. Similarly, the function g(x) that satisfies 


T 
g (: + 5) = g(x) is referred to as an even-harmonic function or half-period symmetric. 
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For example, sine and cosine are odd-harmonic with T = 27 because sin(x+7) = — sin and cos(%+7) = —cosz. 
Any function ¢(a) can be represented as a sum of its half-period symmetric and antisymmetric parts: 


_ 6(a) +0(@+T/2) , oe) — d(@+T/2) 
dq) =e ewe 
2 2 
Products and sums of half-period symmetric and antisymmetric functions have the same properties as corresponding 
properties of even and odd functions. Namely, a sum of two odd-harmonic functions is an odd-harmonic one. 
Similarly, the product of two odd-harmonic functions is an even-harmonic one, and so on. If f(x) is half-period 
antisymmetric and g(x) is half-period symmetric, then 


T T/2 T T/2 
ii f(a) dx = — : f(x) da and | g(x) da = - g(x) dx (10.5.6) 


T/2 T/2 


7 ng Ts 
and so f(x) dx = 0 and i g(x) dx = 2 f g(x) da. 


kira . Qkra 


krax Qkhrx : big jG 
The trigonometric functions sin —— = sin and cos —— = sin are both half-period symmetric if k is 


even, and half-period antisymmetric if k is odd. Therefore, if g(x) is half-period symmetric, g(a+T'/2) = g(x), then its 
Fourier coefficients with odd indices (a2,41 and bg,41) are zeroes, while if f(x) is antisymmetric, f(a+T/2) = —f(x), 
then its Fourier coefficients with even indices (a2, and bo) are both zeroes. 


Theorem 10.10: If a periodic function f(x) with period T is also an odd-harmonic function, then its Fourier 
series is 


2n + 1)27rx » (2n + 1)2aa 
i) = se G@2n+1 COS a ar S- bon+1 sin ae , (10.5.7) 
n>0 n>0 
caapee (2n + 1)2at ee (2n + 1)2nt 
n+l = 7 ——a pil = i —$—$ <= 10.5.8 
Q2n+41 zh f(t) cos F dt, ben+1 zf f(t) sin i dt ( ) 
If a periodic function g(a) with period T is also an even-harmonic function, then its Fourier series is 
ao Ania _ Anirax 
g(x) = > +) aan cos—_— + ) | bon sin —— (10.5.9) 
nol n21 

A Anit Ang he Annt 
nS a ; r== i dt. 10.5.10 
Aan = 7 | f(t) cos r dt, be r | f(t) sin r ( ) 


Example 10.5.3: The square wave sounds vaguely like the waveform produced by a clarinet: 


. if0<a<q7, 
s(x) = 
-1, ifa@<a2< 2q; 


When the function s(a) is extended periodically to all values of x with period 27, it becomes an odd function and 
its Fourier expansion contains only sine functions. Moreover, since it is also half-period antisymmetric, its Fourier 
coefficients can be calculated using either Eq. (10.4.4) or along half period interval according to Eq. (10.5.8): 


s(a + 27) = s(x). 


Le a ak 1 sin(nt)|" 1. sin(nt) |? 
On = - | cos(nt) dt — - | cos(nt) dt = — anny a, =0, 
T Jo T dn T Nn | 7 a 
ae 1 7 1 cos(nt)|" 1 a1 a 
by = - | sin(nt) dt — ~/ sin(nt) dt = —— cot) pe) 
T Jo T dn T nm |reg 7 a 
_1/ (-1)" n i Ps 1 (-1)"\ J4, ifn=2k+1 is odd, 
ot n non n 10, ifn is even, 
9 Tv 
borg = — in(2k + 1)tdt = ——— k=0,1,2,.... 
2k+1 =f sin( + ) m(2k +1)’ payeey 
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Figure 10.32: Example 10.5.3, graph of the partial sum 4 ae snes 


Therefore, the sine Fourier series for this square wave is 


sin(2k + 1)z a : sin3x2 sindaz 1, if0<a<q, 
yo = sin z+ + fee J = 
T 


= 2k+1 3 5 -1, ifa@<a< 2n. 


The above series does not converge uniformly because its general term approaches zero as (2k + 1)~+. Let us 


consider its partial sums S;,(z) = )7/_9 De By plotting its graph (Fig. 10.32), we definitely observe the 
Gibbs phenomenon. The estimate (10.4.17) on page 579 predicts the height of an overshoot to be about 1.18. 


Figure 10.33: Graph of the periodic expansion of g(a) from Example 10.5.4. 


Example 10.5.4: Consider the continuous function of period T = 8 defined by the following equations in the 
interval -4< a4 < 4: 


(a+4)/2, —4 
—«/2, —2 
x/2, 0< 
(4—2)/2, 2< 
The graph of this function is shown in Fig. 10.33. This function cannot be represented by a single Taylor series, but 
a Fourier series is appropriate in this case. The given function is both even and half-period symmetric. Therefore, we 


can find coefficients in its Fourier series using either standard Euler—Fourier formulas (10.3.5) or half-period formulas 
(10.5.4) or Eq. (10.5.10) for half-period symmetric functions—they all yield the same Fourier cosine series: 


nme 1 og (Lt 2k) te 
=5+ Dian cos—-= 5+) An cos —— a eae a 
n2o1 n21 a" 


g(x) = 


Si 


where coefficients a, and A, can be calculated as follows 
1/4 1 7 2 
An = ;/ g(x) cos de - sf g(x) cos dz, A, = | g(x) cos da. 


20 
ip Evaluate / sin(sin x) sin(4a) da. 
0 


2. Determine whether the given function is odd, even, or neither. 


10.5. Even and Odd Functions 589 


(a) V1 +27; (c) cot 2a; (e) «1/3 +sin a; 
(b) x? + 42°; (d) «'/? cosa; (f) csc x”. 


3. If f(x) = 2? — 2°, 0 <a <1, sketch the graph for —3 < x < 3 if 


(a) f(x) is an odd function, and of period 2; 

(b) f(x) is an even function, and of period 2; 

(c) f(x) is of period 1; 

(d) f(a) is an odd-harmonic function, and of period 2. 


4. Which of the following are odd functions, which are even functions, and which are odd-harmonic functions? 
(a) 65 cos11a+5 cos33a —cos992; (b) 5 sinda — 4 sin 12% + 3 sin 20a. 


5. Prove that if a function is both an odd function and an odd-harmonic function, in determining the coefficients of the 
odd sine terms, we may take the averages from 0 to T'/4. 


6. Prove that if a periodic function of period T is both an even function, and an odd-harmonic function, in determining 
the coefficients of the odd cosine terms, we may take the averages from 0 to T'/4. 


7. Find the sine and cosine Fourier series of the function defined on the interval [0, 3]: 


x+2)?, if0<2<1, 
ra) ={! fe 
c, ifl<a2<3. 


8. Show that the sine series of period T = 2¢ for the constant function f(x) = 7/4 on the interval 0 < x < £ is 
a> sin ees sophie te A ey SE Ee BE 
ra e may 20 cae Oe ae 
9. Prove the identity (established by L. Euler in 1772) 


4A yk 228 (2k 4+ 1)x 1, |x| < %, 
=a 2k+1 -1, -$<|z|<7. 


10. Find the Fourier series of each of the following functions on the interval |x| < 7: 
(a) sin’ z; (b) cos* a; (c) sin* 2; (d) sin 2x cos x. 


11. If a function is expanded in a sine series S() of period 2@, which represents it on the interval (0, €[, and a cosine series 
1 
C(a) of period 2é, which represents it on the same interval, show that 5 [S(a) + C(x)] is a Fourier series for the function, 
which is zero on the interval —¢ < x < 0, and equal to f(x) on the interval 0 < x < £. 


12. Consider the function f(a) = x? on the interval (0,2). Find the Fourier series, the sine Fourier series, and cosine Fourier 
series for this function on the given interval. Which of these series converges the fastest? 


In each of Problems 13 through 20, assume that the given function is periodically extended outside the given interval (0, £). 
If explicit formulas are not possible to obtain, use numerical approximations with at least 6 decimal figures. 


(a) Sketch the graphs of the even extension and the odd extension of the given function of period 2¢ over three periods. 
(b) Find the Fourier cosine and sine series for the given function. 


(c) Calculate the least square error for partial sums with N = 10 and N = 20 terms. 


13. f(x) =a2(4-—2), 0<a4<4; XL; O0<a<rT, 
16. f(x) = 4 7, T< ax“ < 2r, 
14. f(x) = *, 0<a<§F; LZ-T, @W<u< 3a; 
2 
ze, O<a#<1, 17. f(z) =22 —2, 0<2<1; 


15. f(x) = 41, l<ar< 2, 
0, 2<4<3; is. f@) ===, U<e< 4; 


x 
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Summary for Chapter 10 


0, 0<a<l, sina, 0<a4<T7, 
f(x) = 41, 1<a2< 2, 20. f(x) = <0, T<ax < 2n, 
(x—1)?, 2<a<3; cost—1, I@a<u< 3n. 


Summary for Chapter 10 


1. 


For a linear operator L : V — V, a complex number J is called its eigenvalue if there exists a nonzero (also called 
nontrivial) element y such that 
Ly =dy. (10.1.1) 


The corresponding solution y of Eq. (10.1.1) is called the eigenfunction (or eigenvector) of the operator L. The problem 
of determination of eigenvalues and eigenfunctions is called the Sturm—Liouville problem. 


The norm of the function f(x) (with weight p > 0) is 


IIe = j=), Fa)? p(x (10.2.2) 


Convergence with respect to norm (10.2.2) is referred to as L?-convergence or mean-square convergence. 


A square integrable function f(x) can be expanded with respect to the complete set of eigenfunctions: 


= Sen tinle), (10.2.11) 


k>1 


where the coefficients 


(f, de) 7 
a |e |I? = yap fic £) be(x) p(x)dz, k=1,2,.... 


are called the Fourier constants (or coefficients) of f(x) with respect to the set of orthogonal functions {¢4(x)},5, 
and the weight function p(x). 


A periodic function f(x) of period T = 2¢ can be expanded into the Fourier series 


k 
~ BD a cos =) +0 sin (2), 
ar 
m=Zf s ) cos (AT) a, n-Z[ fle) si ) sin (22) a. 
T/2 T 


where 


5. For a square integrable function on the interval [—Z, ¢], its Fourier series converges to f(a) in the mean square sense. 


6. Complex Fourier series: f(x) ~ 02_., ax ef*™*/", where 


e 
Or, = a / f(z) e skra/l ay k =0,+1,+2,.... 
x |, 


At the point of discontinuity, the Fourier series overshoots/undershoots the jump by about 8.9%. This observation is 
referred to as the Gibbs phenomenon. 


An odd function can be expanded into the sine Fourier series 


where 


=3 [ H@) ) sin 52% de = tf tes dx, k=1,2,.... 


An even function can be expanded into the cosine Fourier series 
NTL 
pean y An COS 7 
n>1 
where 


NT 


e 
an =F fale) cos de, n=0,1,2,.... 
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Review Questions for Chapter 10 


Section 10.1 

In each of Problems 1 through 6 find the eigenvalues and eigenfunctions of the given boundary value problem. Assume that 
all eigenvalues are real. Hint: Seek a solution to the Euler equation ax*y” + by’ + cy = 0 in the form y = 2” (c1 cos (k Inz) 
+ce2 sin (k Inz)). 


1. 2?y” —Tay’+Ay=0 (0<a2 <4), y(1)=0, y(4)=0. 

2. ay” +ay’+rAy=0 (0<2 <2), y(1)=0, y(2)=0 

3. ay” —3ry'+rAy=0 (0<2 <3), y'(1)=0, y(3) =0. 

4. o?y"” +5ry’'+rAy=0 (0<2<4), y(1)=0, y'(4)=0. 

5. 2y"” —32y’+rAy=0, (0<a2< 2), y41)=0, y(@)=0, @>1. 
6. ay” +5ry’+Ay=0, (0<2< 8), y1)=0, y(@)=0, €>1. 


In each of Problems 7 through 15 assume that all eigenvalues are real. 


(a) Determine the form of the eigenfunctions and find the equation for nonzero eigenvalues. 
(b) Determine whether = 0 is an eigenvalue. 

(c) Find approximate values for Ai and 2, the nonzero eigenvalues of smallest absolute value. 
(d) Estimate A, for large values of n. 

7 yy’ +rA(A =0 (O<a< 1), y'(0)=0, y(1) 4+ 4y'(1) =0. 

8 y+4\y=0 (0<a<1), y(0) — 3y'(0) =0, y’(1) =0 

9 y’+rAy=0 (O<2x<1), 3y(0) + y'(0) =0, y(1) =0. 
10. y”+9y=0, y(-¥s)=0, y(1s) =0. 
lL. y+ 4y'+(A4+3)y=0 (0<2< 1), y(0)=0, y(1) =0. 
12. y’—-8y'+rAy¥=0 (0<2<3), y(0)=0, y/(3) =0. 
13. y’ +6y’+Ay=0 (l<2x<3), y(1)=0, y(3) =0. 
14. y” — 8y/ + (16+A)y=0 (O<2< 1), (0) = 0, (1) =0. 
15. y’ +6y +(8+Ay=0 (0<2< 1), y(0)=0, y/(1)=0. 
16. Consider the Sturm—Liouville problem 

ay” = 3A (ay’ a y) ; y(1)=0, y(2)=0. 


Show that its eigenvalues are complex numbers. 


17. Determine the real eigenvalues and the corresponding eigenfunctions in the boundary value problem 
y" +2y-AQyt+y')=0, (0) = 0, (3) = 0. 


18. Solve the Sturm-—Liouville problem for a higher order differential equation 


y9 —My=0 O<2<2%), y(0)=y/(0)=0, y"()=y"( =0. 


19. In some buckling problems the eigenvalue parameter appears in the boundary conditions as illustrated in the following 
one: 
yO +My" =0 (O<a<%), — y(0)=0, y(0)—2dy"(0) = 0, y(2) = y'() =0. 
Solve the Sturm—Liouville problem and find the smallest eigenvalue. 


20. A quantum particle freely moving on a circle is modeled by the Schrédinger equation and the periodic boundary 
conditions 


Solve the corresponding Sturm—Liouville problem. 


The Priifer’’ substitution is aimed at replacing unknown variables y(x) and y’(z) in the self-adjoint differential expression 
10.1.7) with an equivalent pair of variables R(x) and 6(x) according to the equations 
g 


p(x)y"(x) = R(x) cos(O(x)), (a) = R(a) sin(A(x)). 


The variables R and @ are polar coordinates in the Poincaré phase plane (py’,y); they are referred to as the amplitude and 
phase variables, respectively. 


111Frnst Paul Heinz Priifer (1896-1934) was a German mathematician from the University of Miinster. 
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21. Show that the Priifer variables R and 6 satisfy the polar equations 
R? = (py')? +y’, tang — 4. 
(py) +y i 
22. Show that, in terms of the Priifer variables, the self-adjoint differential operator (10.1.7) is transformed into a pair of 
first order differential equations 
dé/dx = (Ap — q) sin? 6 + < cos” 6, 
dR/dz = (p+ —Ap+ q) R sin @ cos 0. 


Section 10.2 of Chapter 10 (Review) 


1. Express the root mean square value of a function f(x) for the interval a < x < c, denoted as f,., in terms of the average 


ac) 


of the same function for the intervals a < « < b, denoted by f,,, and b < x < c, denoted by f,,, and the lengths of 
these intervals. 


2. Find the norm for each of the following functions on the interval indicated 
(a) e*; O<a2<7/4; (b) 1/2’, 2<2<5; (c) 2a*-4r+3, O<2 <2. 
3. By taking the proper averages of the expressions 
(a) cos(a+a)=A+ Ai cosx+A2sinz, (b) sin(e+a)= B+ Bi cosx+ Be sing, 


determine the values of coefficients A’s and B’s. 
mie if0<2 <2, 


; over the interval 0 < 2 < 4. 
4-2, if2<a<4; 


4. Find the norm of the function f(x) = 


cosz, if0<a< 7/2, 


; over the interval 0 < x < 27. 
0, if t/2 <a < 27; 


5. Find the norm of the function f(x) = 
6. What constant function is closest in the least square sense to sin? x on the interval (0,7)? 


3 


7. What multiple of cos x is closest in the least square sense to cos’ x on the interval (0, 27)? 


8. Let Ly = y™. Suppose that the domain of L consists of all functions that have four continuous derivatives on the 
interval [0, 2] and satisfy 
y(0)=y'(0)=0 and —y"(¢) = y"(€) =0. 
Show that L is a self-adjoint operator. 


9. Show that the series ¢1 (x) + ae fees oe +--+ is not the eigenfunction series for any square integrable function, 
n 

g2(x) n(x) 
J/2 In2 Jn Inn 


10. For what positive value of a will the functions x and sin z be orthogonal on the interval [0, a]? 


but the series ¢1(x) + +--+ is. 


11. Consider the Sturm—Liouville problem (10.2.4), page 555, where p(x), p’(x), q(x), and p(x) are continuous functions, 
and p(x) > 0, p(x) > 0. 


(a) Show that if \ is an eigenvalue and ¢ a corresponding eigenfunction, then 


é e 
rf P paw = | (p6? + 06%) de +S HI + 2 v(0)6*(0), (10.5.1) 
0 0 Ba Bo 

provided that 89 4 0 and §; 4 0. 
(b) Modify the previous formula when 89 = 0 or 6; = 0. 
c) Show that if q(z) > 0 and if ao/fo and ai/; are nonnegative, then the eigenvalue \ is nonnegative. 
(c) g g g 
(d) Under the conditions of part (c), show that the eigenvalue X is strictly positive unless ao = a; = 0 and q(x) = 0 

for each x € [0, é]. 


12. Let ¢1(a) and ¢2(x) be two eigenfunctions of the Sturm—Liouville problem (10.2.4) corresponding to the same eigenvalue 
A. By computing the Wronskian W [¢1, ¢2], show that it is identically zero for all x. Using this, show that the eigenvalues 
of the boundary value problem (10.2.4) are all simple. Hint: use Abel’s theorem 4.10, page 201. 


13. Find w so that the exponential functions ¢;,(x) = e&*”’* (k = 0,+1,+2,...) become orthogonal on an interval (a,b). 
Here j is the unit vector in the positive vertical direction on the complex plane such that j? = —1. 


14. Prove the law of cosines: — || f + gl|? = |IfIl? + llgll? + 2(F, 9)- 


In each given Sturm—Liouville problem, determine all real eigenvalues and eigenfunctions. 
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15. y’ +y't+y=—-drAy (0<2#<1), y(0) =y'(1) =9; 17. y/+2y’ty=—-Ay (0<2< 1), y(0) =y(1)+y'(1) = 
0; 
16. y”+2y'+2y=—-Ay (0<2<1), y'(0) =y(1) =0; 


Section 10.3 of Chapter 10 (Review) 


1. Find the Fourier series for the piecewise continuous function with period 3¢ 


an O0<a<f, 
f(z) = 91, b<a< 2, 
$8, Wean< ae 
2. Find the Fourier series for the periodic function f(x) = |cosx| (the absolute value of cos x). 


3. Show that the Fourier series (10.3.3), page 563, may be written in amplitude-phase form 


F(t) ~ > + Sod sin(kt + pp). 


k>1 
The coefficient d; is called the amplitude and y;, the phase (or phase angle) of the kth component. 
4. Let F(t) be a square integrable function of the interval [—7,7], and c be some constant. Given the Fourier series 
F(t) = 3+ Vxs1 (ae cos kt +b; sin kt), what is the Fourier series for F'(ct)? 
5. A function is periodic, of period 40. Find the Fourier series which represent it, if it is defined in the interval —20 < x < 20 
by 
—10, if —20<2<0; 
f(x) = 40, if O<a2< 10; 
20, if 10< 2 < 20. 


6. Find the Fourier series for the function t(7 — t) on the interval [0, x]. 
7. Find the Fourier series for the function 1*t — ¢® on the interval [—7, 7]. 
8. Find the Fourier series for the function sin t + cos 2t on the interval [—7, 7]. 
9. Find the Fourier series for the following piecewise continuous functions. 
0, —17 <a <—7/2, 0, —1<a<—n/2, 
ut+n/2, —-1/2<2<0, -1, -7/2<a2<0, 
(a) f(x) = (d) f(z) = 
x, 0<a<7/2, 1, 0<a<7/2, 
0, T/2<U<T. 0, T/2<u<T. 


0, —7<2«2 <0, 


sing, O<a<7. 


0, -€<2<0, 
uz, O<aK<e. 


(b) f(x) -{ (e) f(z) = 


xv, —r<2 <0, 
0, O<a<tT. 


(c) f(x) = 


0, Zc 
us 
2 


cosxz, — 


(f) f(x) =| 


Section 10.4 of Chapter 10 (Review) 
In each of Problems 1 through 4, assume that the given function is periodically extended outside the original interval 
(0, 7). 


(a) Find the Fourier series for the extended function. 
(b) 
(c) What is the highest peak value predicted near x = 7 for the partial Fourier sum? 
(d) 


Calculate the root mean square error for partial sums with N = 10 and N = 20 terms. 


Sketch the graph of the function to which the series converges for three periods. 


1. f(a) = 2°; 3. f(x) = sin? (x + 1/2); 
2. f(x) = cos 22; 4. f(x) = cosh. 


In each Exercise 5 through 10, determine the value of Gibb’s overshoot/undershoot at each point of discontinuity when 
the given function is expanded into the Fourier series. 
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0, -l<2«< 0, iT 0<a<l, 
5. = 2)= : 8. = 2) = : 
f(e) — dewey fe+2)= Say f(e) ro toeey, — fe+2)= Say 
2, -l<a<0O, -1, -l<2<0O, 
f(e) i. beeen fe+2)= Sy f(e) . beeen £e+2)= Sy 
1, 0<a<l, 2s 0<a<l, 
7 - 2 10 = 2 
f(e) a Loney, fe+2)= Say f(e) oe Ieeey  Fe+2)= ste) 
Section 10.5 of Chapter 10 (Review) 
1. Which of the following functions is even, odd, or neither? 
(a) sin 5; (b) 2a: (c) cos(2z); (d) sinha; (e) a. 


2. Decompose each of the following functions into the sum of an even and an odd function 
(a) e; (b) # — 22 4. 4¢— 3; 


3. Which of the following are odd functions, which are even functions, and which are odd-harmonic functions? 


(c) te’ 


11 
(a) 5-6 cos $ +8 cos; (b) 5 sin = +2 sin. 
4. Ifa function f(t) is periodic of period T = 4@, odd, and also an odd-harmonic function, show that its Fourier coefficients 


(10.4.3) and (10.4.4) are given by 
ben+1 = an f(t) 


5. Ifa function f(t) is periodic of period T = 4€, even, and also an odd-harmonic function, show that its Fourier coefficients 
(10.4.3) and (10.4.4) are given by 
an f(@) 


6. Expand the function f(#) = # defined on the interval 0 < x < @ into Fourier sine and cosine series. 


2n@n +t 


ao = An = ban = 0,7 dt (n=0,1,...). 


2n(@n +t 


ao = A2n = bn =0, a2n+1 = dt (= Oy 1 yesc) 


7. Prove the following expansions on the interval (—7, 7): 


a) ey Be = 2, 


2 


(b) eg cos(na) _ 3a = T 


n a a) 
n>1 n>1 
sin(2k+1)a 4 . T\?] cos(2k+1)a mom, 
=) » (2k+1)? oe i 7 (« stgn(z) a = td) = Opie ss |e; 
,sin(2k — 1)a 8x ,cos(2k—l1)a 4a? is 
7) 82 ee 1) Pe ae ee <2 
(e) 2 1) (Qk — 1)? m2 (f) DS 1) (Ok — 1) 3 1 for |a| < 3 


8. Prove the following expansions on the interval (0, 27): 


cos(nx) _ 2. cos(nz) _ 3x7 — 6n|a| +27? 
(a) es In |2 sin 5]; (b) = — a - - 6 
n>1 n2>1 
sin(nt) am-2 cos(nz) x (x* — 3n|2| + 277) 
= : d a EE 
(c) ‘Ss n 2° ( ) n3 12 ; 
n21 n>1 
2,2 4 4 . 3 2 2 
nun —6 x T sin(nt) — a — 3na° + 20° a 
(e) S>(-1) 7 cos(nz) = 3° 4’ (f) a | ne 
n>1 n2>1 
(g) yo ne 1-5 (Su 2a? + 4x -2‘), 
nol 
9. Using expansions from Exercise 8, prove the identities 
2 2 4 
7 1 7 1 T 1 
i eer b) —=)*\——_. —=)y ——_. 
@) Go Le@ ©) 3 Gta ©) 96 ys (i + 2k)4 
k>1 k>0 k>0 


In each of Problems 10 through 19, assume that the given function is periodically extended outside the original interval (0, @). 
(a) Sketch the graphs of the even extension and the odd extension of the given function of period 2¢ over three periods. 
(b) Find the Fourier cosine and sine series for the given function. 


(c) Calculate the root mean square error for partial sums with N = 10 and N = 20 terms. 
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din 9 2 
3 = — : 1 
10. f(x) = 7° — a, O<aK<r 16: #ey= x, O<«e<l, 
1 O0<4r<2 a, 1l<a2<2; 
ll. f(z) = 4” ; 
e-l, 2<2<3. 
x O0<ax<l 
Qu, O<a<2, : : 
12. f(z) = 17. = 
3-4, 2<2<3; 
1, O<a<q, 
13. f(x~)=<0, a<a < 2n, és : Log, O0<a<i, 
2, n<a<3rn. del a-3, i<ae<l, 
14. f(x) =(#@-1)?, O<a <2. 
2 
0 1 1 0 1 
image Vee ieee se 
1, l<a2<3. 3-4, 1l<a<3 
20. Show that 
4 a: . (2k4+1)ra ds, 0<a<8, 
@) ie oT 
on + -1, -€<2<0; 
(-1)*¥ . (2k41)rx @ =) e e 
b =-=1 ¢({— + — ese =: 
(P) rs kOe tea Ir a 
k>0 
-—6-2, -6<2<-3, 
(c) 24 (—1)* ne (2k+1)ra ‘ r 
7? 24 (2k + 12 G =) ew 
2 6— 2, 3<2<6; 
1152 1 _ (2k +1)rx , 
(d) a a ara: sin 122 — 2”, 0<a< 12; 
k>0 
Qn 1 (2k + 1)rax TX 
= =-In|tan(=)], 0 é; 
(e) 3 > ORF cos ? n [tan ( 55 <a< 
—1)* 1 0 £ 
(t) ye! con Che = | V<e< 5, 
iO +1 -1 3<a<& 
20 1 (2k+1)ra 
(g) =) a C08 =--—f, O0<ar<. 
om (2k + 1) L 2 
21. For arbitrary positive number z, expand the function cos zx into even Fourier series on the interval |x| < 7. By choosing 
particular values of x, prove the formulas 
1 2z | 1 1 1} 1 22 
—————S SS | —— t, =—|/--— —___. 
sintz 7m | 22? + ge , Ree ogee paar 
k>1 k>1 
22. The acoustical wave form w(t) = e* cos (27 - 200¢) corresponds to a flute-like tone with a pitch of 200 Hz that sounds 
from t © —2sec to t = 2sec. Expand w(t) into the Fourier cosine series. 
23. Consider a continuous function f(x) = 2? on the open interval (0, 2). 


(a) 
(b) 


sums in the previous part. 


Expand the function f(a) into the sine Fourier series. Does 


Find the Fourier series for f(a) assuming that f(a) is extended periodically with period T = 2. 


Determine points of discontinuity and find corresponding overshoots and undershoot values for the Fourier partial 


Expand the function f(x) into the cosine Fourier series. Does the series converge uniformly? 


this series exhibit the Gibbs phenomenon? What are 


the values of overshoot and undershoot for the partial sine Fourier sums? 


wT 


24. Evaluate the integral / 


—T 


Ge cos 5t — t? sin 3t) dt. 


Hint: Don’t spend more than 10 seconds. 


< \ Taylor & Francis 
hi! bivadbos Group 
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Chapter 11 


Partial Differential Equations 


This chapter serves as an introduction to linear partial differential equations (PDEs, for short) that are used for 
modeling physical phenomena with more than one independent variable. Frequently, the independent variables are 
time t and one or more of the spacial variables, which are usually denoted by x, y, and z. For example, u(z, y, z,t) 
might represent the temperature of the three-dimensional solid at spacial point (a,y,z) and time t. A partial 
differential equation is an equation involving partial derivatives of a dependent variable. 

Traditionally, a course on partial differential equations includes three types of equations: parabolic, hyperbolic, 
and elliptic equations. Their prototypes originated from the heat transfer and diffusion equations (u; = aV7u), 
wave propagation (us = c?V2u), and time-independent or steady processes (Vu = 0), respectively, where V is the 
gradient operator. Their derivations and detailed physical interpretations can be found elsewhere [14]. Therefore, 
we consider three different types of equations that possess different properties and that can all be solved using the 
same method. This method is known as separation of variables, and it is often called the Fourier method. It was 
first derived and studied by J. d’Alambert and later used by L. Euler to solve spring problems. 


11.1 Separation of Variables for the Heat Equation 


We start this section by considering the temperature distribution in a uniform bar or wire of length ¢ with a perfectly 
insulated lateral surface and certain boundary and initial conditions. Let the x-axis be chosen to lie along the axis 
of the bar, and let = 0 and x = @ denote the ends of the bar. We also assume that the cross-sectional dimensions 
are so small that the temperature can be considered constant within any cross-section but may vary from section to 
section. To describe the problem, let u(z,t) represent the temperature of the point x of the bar at time t. Assuming 
the absence of internal sources of heat, the function u(x,t) can be shown (see [14]) to satisfy the one-dimensional 
heat conduction (or transfer) equation: 


Ou O7u 


= a B12 or in a shorthand notation uz = AUgz. (11.1.1) 
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The positive constant a in Eq. (11.1.1) is known as thermal diffusivity; it is a material property which describes 
the rate at which heat flows through a material, typically measured in mm?/sec or cm?/sec. Usually, the thermal 
diffusivity is very sensitive to the variation in temperature and is chosen as a constant for simplicity; it can be 
calculated as 


a = K/(ps), 


where « is the thermal conductivity (W/(m-°K)), p is the density (g/cm*), and s is the specific heat of the material 
of the bar or wire (J/(kg-°K) ). Common values of a are given in the following table: 


Material Thermal diffusivity Material Thermal diffusivity 
mm?/sec mm?/sec 


Gold Silver, pure (99.9%) 


Copper at 25°C Inconel 600 at 25°C 
Aluminum Steel, 1% carbon 
Water at 25°C Paraffin at 25°C 


In addition, we assume that the ends of the bar are held at constant temperature 0°C and that initially at t = 0 
the temperature distribution in the bar is known (or measured) to be f(x). Then u(x,t) satisfies the boundary 
conditions 

u(0,t) =0, u(é, t) = 0, (11.1.2) 


and the initial condition 
u(x,0) = f(z) (0<a<). (11.1.3) 


Equation (11.1.1) together with the conditions (11.1.2) and (11.1.3) is called the initial boundary value problem 
(IBVP, for short). The conditions of (11.1.2) are called the Dirichlet boundary conditions or conditions of the first 
kind. The described physical problem imposes a compatibility constraint (or conditions) on the function f(z): 
it must satisfy f(0) = f(€) = 0 to match the homogeneous boundary condition (11.1.2). However, the solution to 
the IBVP exists without any constraint on f(a) at end points. 

The heat transfer problem (11.1.1) — (11.1.3) is linear because unknown function u(x,t) appears only to the first 
power throughout. Since the differential equation (11.1.1) and boundary conditions (11.1.2) are also homogeneous, 
we solve this problem using the method of separation of variables. While the method originated in the works 
by d’Alembert, Daniel Bernoulli, and Euler, its systematic application was carried out by Joseph Fourier. When 
boundary conditions are not homogeneous, direct application of the method is not possible. To bypass this obstacle, 
we present interesting and important variations on the heat problem in the following section. 

To solve the IBVP (11.1.1) — (11.1.3), we apply the separation of variables method. It starts with disregarding 
the initial conditions (11.1.3) and seeking nontrivial (meaning not identically zero) solutions of the partial differential 
equation (11.1.1) subject to the homogeneous boundary conditions (11.1.2) and represented as a product of two 
functions: 

u(a,t) = X(x) T(t), (11.1.4) 


where X(x) is a function of x alone, and T(t) is a function of t alone. Substitution of the above product into 
Eq. (11.1.1) yields 
X(«) T(t) = aX"(x) T(t), 


where T(t) is the first derivative of T(t) with respect to time t and X"(a) is the second derivative of the function 
X(x) with respect to the spatial variable x. Dividing both sides of the latter equation by aX (x) T(t) (we will not 
worry about X(x) T(t) being 0), we get 

Tt) _ X"(a) 


aT(t) X(x)~ 
On the left-hand side we have a function depending only on time variable t, and in the right-hand side there is a 
function depending only on the spacial variable x; therefore, we separate the variables! Since x and t are independent 
variables, the only way a function of t can equal a function of x is if both functions are constants. Consequently, 
there is a constant that we denote by —A, such that 


7 X" (x) 7 
ae CO Se 
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or 


T(t) + adT(t) =0, (11.1.5) 
X"(xz)+rAX (x) = 0. (11.1.6) 
Substituting u(x,t) = X(«#) T(t) into the boundary conditions (11.1.2), we get 


A product of two quantities is zero if at least one of them is zero. Since we are after a nontrivial solution, T(t) 
cannot be identically zero, and we get the boundary conditions 


X(0)=0, X(@)=0. (11.1.7) 


The homogeneous differential equation (11.1.6), which contains a parameter A, subject to the homogeneous boundary 
conditions (11.1.7) has obviously a trivial solution X (x) = 0. However, we seek nontrivial solutions; therefore, we 
need to find such values of parameter for which the problem has nontrivial solutions. Such values of X are called 
eigenvalues and the corresponding nontrivial solutions are called eigenfunctions. The problem (11.1.6), (11.1.7) 
is usually referred to as the Sturm—Liouville problem (see §10.1). 

Since we know from §10.1 that negative values of \ cannot be eigenvalues, we consider only the case when A > 0. 
Equation (11.1.6) has the general solution 


X(x) = A cos (xv) +B sin (xv) ; 
where A and B are arbitrary constants. The boundary conditions demand 
X(0)=4=0,  X(0)=B sin (¢VX) =0. 
Since B cannot be zero (otherwise we have a trivial solution), we get the condition for determining 2: 


sin (VX) = 0 = vi= ot, Wi 1,2,3,000, 


Therefore, the Sturm—Liouville problem (11.1.6), (11.1.7) has a sequence of solutions, called eigenfunctions, that 
correspond to eigenvalues: 


2 
Xn(e) = sin, An = () ye eS 1S Benes 


Turning now to Eq. (11.1.5) and substituting A, for A, we get the first order differential equation 
T + arnT(t) = 0, 
—arnt 


which has the general solution T,,(t) = C), e 
and T,,(t), we obtain a sequence 


, where C,, is an arbitrary constant. Multiplying functions X,,(x) 


un(x,t) = X;,(a)T,(t) = Cy ec * sin ee n=1,2,.. 


of partial solutions of the differential equation (11.1.1) that satisfy the boundary conditions (11.1.2). Since both 
Eq. (11.1.1) and the boundary conditions (11.1.2) are homogeneous, we can use the linearity of the heat equation 
to conclude that any sum of partial nontrivial solutions up(x,y) is a solution of Eq. (11.1.1), for which boundary 
conditions (11.1.2) hold. Therefore, the sum-function 


u(z,t) = \~ un(a,t) = S> One * sin — (11.1.8) 


nol nol 


is a solution of boundary problem (11.1.1), (11.1.2). This solution (11.1.8) is formal because we did not prove the 
convergence of the series (11.1.8) and its differentiability. Such a proof will require some lengthy mathematical 
arguments that would not improve our understanding. So we just assume that this is true. It remains only to 
satisfy the initial condition (11.1.3). Here is where the Fourier series makes its appearance. Assuming that the series 
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(11.1.8) converges uniformly (which means that we can interchange the signs of limit as t + 0 and the summation), 
we have to satisfy 


u(x,0) = > Un(x,0) = S- Cy, sin = f(z). (11.1.9) 


TT = 
n2>1 n>1 


Equation (11.1.9) looks familiar because it is a Fourier expansion of the function f(a) with respect to sine functions 
(see §10.5). Therefore, its coefficients are 


2 L 
Ch = al f(z) sin da (n=1,2,...). (11.1.10) 
0 


For a square integrable function f(x), the coefficients C,, in Eq. (11.1.10) tend to zero as n ++ oo (see §10.4). If 
t >e > 0, the series solution (11.1.8) converges uniformly with respect to independent variables x and t because it 
contains an exponentially decreasing multiple e~°”'. Therefore, for practical evaluation of u(a, t), the series (11.1.8) 
is truncated by keeping a few first terms. When t is small or zero, the convergence of the series asl Un(a, t) depends 
on the speed at which the Fourier coefficients C;, approach zero. From §10.4, it is known that the smoother the 
function f(a) is, the faster its Fourier coefficients tend to zero, and hence, the fewer terms in the truncated series 
are needed to obtain an accurate approximation. 


Example 11.1.1: ( Zero temperature ends ) Suppose that a copper rod of length 50 cm was placed into a reservoir 
with hot water at 50°C so that half of it is in the air at 20°C. At t = 0, the rod is taken out and its ends are kept at 
constant ambient temperature of 20°C. Let us denote the difference between the rod’s temperature and the ambient 
temperature by u(x,t), where x is the distance from the left end of the rod, « = 0. Then u(z,t) is a solution of the 
following initial boundary value problem: 


Ut = Ager, u(0, t) = u(50, t) = 0, u(a, 0) ts f(z), 


where a © 1.14 and 


f(x) = 


30, if0<a< 25, 
0, if 25<a2< 50. 


For this problem, the solution, according to Eq. (11.1.8), is 
2 
u(a,t) = Cre sin sey An = (=) , 

n2>1 
where the coefficients C,, (n = 1,2,3,...) are calculated to be 
2 pe 30 /?° 120 
Cn = ; (cx) sin dx = 5 ; sin de = — sin® (=). 
Therefore, the solution-sum becomes 
120 1 

u(x,t) = — —¢@ An? sin (=) sin? (=) (a 1.14). 


TT Tr 
nol 


The M-th partial sum gives approximation to the exact solution: 
M 
120 1 NTL nT 
Su(z,t) = — =e Wnt sin (=) sin? (+) (a + 1.14). 
T 
n=1 


We can plot the M-th partial sum using Mathematica (see the graph of the partial sum with 100 terms on page 
597). 


SF[x_, t_, M_] := (120/Pi)* Sum[(1/n) *Exp[-(1.14*n*Pi/50)°2 *t]* 
Sin[n*Pi*x/50] *(Sin[n*Pi/4])°2, {n, 1, M}] 
Plot3D[SF[x, t, 100], {x, 0, 50}, {t, 0, 10}] 
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Heat flux or thermal flux is the rate of heat energy transfer through a given surface per unit time. The SI unit of 
heat flux is watts per square meter. Heat rate is a scalar quantity, while heat flux is a vector quantity. According to 
Fourier’s first law, the heat flux q (with units W-m~?) is proportional to the gradient of the temperature u: g = —KVu, 
where « is the thermal conductivity of the material and V is the gradient operator. In a one-dimensional case, we 
have 


Ou m Us, if the direction of flux is along z-axis, 
—uUz, if the direction of flux is opposite x-axis. 


Example 11.1.2: (Insulated ends) Consider a thin rod with insulated ends, which means that heat flux through 
the end points is zero. Since heat can neither enter nor leave the bar, all thermal energy initially present is “trapped” 
in the bar. The temperature u(x,t) inside the rod of length @ at the spacial point x and at time t is a solution to 
the heat equation (11.1.1): 

Up = WUge, O<ar<f, 0<t<t<o, 


subject to boundary conditions of the second kind 
uz (0,t) = 0, Ux (l,t) = 0, 0<t<t<o, (11.1.11) 


and the initial condition 
u(a,0) = f(x). 
The boundary conditions (11.1.11) are usually referred to as the Neumann!!? boundary conditions. In general, 
boundary conditions of the second kind specify the normal derivative on the boundary of the spacial domain, 
To solve this initial boundary value problem, we use separation of variables. So we seek partial nontrivial solutions 
of the heat equation subject to the Neumann boundary conditions and represented as the product of two functions 


u(x,t) = X(a) T(t). 
Substituting this form into the heat equation, we get two differential equations that we met before: 
T+ adAT(t) =0, (11.1.5) 
X"(xz) + AX (x) = 0. (11.1.6) 
From the boundary conditions (11.1.11), we have 
X'(0) = 0, X'(L) = 0 


which, together with Eq. (11.1.6), constitute the Sturm—Liouville problem for X (a). Note that A = 0 is an eigenvalue, 
to which corresponds the eigenfunction Xo = 1 (or any constant). Indeed, setting A = 0 in Eq. (11.1.6), we have 
X" = 0, so the general solution becomes X (a) = Cy + Cox. Since X’ = C2, the boundary condition causes Cz = 0 
and the constant X (a) = Cj is an eigenfunction, for arbitrary C4. 

Assuming that A > 0, we obtain the general solution of Eq. (11.1.11): 


X(x) = C\ cos (xv) + Cy sin (xv>) 


for constants C, and C3. Since its derivative is X’(x) = —CyV/X sin (xv) + CoVX cos (2v9), we get from the 
boundary conditions that 


CoV =0 and CVX sin (ev>) + CoVX cos (ev>) =0. 
Remember that we assume A > 0, so C2 = 0, and from the latter equation, it follows that 
Cy sin (€VX) = 0. 


12Carl (also Karl) Gottfried Neumann (1832-1925) was a German mathematician who can be considered as the initiator of the theory 
of integral equations. For most of his career, Neumann was a professor at the universities of Halle, Basel, Tiibingen, and Leipzig. 
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If we choose C = 0, then we get a trivial solution. Therefore, we reject such an assumption and from the equation 


sin (¢v>) = 0 find eigenvalues and their eigenfunctions: 


2 
dn = (7) ) X,,(x) = cos —, n=0,1,2,.... 


The case n = 0 incorporates the eigenvalue \ = 0. Substituting A = \,, into the equation for T(t): T +a2dT = 0, 
and solving it, we find 
Lia C.e 


where C,, is a constant. Now we can form the solution of the given initial boundary value problem as the sum of all 


partial nontrivial solutions: 
ann 
Xn( Ce cos 
= 2s 7 
n>0 n>0 


To satisfy the initial condition u(z,0) = f(x), we have to choose the coefficients C,, in such a way that 


YS On Ane =SoCn cos = f(e). 


n2>0 n>0 


Since this is a Fourier cosine series for f(x), we get the coefficients (see Eq. (10.5.4) on page 583) 


2 £ 
-7/ F(«) cos da, n=0,1,2,.... 
0 


The temperature inside the bar tends toward a constant nonzero value: 


tim west) = 6 =3 f raya 


—arn t 


because all other terms in the sum have an exponential multiple e , which goes to zero as time elapses. 


Example 11.1.3: (One insulated end ) Consider a thin rod of length ¢ with one insulated end x = 0 and other 
end x = ¢ is maintained at zero degrees. Then u(z,t), the temperature in the rod at section x and time ¢, is a 
solution of the following initial boundary value problem: 


Ut = AUge, OA0,0)=0, wlljt)=0,. wi(2,0) = f(a). 


The compatibility conditions read as f’(0) = f(¢) = 0. Our objective is to determine how the initial temperature 
distribution f(x) within the bar changes as time progresses. Following the general procedure of Fourier’s method, we 
seek partial nontrivial solutions in the form u(#,t) = X(«) T(t), which yields the following Sturm-—Liouville problem: 


X"(x) + AX (x) = 0, X'(0)=0, X(é)=0. 
Substituting the general solution X (x) = Cj cos (xv) + C2 sin (« vy) into the boundary conditions, we get 


Co = 0, C cos (eva) =0 


Hence, \ must be a solution of the transcendental equation cos (wa) = 0, so V/A = 5 4+nz, and we find the 
solution of the Sturm—Liouville problem: 


_ {a+2n) . _ fan + 2n) _ 
ay = (*) ; Xn(c) = 0s (OE ; 10,1 De a 


7 LAyed am(1 + 2n) 
= S- Cn € cos (SE : 


n>0 


This leads to a series solution: 


To satisfy the initial condition u(z,0) = f(x), we should choose the coefficients so that 


anm(1 + 2n) _ 
Ch =3 [ ree os (SO oy) ) ae, m=O. Doak ts 
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11.1.1 Two-Dimensional Heat Equation 


Suppose that a thin solid object has a rectangular shape with insulated faces and of dimensions a and b. Denoting 
by u(z, y,t) its temperature at the point (a, y) and time ¢, the two-dimensional heat equation becomes 


us = aV2u or Ut = A (Ure + Uyy) - (11.1.12) 


Here the constant a@ denotes the thermal diffusivity of the material, and V is the gradient operator. We consider 
the Dirichlet boundary conditions when its edges are kept at zero temperature: 


u(0,y,t) = u(a,y,t)=0, O<y<b, 0<t, 


11.1.13 
u(a@,0,t) = u(a#,b,t) =0, O<a<a, 0<t. ( ) 

Assuming that the initial temperature distribution f(z, y) is known, we get the initial condition: 
u(z,y,0) = f(x,y), O<a<a, 0<y<b. (11.1.14) 


The solution of the problem is based on the separation of variables and follows a step-by-step procedure used to 
obtain a solution of one-dimensional heat equation. The details are outlined in Problem 30 (page 604). This gives 
the following “explicit” formula for the unique (formal) solution: 


co co k 
u(z,y,t) = S> 7 Ang ent sin sin, (11.1.15) 
n=1k=1 
where 
a eo 
Ane = Tt (S+%). WTB cape (11.1.16) 
and 
Ank -<f aw [ dy f(x,y) sin “= = sin —*. (11.1.17) 


In each of Problems 1 through 4, determine whether the method of separation of variables can be used to replace the 
given partial differential equation by a pair of ordinary differential equations with a parameter 1. 
1. ut = vse - 3. CUt=tUaz , 


2 2 
2: Ut =x Ura +t Uct , 4. Utt = Ure + Uct - 


5. Scaling is a common procedure in solving differential equations by introducing dimensionless variables. 


(a) Show that if the dimensionless variable € = «/@ is introduced, the heat equation (11.1.1) becomes 


Ou a Ou 


(b) Show that if the dimensionless variable 7 = at is introduced, the heat equation (11.1.1) becomes 


In Problems 6 through 11, find a formal solution to the given initial boundary value problem subject to the Dirichlet conditions 
Ut = AUer, u(0,t) = u(é,t) =0, u(a#,0) = f(x), 
when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(x) are specified. 
6. a=2,£=7, and f(a ee 
7. a=1, €=4, and f(x) = 
8. a= 2, €=3, and f(x) = 5 (violates the compatibility constraint). 
9. a=3, €=2, and f(x) = 2?(4— 2”). 


x? (violates the compatibility constraint). 
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10. 


11. 
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ifO< x2 <2, 
a= 2,¢=5, and f(x) = _ ' _ 
10 —2x)/3, if2<a<5. 
0, if0<2<1, 
a=1,=6,and f(z7)=41, ifl<2<3, 
0, if3<2<6. 


In Problems 12 through 15, find a formal solution to the given initial boundary value problem subject to the Neumann 
conditions 


Ut = Use, Uz (0,t) = us(£,t) =0, u(a#,0) = f(x), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(a) are specified. 


12. 
13. 
14. 
15. 


a=3,€=3, and f(x) = 2°(3— 2)”. 
a=1, €=1, and f(z) 
a= 2, €=1, and f(x) = 1-—sin(37z) (violates the compatibility constraint). 
a= 2,€=4, and f(z) 


=e” (violates the compatibility constraint). 


= 1-cos(3ra). 


In Problems 16 through 21, find a formal solution to the given initial boundary value problem with boundary conditions of 
the third kind 


Ut = AUee, u(0, t) = Ux (2, t) — 0, u(z, 0) = f(x), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(x) are specified. 


16. 
17. 
18. 
19. 
20. 
Zi, 


a=2,€=7/2, and f(x) =3 sin5x —7 sin 9a. 

a=1,€=1, and f(x) = 2 (1— 2”) (violates the compatibility constraint). 
a= 2, €=7/2, and f(x) = x (violates the compatibility constraint). 
a=3,€=3, and f(x) = 2(3— 2)”. 

a=4,€=2, and f(x) = sin(57a/4). 

a=9, €=3, and f(x) =1-cos(5ra). 


In Problems 22 through 27, find a formal solution to the given initial boundary value problem with boundary conditions 
of the third kind 


Ut = AUeex, uz (0, t) = u(@,t) — 0, u(x, 0) = f(x), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(a) are specified. 


22. 
23. 
24. 
25. 
26. 
27. 
28. 


29. 


30. 


a=2,0=7/2, and f(x) = 4 cos3x — 6 cos 5a. 
a = 3, €=3, and f(x) 
a=1,€=2, and f(x) = sin rz (violates the compatibility constraint). 
a=2,€=2, and f(x) =2? —4. 

a= 2,€=4, and f(x) = 8 cos(7ra/8). 

a=9,€=3, and f(x) = 2? —9. 


= « (violates the compatibility constraint). 


Consider the conduction of heat in a copper rod (a ~ 1.11 cm?/sec) 50cm in length whose one end x = 0 is maintained 
at 0°C while the other one x = 50 is insulated. At t = 0, the temperature profile is 0°C for 0 < # < 25 and x — 25 for 
25 <a < 50. 


(a) Find the temperature distribution u(z, t). 

(b) Plot u versus x fort =0.5,t=1, andt=5. 

(c) Determine the steady state temperature in the rod when t ++ +00. 
(d) Draw a three-dimensional plot of u versus x and t. 


In the previous problem, find the time that will elapse before the left end x = 50 cools to a temperature of 10°C if the 
bar is made of 
(a) coopera#+1.11;  (b) molybdenum a 0.54; (c) silver a = 1.66. 


Prove the formulas (11.1.15) — (11.1.17) by performing the following steps: 


(a) Assuming that the heat equation (11.1.12) has a partial, nontrivial solution of the form u(x, y,t) = v(x, y) T(t), 
derive the differential equations for functions v(a#,y) and T(t): 


T+adrT(t)=0 and V?v(ar,y)+Av(a,y) =0, 


where 4 can be any constant. 
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3l. 


(b) Derive the boundary conditions for v(x, y): 


v(0,y,t) = v(a,y,t) =0, O<y<b, 
v(x, 0,t) = v(z,b,t) =0, O<a<a. 


(c) Assuming that a solution of the Sturm—Liouville problem for u(x, y) has the form u(x, y) = X(a) Y(y), derive the 
corresponding Sturm-Liouville problems for X(x) and Y(y): 


X" (2) +pX(a) =0, X(0)=X(a)= 
Y"(y)+(A-v)¥(y)=0, Y(0)=Y(b)= 


(d) Solve these Sturm-—Liouville problems for X(«) and Y(y) to determine the eigenfunctions: 


krax 


2 2 
Xx(x) = sin ; Yn(y) = sin ke 7 
a 


b” Ank = 7 qa + pe? hit SAD ss. 


(e) Substitute the eigenvalue ,,, that depends on two parameters n and k into the differential equation for T(t) and 
solve it: 
—ar?,t 
Tnk(t) = Anke Ce 1 ae ae 


(f) Take a double infinite series of the product Ty, (t)X()Yn(y) as a formal solution of the given IBVP and find its 
coefficients (11.1.17). 


In mathematical finance, the Black—Scholes''? equation is a partial differential equation (PDE) governing the price 
evolution of a European call or European put under the Black-Scholes model. Broadly speaking, the term may refer 
to a similar PDE that can be derived for a variety of options, or more generally, derivatives. 


For a European call or put on an underlying stock paying no dividends, the equation is: 


OV 1 2 20°V OV 7 
aE 7° Ba tees TY =", 


where V(a,t) denotes the value V of the option to buy or sell a particular security at price x and time t. The parameter 
o” is a measure of the volatility of the security’s return to the investor, and the constant r is the current interest rate 
on the risk free investment such as a government bond. The option to buy or sell is called a derivative of the underlying 
security. The formula led to a boom in options trading and legitimized scientifically the activities of the Chicago Board 
Options Exchange and other options markets around the world. The key financial insight behind the equation is that 
one can perfectly hedge the option by buying and selling the underlying asset in just the right way and consequently 
“eliminate risk.” 


Show that the Black-Scholes differential equation has solutions of the form V(x#,t) = C’ ety F assuming x > 0, and 
a, 8, and C are positive constants. Find the equation that relates these constants. 


In Problems 32 through 37, find a formal solution to the given initial boundary value problem subject to the Dirichlet 
conditions 


Ut = AUee u(0,t) =u(é,t) =0, u(x,0) = f(a), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(x) are specified. 


32. 
33. 
34. 
35. 


36. 


37. 


a=5,€=4, and f(x) =6 sin2ra. 

a=9, €=3, and f(x) = 2° (violates the compatibility constraint). 
a= 16, €=2, and f(x) = 2 — 2 (violates the compatibility constraint). 
a=4,€=2, and f(x) = 2(2-2). 


fO<a2<4, 
a = 25, £= 10, and f(x) = oe ads 
20-22, if4<a2< 10. 
0, if0<2<1, 
a=4,=4,and f(z7)= 42, ifl<ar<2, 
0, if2<a<4 


113The Black-Scholes model was first published by an American economist Fischer Black (1938-1995) and a Canadian-American 
financial economist Myron Scholes (born in 1941) in their 1973 paper, “The Pricing of Options and Corporate Liabilities,” published in 
the Journal of Political Economy. 
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11.2 Other Heat Conduction Problems 


Previously, we considered initial boundary value problems for heat transfer equations when both the equation 
and the boundary conditions were homogeneous. Now we consider problems when neither the equation nor the 
boundary conditions are homogeneous and show that it is not an obstacle for the separation of variables method. 
To demonstrate how the method works, we will do some examples. 

We start with a heat conduction problem subject to “arbitrary” boundary conditions of the first kind (Dirichlet): 


Up = Ure + P(x, t), u(0, t) = To(t), ull, t) = Tr(t), u(z, 0) = f(z), 


where f(x) provides the initial temperature distribution in the bar of length @, ®(, t) is a given function representing 
a rate of change in the temperature provided by external sources, and Tp = To(t), Te = T(t) are known functions of 
time. Since the separation of variables method can be applied only when the boundary conditions are homogeneous, 
we shift the boundary data. In other words, we represent the required solution u(x,t) as the sum of two functions 


u(x,t) = v(a,t) + w(2,t), 


where vu(z,t) is any function that satisfies the boundary conditions. For instance, we choose it as 


— 
vu(a, t) = 7 Te) + z as 


Since v(z,t) is a linear function in x, we have vz, = 0. This yields the following problem for unknown function 
wW=U-—v: 


To(t). 


wy = AWee — 0 + O(a, 6), w(0,t)=0, w(é,t) =0, w(a,0) = f(a) — v(a,0). 
If we set F'(x,t) to be ®(xz, t)—v and v(x) = f(x)—v(a,0), we get the initial boundary value problem for determining 


w(a, t): 
Wt = AWee + F(a, t), w(0,t)=0, w(é,t) =0, w(x,0) = y(a), (11.2.1) 


which is similar to one for u(a,t), but it has homogeneous boundary conditions. Our objective now is to solve a 
nonhomogeneous heat equation subject to homogeneous boundary conditions as specified in problem (11.2.1). Note 
that temperature u(,t) does not depend on our choice of the function u(x,t); however, the functions F(x, t) and 
p(x) do. 

When the partial differential equation is not homogeneous, the separation of variables method asks us to find 
first eigenfunctions and eigenvalues of the Sturm—Liouville problem that corresponds to the similar problem for a 
homogeneous equation subject to the homogeneous boundary conditions: 


Wt = AWer, w(0,t)=0, w(é,t) =0, w(a,0) = y(z). 


Since its solution was obtained previously in §11.1, we seek a formal solution of the initial boundary value problem 
(11.2.1) in the form of an infinite series over eigenfunctions: 


‘=> °C, Xn(a), (11.2.2) 


nol 


where the eigenfunctions X,,(z) = sin 
consists of expanding the ae F( 


2 £ 
F(x,t)=S~ falt)Xn(z), — falt) => 7. F(x,t) sin ag. 
ae £ Jo e 
Substitution of the Fourier series for w(x,t) and F(a, t) into the given heat equation yields 


=> 0,0) ) = @ Wer + F(2,t) yaa Sy CAd) XN (2) + S> fn(t) X. 


n2>1 n21 n21 


) were obtained in §11.1, but C;,(t) should be determined. The next step 
into a Fourier series with respect to the same system of eigenfunctions: 


ee 
x, t) 


Since X}! = —ry, Xn, we get 


Wt = AWze + F(a, t) = y emt =-a SC.) An Xn(x) + S- Fn(t) Xn(x). 


n>1 n>1 n21 
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The above equation can be reduced to one sum: 


Y [En Xn(w) + an Xn(x) — fn(l) Xn(w)] =0, 


n>1 


or after factoring X,,(a) out, to 


DY [Cn(t) + adn Cult) = falt)] Xn() = 0. 


n>1 


It is known from §10.2 that the set of eigenfunctions {X,(x%)}n>1 is complete in the space of square integrable 
functions. Therefore, the latter is valid only when all coefficients of this sum are zeroes: 


Cn(t) + arn Ca(t)— fot) =0, n=1,2,.... 


Substituting Eq. (11.2.2) into the initial conditions w(x,0) = y(x), we get 


> Cn (0) Xn(x) = v(x) = Se Yn Xn(2), 


nol n2>1 


where 
D) a 
en =F foe) Xn(e) da 
0 


are Fourier coefficients of the known function y(a) over the system of eigenfunctions for the corresponding Sturm— 
Liouville problem. To satisfy the initial conditions, we demand that C,,(0) = y,. This leads to the following initial 
value problem for determining coefficients C,,(t): 


Cr(t) + aAn Ca(t) = fr(t),  Cn(0) = Yn. 
Since the differential equation for C,,(t) is linear with constant coefficients, the reader may consult §2.5 and verify 
that its solution is 


t 
C= | file dr+e,€ Or, woo, (11.2.3) 
0 


Example 11.2.1: (Temperature at the ends is specified ) We reconsider Example 11.1.1, but now with non- 
homogeneous boundary conditions of the first type: 


it = Ugg, u(0,t) = To, ul(l,t)=Tr, u(a#,0) = f(x), 


where f(a) is the initial temperature distribution in the bar of length @, and To, Ty are given constants. (They are 
chosen as constants for simplicity but could be functions of time t.) Since the separation of variables method can be 
applied only when the boundary conditions are homogeneous, we represent the required solution u(x,t) as the sum 
of two functions: 

u(a,t) = v(#) + w(a, t), 


where v(x) is a function that satisfies the given boundary conditions, for instance, 


v(x) = a ie 


e 7 To: 


Since v(x) does not depend on time ft, it is a steady state temperature. Recall that a steady state temperature 
is one that does not depend on time. This choice for v(a) is not unique—there exist infinitely many functions that 
satisfy the nonhomogeneous boundary conditions. For example, we can choose vu(a, t) as 


u(x,t) = Te(t) sin + To(t) cos ve 


It does not matter which function is chosen for v(x, t) to satisfy the boundary conditions because the given IBVP 
has a unique solution. Now we return to the problem for w: 


Wt = AWae + (QUrx — VE) , w(0,t)=0, w(é,t) =0, w(a,0) = f(a) — v(a). 
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Since in our case Uv; = QUyzx, the heat equation for w becomes homogeneous, and we can use its series representation 
from Example 11.1.1: 


w(x, t) = S- Ce oe sin, An = ()", 


nol 


where 


2 £¢ _ ant 2.7? le L—-«a« _ nn 
Cua Gf f(z) sin Far; f p+ 1] sin ae 


ae i ont 2 2 
= in (7, 
al f(a) sin 7 dx + 7(—1) Bes 0 


Example 11.2.2: (Heated rod ) Consider a thin rod of length ¢ when one end is kept at 0°C and the other end 
is perfectly insulated. Assume that at time ¢t = 0 the insulated end of the rod is lifted out of ice and one third of it 
becomes heated with a constant uniform source for finite time period (0,7), 7 > 0. The model IBVP is 


(PDE) w= atc, + O(a,t), O<r<l, 0O<t<t<o, 
(BC) u(0,t)=0, uz(€,t)=0, 0<t<t <a, (11.2.4) 
(IC) u(x, 0) = 0, 


where the external heat source ®(x,t) is expressed through the Heaviside function H(t) as 


0, if0<2< 2¢/3, 


@(x,t) = [H(t) — H(t—7)] x ‘ if 20/3<a< &. 


To find the separated solutions of the homogeneous PDE that satisfy the given boundary conditions, we insert 
u = X(x)T(t) into the PDE and BC, separate variables, and obtain the Sturm-—Liouville problem 

X"(r) + \X(x) = 0, X(0) = X'(£) =0. 
Solving the eigenvalue problem, we see that X,(a) = sin rOtin)e n =0,1,2,..., are eigenfunctions corresponding 


2 
to the eigenvalues A, = Je . So we seek a formal solution as a series over the eigenfunctions 


7 _ . W(1+2n)a 
uat= > C0) ile) = YC. ay ae (11.2.5) 
n>0 n>0 
Expanding the source function ®(2,t) into a similar series, we get 
4 m(1+2n) . r(1+2n)a2 
®(x,t) = |A(t) — H(t— —— ————- ——_—.. 
(2,8) = [HU — HE 1 Yo gay 108 EE sin 7 


n20 


Substituting these Fourier expansions into the given problem (11.2.4), we obtain the following initial value problem 
for the coefficients C,,(t): 
. 4 m (1+ 2n) 
Cn An Cr(t) = [A (t) — H(t — r)] ————~ ae, 
+n On(t) = [H() — H(t )] aa cos ES 


C,(0)=0, n=0,1,2,.... 


To solve this initial value problem, we apply the Laplace transform and obtain 


_ 4 m (1+ 2n) 
Cr(t) = nan (soe) [9n(t) — Gn(t—7)], 
where ‘ 
Init) = - [1 _ ee H(t), Qn = (a) 


Substitution of these values C,,(t) into Eq. (11.2.5) gives the formal solution of the initial boundary value problem 
(11.2.4). Since the coefficients C;,(t) decrease with n as n~?, the series (11.2.5) converges uniformly and the function 
u(a,t) is continuous. 
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So far, we discussed the Dirichlet (if the rod’s ends are maintained with specified temperature) and Neumann 
(when the rod’s ends are insulated) boundary conditions and their variations. However, a more general type of 
boundary conditions occur when the rod’s ends undergo convective heat transfer with the ambient temperature. In 
this case, we use Fourier’s law of heat conduction 


Ou 


— = (Vu-n)| peor = kU) — uP), (11.2.6) 
PEOR 


where U(t) is the ambient temperature outside domain R, OR is the boundary of R, k is a positive coefficient, 
which must be in units of sec~', and is therefore sometimes expressed in terms of a characteristic time constant 
to given by k = 1/to = —(du(t)/dt)/Au, n is the outward unit vector normal to the boundary OR, and Ou/On is 
the directional derivative of u along the outward normal vector. The time constant to is expressed as to = C/(hA), 
where C = ms = dQ/du is the total heat capacity of a system of mass m, s is the specific heat, h is the heat transfer 
coefficient (assumed independent of wu) with units W/(m?-°K), and A is the surface area of the heat being transferred 
(m7). The boundary conditions (11.2.6) corresponding to the convective heat transfer are referred to as boundary 
conditions! of the third kind. 
In the one-dimensional case, the boundary condition (11.2.6) at the end x = 0 reads as 


—u;(0,t) = k[U(t) —u(0,t)], t>0. 


At other end x = £, we have 
ug (l,t) =k[U(t) —ul(é,t)], tS 0. 


When the method of separation of variables is applied to the heat equation subject to boundary conditions of the 
third kind, it leads to the Sturm—Liouville problems (see §10.1) that do not admit “explicit” solutions and can be 
solved only numerically. 


In Problems 1 through 9, solve the initial boundary value problems. 

1. ut =9 ee +e7* (0 < a <3), u(0,t) = ux (3, t) = 0, ula, 0) = 0. 

2. uw =4Uxe (0 <x < 2), u(0,t) =te, u(2,t) = 0, u(x,0) = 2(2— 2). 

3. Ut = Ura (0 < & < 7/2), ux (0,t) = 1, u(m/2,t) = 7/2, u(x, 0) = sin(2x) cos(4x). 

A. ut = 25 Une (0< 2 <5), ue(0,t) = ue(5,t) = 1, u(e,0) = cos 42. 

5. ut = 16 ure + cos * (0 < x < 2), ue(0,t) = u(2,t) = 1, u(x, 0) = 27-4. 

6. ut = 4Uxe2 + H(t) — H(t — 2) (0 <a < 2), u(0,t) = ux(2,t) = 1, u(a,0) = x(a — 2)?. 

7. Ut =9Uee (0< a < 3), u(0,t) =t, ux (3, t) = 0, u(x, 0) = x. 

8. ut = Ura +t (O< a <1), uz(0,t) = 0, uz (1,t) = 1, u(a,0) = 0. 

9. ut = 25uUze +1 (0< x2 < 10), uz (0,t) = 1, u(10,t) = 0, u(x, 0) = =. 

10. Suppose that an aluminum rod 50 cm long with an insulated lateral surface is heated to a uniform temperature of 20°C, 
and that at time t = 0 one end x = 0 is kept in ice at 0°C while the other end x = 0.5 is maintained with a constant 
temperature of 100°C. Find the formal series solution for the temperature u(x,t) of the rod. Assume that the thermal 
diffusivity is a constant, which is equal to 8.4 x 107° m? /sec at 25°C. In reality, it is not a constant and varies with 
temperature in a wide range. 


11. A stainless steel rod 1m long with an insulated lateral surface has initial temperature u(xz,0) = 1— 2, and at time 
t = 0 one of its ends (2 = 0) is insulated while the other end (a = 1) is embedded in ice at 0°C. Find the formal series 
solution for the temperature u(z,t) of the rod when it is heated with a constant rate g. The thermal diffusivity can be 
assumed to be constant at 4.2x107° m/sec. 


12. Suppose that a wire of length @ loses heat to the surrounding medium at a rate proportional to the temperature u(z,t). 
Then the function u(z,¢) is a solution of the following IBVP: 


Ut = AUsxe — hu, u(0, t) — ualf, t) ae 0, u(a, 0) = f(x). 


By making substitution u(x,t) = e~" v(2, t), find the formal series solution of the problem. 


114Sometimes the boundary conditions of the third kind are mistakenly associated with (Victor) Gustave Robin (1855-1897), a professor 
of mathematical physics at the Sorbonne in Paris. Actually, Robin never used this boundary condition. 
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11.3. Wave Equation 


Consider a string stretched from x = 0 to « = @. Let u(a,t) describe the vertical displacement of the wire at position 
x and time t. If damping effects, such as air resistance, are neglected, the force of gravity is ignored, and oscillations 
of the string are sufficiently small so that the tension forces can be treated as elastic, then u(x,t) satisfies the wave 
equation : ; 
u= St oF 8 =o, Veet ULtSr <5, Cie) 
where 0 = 0, = 07u/0t? — 2? Vu = 07u/dt? — c? 62u/0x? is the wave or d’Alembert operator, simply called 
d’Alembertian, and the positive constant c is the wave velocity, with units [length/time]. Under these conditions, 
the equilibrium position of the string u(x,t) = 0 corresponds to a straight line between two fixed end points. 
D’Alembert!!° discovered around 1744-1746 a strikingly simple method for finding the general solution to the 
wave equation. Roughly speaking, his idea was to factorize the one-dimensional wave operator: 


a? ue? nf a? \ (2 a2 
~ Oo ° da? \ae °dr) \Ot ~ da)’ 


This allows one to reduce the second order partial differential equation (11.3.1) to two first order equations 


Ov Ov Ov Ov 
Ss: 0 and Ot +e De = 0. 


By introducing a new variable € = x + ct, each of the above equations is reduced to a simple ordinary differential 
equation 


dv Ov dx Ov Ot _ Ov , 1 OV 


dé de de HR Be cH’ 
which can be integrated directly. This leads to the conclusion that a solution of the wave equation (11.3.1) is the 
sum 


u(x,t) = fle+ ef) +g(z—c) (11.3.2) 


of two functions f(€) and g(€) of one variable. This formula represents a superposition of two waves, one traveling 
to the right and one traveling to the left, each with velocity c. However, in practice, traveling waves are excited by 


the initial disturbance 3 
u(z,0)=d(z), =| =v(2), (11.3.3) 
Ot |= 
where d(x) is the initial displacement (initial configuration) and v(x) is the initial velocity of the string. Upon 
substituting the general solution (11.3.2) into Eq. (11.3.3), we arrive at two equations 


d(z) = f(x) + 9(2), 
v(e) = f'(x) —cg'(c). 


Integrating the latter, we get 


[ v(x) da = cf (x) —cg(a). 
0 


This enables us to express the general solution in terms of the initial displacement and the initial velocity (called 
the d’Alembert’s formula) 


u(a,t) = 


cere Teese) 5. 2 [wee (11.3.4) 


2 2¢ Ja—ct 
The above formula allows us to make an important observation. Although the wave equation only makes sense for 
functions with second order partial derivatives, the solution (11.3.4) exists for any continuous functions d(€) and 
v(€) of one variable €. (Discontinuous functions cannot represent displacement of an unbroken string!) Therefore, 
justification that the function u(x,t), defined through the formula (11.3.4) for continuous but not differentiable 
functions, satisfies the wave equation (11.3.1) requires a more advanced mathematical technique, which is usually 
done with the aid of generalized functions (distributions). 


115 Jean-Baptiste le Rond d’Alembert (1717-1783) was a French mathematician, mechanician, physicist, philosopher, and music theorist. 
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Also, the formula (11.3.4) shows that the solution of the initial value problem for the wave equation (11.3.1), 
(11.3.3) is unique. Moreover, the formula presents u(x,t) as the sum of two solutions of the wave equation: one 
with prescribed initial displacement d() and zero velocity, and the other with zero initial displacement but specified 
initial velocity v(z). o 

Assuming the string is fixed at both end points, the displacement function u(z, t) will satisfy the initial boundary 
value problem that consists of the wave equation (11.3.1), the Dirichlet boundary conditions 


u(0, t) = u(2,t) = 0, (11.3.5) 


and the initial conditions (11.3.3). For compatibility, we require that the initial displacement be such that d(0) = 
d(é) = 0. 
First, we try to satisfy the first boundary condition u(0,t) = 0. From the general formula (11.3.2), it follows that 


0 = f(ct) + g(—ct) 


for all t > 0, so that g(€) = —f(—&) for any value €. Thus, the solution of the wave equation that satisfies the 
boundary condition u(0,t) = 0 is expressed through one (unknown) function 


u(x,t) = f(a+ct) — f(ct— 2). (11.3.6) 


Physically, this means that the wave, traveling to the left, hits the end x = 0 of the string and returns inverted as a 
wave traveling to the right. This phenomenon is called the Principle of Reflection (see Fig. 11.1, page 614). 

Substituting u(«,t) from Eq. (11.3.6) into the other boundary condition u(é,t) = 0, we get f(@+ ct) = f(ct — 2) 
for all t, so that 


F(6) = FE + 26) 


for all values of €. This means that the function f(€) in Eq. (11.3.6) must be a periodic function with the fundamental 
period T = 2¢. As a result, the function f(€) admits a Fourier series representation. To find the corresponding 
coefficients, we use the separation of variables method. 

In order to apply the separation of variables method, we need to find nontrivial solutions of the wave equation 
Ut = C7 Ure, Subject to the homogeneous boundary conditions u(0,t) = u(é,t) = 0, that is represented as the 
product of two functions, each depending on one independent variable: 


u(x,t) = T(t) X(a). 
Substituting this expression into the wave equation, we obtain 
T(t) X(z) =e? T(t) X" (a). 
After division by c? T(t) X (a), we get 


T(t) X"(z) 
——— = — £ bat ; 
2TH X(a)’ O<a<l, 0<t<t<aw 


In the above equation, the prime denotes differentiation with respect to a spacial variable, and the dot represents 
differentiation with respect to the time variable. The only way that the above equation can hold is if both expressions 
equal a common constant, which we denote by —X. This leads to two equations: 


T+erTH=0, 0<t<t* <a, (11.3.7) 
X" (x) +rAX (x) =0, 0<a<. (11.3.8) 

Imposing the homogeneous boundary conditions on u(x,t) = T(t) X(«) yields 
X(0)=0, X(0)=0. (11.3.9) 


Equation (11.3.8) together with the boundary conditions (11.3.9) is recognized as a Sturm—Liouville problem, con- 
sidered in 810.1. Hence, the eigenpairs are 


— (=y", Xn(a) = sin(“*), WSIS 
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With \ = \, = (n7/£)?, Eq. (11.3.7) becomes 


2 
P+() nm=0, 0<t<t <co. 


The general solution of T,,(t) is 


t 
+ Bn sin - “eotoeu.. 


T(t) = An cos ae 


where A, and B,, are arbitrary constants to be specified. Since we consider the homogeneous wave equation 
ii = C? Ure, any sum of partial nontrivial solutions u,(z,t) = T,(t) Xn(x) will also be a solution of the wave 
equation. Further, this sum satisfies the homogeneous boundary conditions u(0,t) = u(é,t) = 0. Now we look for a 
solution of the given initial boundary value problem in the form of the infinite series 


u(z,t) =~ A, cos (=) +B, sin (=) sin (=) (11.3.10) 


nol 


Since its initial displacement is specified, we have 


u(a, 0) = An sin(“*). 


n21 


This equation is the Fourier sine expansion for the initial configuration of the string. Its coefficients are 


An = 5 = fae) ) sin ( | dz,  n=1,2,3,.... (11.3.1) 


Term-by-term differentiation of the series (11.3.10) (this is justified if the series converges uniformly) leads to 


. Ou cnt . fennt cnt cntt | (NTL 
w=Z =D) [-S" a, sin ( 7 )+%s, cos ( 7 )| sin (“*). 


Cn TL 
u(a,0) = ‘Sy ? By, sin (=) = v(x) 
n>1 
Using Eq. (10.5.2) on page 582, we get 
2 e 
B, =—— | v(x) sin (=) dz,  n=1,2,3,.... (11.3.12) 
cnt Jo L 


Series (11.3.10), with the coefficients (11.3.11) and (11.3.12), is the (formal) solution of the given initial boundary 
value problem. 


Example 11.3.1: Solve the initial boundary value problem 


U = 25 Ure, 0<a4<20, 0<t<t <a; 
u(0,t) = u(20,t) = 0, 0<t<t<o; 
u(@,0) =sin(7x), u(#,0)=sin(27z), 0< a < 20. 


Solution. According to Eq. (11.3.10), the given problem has a series-solution: 
t t 
u(x,t) = = [An cos (=) + By, sin ()| sin (=) 


because the string has length = 20. Imposing the initial conditions leads to the equations 


u(x, 0) = sin(7a) =a Ay sin (=) , 


u(a,0) = sin(27x) = re “A. By, sin (=) 


n>1 
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While we could use formulas (10.5.2) on page 582, it is simpler to observe that the initial functions are actually 
eigenfunctions of the corresponding Sturm—Liouville problem. Since Fourier series expansion is unique for every 
“suitable” function, we conclude that these initial functions are Fourier series themselves. Therefore, 


Ago = ae An =0 for n x 20, 
40 


Pao = 1, By, =0 for n 4 40. 


Hence, Bay = 1/(107) and we get the required solution 


1 
u(x,t) = cos(5at) sin(aa) + jon sin(10zt) sin(272). 


Example 11.3.2: (Guitar ) Suppose we pluck a string by pulling it upward and release it from rest (see Fig. 10.23 
on page 583). If the point of the pluck is in the third of a string of length @ (which is usually the case when playing 
guitar), we can model the vibration of the string by solving the following initial boundary value problem: 


ib = C7 Une, O0<ar<h, 0<t<t<o; 
u(0,t) = u(é,t) = 0, 0<t<t* <a; 
3x ‘ 
7 0<aK<za, 
u(2,0) = fla) =) 4 
5p (8-2); Saeed, 
u(x, 0) = v(x) =0, 0<a<. 


Solution. The formal Fourier series solution is given by Eq. (11.3.10). Since the initial velocity is zero, we have 


u(a,t) = ys An, COS (=) sin (>) : 


nol 


The coefficients A,, in the above series are evaluated as 
9 re 
Ay = a f(a) sin (=) dx 


2 (330 . (nee 2 /° 8 _ (NTa 
=; = sin (=) de +5 ry aaa sin (“*) ae 
2 


_ nT nt 1 nt _ nt 
=— (3 sin — — nt cos “* ) +—= (2nz cos — +3 sin“) 
3 nen 3 


nen 3 3 
9 _ ont 
T pipe ee Ng Diane 
This gives the solution 
9 | nn cnmt\ | (nna 
u(a,t) = S- moe ge Cos (=) sin (=) 3 (11.3.13) 


n>1 


The coefficients in Eq. (11.3.13) decrease with n as 1/n?, so the series converges uniformly, and its sum is a 
continuous function. By plotting u(a,t) for a fixed value of t, we get the shape of the string at that time. Its partial 
sum approximations are presented in Fig.11.1 on page 614. 

According to Eq. (11.3.10), the solution of the vibrating string problem is an infinite sum of the normal modes 
(also called the standing waves or harmonics) 


wed 4, cos (=) +B, sin (=) sin (=) a ee (11.3.14) 


The first normal mode (n = 1) is called the fundamental mode or fundamental harmonic: all other modes are 
known as overtones (see §10.3.1). In music, the intensity of the sound produced by a given normal mode depends 
on the magnitude \/ A? + B?, which is called the amplitude of the n-th normal mode. The circular or natural 
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t=0.01 


04h 


Figure 11.1: Several snapshots of the guitar string, Example 11.3.2. 


frequency of the normal mode, which gives the number of oscillations in 27 units of time, is A, = nem /€. The larger 
the natural frequency, the higher the pitch of the sound produced. 

For a fixed integer n, the n-th standing wave u,,(x,t) is periodic in time ¢ with period 2¢/(cn); it therefore 
represents a vibration of the string having this period with wavelength 2¢/n. The quantities cn/(2) are known as the 
natural frequencies of the string, that is, the frequencies at which the string will freely vibrate. The fundamental 
mode oscillates with frequency c/(2¢). The factor A, cos “#* + B,, sin represents the displacement pattern 
occurring in the string at time t. When the string vibrates in normal mode, some points on the string are fixed at 


all times. These are solutions of the equation sin “= = 0. 


Example 11.3.3: (Piano) Sounds from a piano, unlike the guitar, are put into effect by striking strings. When 
a player presses a key, it causes a hammer to strike the strings. The corresponding IBVP is 


(PDE) wy =cuse, O<2<l, 0<t<t* <o, 


(BC) u(0,t)=0, u(ét)=0, 0<t<t* <o, (11.3.15) 
(IC) u(x,0)=0, uz(a,0) = (2), 


where the initial velocity is the step function 


1, whens<a<s+h, 
v(x) = 
0, otherwise. 


Here s is a position of the left hammer’s end and h is the width of the hammer. It is assumed that both s and s+h 
are within the string length ¢. 
Solution. Using series solution (11.3.10), we get 
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where the coefficients B, are determined upon integration: 


2 fet? nex 20 nm og nn(s +h) 


11.3.1 Transverse Vibrations of Beams 


The free vertical vibrations of a uniform beam of length @ are modeled by the beam equation 
UttCusere =0 (0<2< 2), (11.3.16) 


where c? = EI/(pA), E is Young’s modulus (with units in pascal), J is the moment of inertia (units kg-m) of a 
cross-section of the beam with respect to an axis through its center of mass and perpendicular to the (x, u)-plane, p 
is density (mass per unit volume), and A is the area of cross-section (m?). It is assumed that the beam is of uniform 
density throughout, which means that the cross-sections are constant and that in its equilibrium position the centers 
of mass of the cross-sections lie on the z-axis. The dependent variable u(x,t) represents the displacement of the 
point on the beam corresponding to position x at time t. 

The boundary conditions that accompany Eq. (11.3.16) depend on the end supports of the beam. Let us consider, 
for instance, a pin support of the left end « = 0 and a roller at the right end x = @. Both boundary conditions 
prevent vertical translation and both allow rotation. The only difference between a pin and a roller is that a pin 
prevents horizontal movement, whereas a roller does not. This case of boundary conditions is described by 


u(0,t)=0, u(é,t)=0, wee (0,t)=0, Uee(l,t) = 0. (11.3.17) 
To complete the description of the problem, we specify the initial conditions 
u(x,0) = f(x), ur(a,0) = v(x), 0<9< F (11.3.18) 


To solve the IBVP (11.3.16) — (11.3.18), we use the method of separation of variables. Upon substitution of 
u(a,t) = X(«x) T(t) into the beam equation (11.3.16), we get 


X4 (ax) T(t) 


_ _ 34 
X(x) T(t) me 


Taking into consideration the boundary conditions (11.3.17), we obtain the Sturm—Liouville problem 
XA (x) —d4X(z) =0, X(0) = X(2) =X" (0) = X"(0) =0. (11.3.19) 
The general solution of the fourth order equation X“ (x) — \4X (x) = 0 is 
X(x) = A coshAr + B sinhAr+C cosrx + D sin Az, 


where A, B, C, and D are arbitrary constants. The boundary conditions at « = 0 dictate that A = C = 0; 
correspondingly, from conditions at « = ¢, we get 


B sinh A€+ D sin Al = 0, B sinh Aé — D sin Al = 0. 


These are equivalent to B sinh A€ = 0 and D sin X¢ = 0, from which follows that B = 0 because sinh AC 4 0 for 
A > 0. This allows us to find the eigenfunctions and eigenvalues: 


Ris) Ae oz n=1,2,3,...- 
Going back to the equation T + c?\4T(t) = 0, we obtain the corresponding solutions 
Tn(t) = An cos (cAZt) + By sin (cd7t) . 


Forming the product solutions and superposing, we find the general form of the solution: 


2 24 2 2+ 
u(z,t) = > sin — (4, cos > + By sin > ) (11.3.20) 
nol 
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Using the initial conditions (11.3.18), we are forced to choose the values of coefficients A,, and B,, so that the Fourier 


expansions 
NTL 


f(t) = SP Ansin =, v(x) = Y) Bu ed sin 


n>1 no>1 


hold. This leads to the following formulas: 


e v4 
An=5 f f(z) sin" ae, By = —pa f v(@) sin a, m= 1,2)... 
£ Jo £ 0 


n212¢ 
Problems 


Consider the initial boundary value problem for the one-diemnsional wave equation 


Uit = CUce, O<a<%, 0<t<t*<o; 
u(0,t) = u(é,t) =0, 0<t<o@; 
u(x,0) = f(x), ut(z,0) = v(2), O0<ar<. 
In Problems 1 through 10, solve this problem for the given parameter values (c and @) and the given initial conditions. 
1. c=5, €=3, u(x, 0) = 0, w(x, 0) = 5 sin(rz). 
2. c=4, £=1, u(x, 0) = sin(rz), ur(x,0) = 0. 


if 
a oe Peete SRS TE elie Gua 
3(7—2)/(27), if t/3<a<z, 
L, if0<a<1/3, 
4. c= 3, €=3, u(az,0) = ¢ 1, ifl<a< 2, ur(x,0) = 0. 


3-a, if2<4<3, 
5. c= 2, £=2, u(a,0) = 2(2—2)?, ur(x, 0) = sin(272). 
x/5, if0<a2<5, 
(10—2)/5, if5<a2< 10. 
7. c= 2, €= 20, u(a,0) = #?(20 — x), ut(a, 0) = sin re. 


6. c= 1, £=10, u(x, 0) = sinma, ue(x, 0) = 


sin 732, if0<a< é/2, 


aya 
0, fe2<n<e, we) 


8. c= 1 f= fo) = 
1, if0<a<5, 


9. c= 3, £=10, u(x, 0) = 0, u(x, 0) = ; 
0, if5<a< 10. 


0, if0<a2<1, 
a—-l, ifl<2<2, 
3-2, if2<2<3, 
0, if3<an<A4. 


10. c= 2, €=4, u(x,0) = 0, ue (x, 0) = 


11. The initial boundary value problem below models the vertical displacement u(x,t) of a taut flexible string tied at both 
ends with homogeneous initial data and acted on by gravity (with constant acceleration g © 9.81 m/sec”). Solve this 
problem. 


(PDE) tint = CUre +9, 0<2r<l,0<t<t<on, 
(BC) u(0,t)=0, u(Zt)=0, 0<t<t*<o, 
(IC) u(x,0) = 0, u(x,0) = 0. 


12. Using Eq. (11.3.10) on page 612, show that for all x and t, u(x,t + €/c) = —u(—2,t). What does this imply about 
the shape of the string at half a time period? 


13. (Damped vibrations of a string) In the presence of resistance proportional to velocity, the one-dimensional wave 
equation becomes 
tie +2ku =Cuse (O<a<f, 0<t<t* < oo), 
where k is a positive constant. Assuming that k@/(mc) is not an integer, find the general solution of this equation 
subject to the homogeneous Dirichlet conditions (11.3.5) and caused into motion by a displacement d(x) without initial 
velocity. 
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11.4 Laplace Equation 


A temperature distribution that we get when time elapses unboundedly (t ++ +00) is called the steady state 
solution (or distribution). Since it is independent of t, we must have 0u/O0t = 0. Substituting this into two- 
dimensional heat equation (11.1.12), page 603, we see that the steady state distribution satisfies the so called the 
Laplace equation!!® (in two spacial variables): 


V-u=0 or Ure + Uyy = 0, (11.4.1) 


where V = (0/0z,0/0y) is the gradient operator, and uz, and uy, are shortcuts for partial derivatives, so 
Ure = 07 u/Ox? and uyy = 0?u/dy?. Laplace’s equation also occurs in other branches of mathematical physics, in- 
cluding electrostatics, hydrodynamics, elasticity, and many others. A smooth function u(a, y) that satisfies Laplace’s 
equation is called a harmonic function. 


0 u(x,0)=fo(x) 4 
Figure 11.2: The Dirichlet conditions for a rectangular domain. 


For simplicity, we consider a rectangular plate with specified temperature along the boundary. More specifically, 
we impose the Dirichlet boundary conditions 


u(x,0) = fo(a), ul(a,b) = fi(z), O<a<a, 


u(0,y) =go(y), ula,y)=galy), O<y<b, (11.4.2) 


as illustrated in Figure 11.2. A problem consisting of Laplace’s equation on a region in the plane when the values of 
unknown function are specified on its boundary is called a Dirichlet problem or the first boundary value problem. 
Thus, Eq. (11.4.1) together with the boundary conditions (11.4.2) is a Dirichlet problem for the Laplace equation 
over a rectangle. 

Rather than attacking this problem in its full generality, we split it into four auxiliary problems that are easier 
to handle. First we observe that this boundary value problem (11.4.1), (11.4.2) can be decomposed into four similar 
problems, each of which has only one nonzero boundary condition and u(x, y) will be zero along three edges of the 
rectangle. Namely, we consider the following four auxiliary boundary conditions for the Laplace equation: 


u(z, 0) = fo(x), ula, b) = u(0, y) = u(a, y) = 0; 
2. u(x,0) =0, u(a,b)= fi(x), u(0,y) = ula, y) = 0; 
3. u(x, 0) = u(a, b) =0, u(0, ¥) = go(y); u(a, y) = 0; 


4. u(x,0) = u(x, b) = u(0,y)=0, ula,y) = galy). 


1167 aplace’s equation is named after French mathematician Pierre-Simon de Laplace (1749-1827), who studied it in 1782; however, the 
equation first appeared in 1752 in a paper by Leonhard Euler (1707-1783) on hydrodynamics. 
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u= fo(z) 


u = galy) 
Since all these auxiliary boundary problems are similar, we consider only one of them. Namely, we solve the 
Laplace equation (11.4.1) in the rectangular domain 0 < x <a, 0 < y < b subject to the boundary conditions 


u(x,0) = fo(x), (a,b) = u(0,y) = u(a,y) = 0. (11.4.3) 
Substituting u(«, y) = X(x) Y(y) into Laplace’s equation gives X” Y + X Y” =0, so 
xX” y” 
a a 
for some constant A. Since the boundary conditions at « = 0 and x = a are homogeneous, the function X (2) must 
satisfy the familiar eigenvalue problem 
X" +X =0, X(0) = X(a) = 0. 
The eigenvalues and associated eigenfunctions were found previously, hence, 


nr 


2 
An = (“) ‘ X,,(x) =sin—, n=1,2,3,.... 
a a 
From the differential equation Y” — A,Y = 0, we find 


Yn(y) = An cosh ume + B, sinh ne 
a a 
where coshz = 4 (e* +e7*) is the hyperbolic cosine and sinhz = 4 (e* —e-*) is the hyperbolic sine function. 
Therefore, the formal solution of the auxiliary problem #1 is 
u(z,y) = ye [An cosh wes + Br sinh “4 sin —. 
a a a 
nol 
To satisfy the boundary conditions (11.4.3), the coefficients A,, and B, must be chosen such that 
NTx 
0) = = Ay sin —, 
u(e,0) = fo(t) = 7 An sin™ 


n>1 


b b 
u(a,b) =0= x la, cosh — + By, sinh =| sin ae 


no>1 


The first equation is recognized as the Fourier series expansion of fo(x); therefore, 
9 a 
A= - | fo(x) sm dz, n=1,2,.... (11.4.4) 
a Jo a 
The boundary condition at y = b leads to 


b b 
A, cosh iis + B,, sinh a 0. 
a a 
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Hence, the coefficient B,, is expressed through the known value A,: 


b 
By, = —Ay,y, coth uae : 
a 


and the solution of the given auxiliary problem #1 becomes 


ni nab nt NTx 
u(a, y) => ) An cost =m —coth “a sinh a sin a 
no>1 


Other auxiliary boundary problems can be solved in a similar way, and they are left as exercises. A smooth solution 
of Laplace’s equation is unique if it satisfies some boundary conditions. However, not every boundary condition leads 
to existence and uniqueness of the corresponding boundary value problem. Let n be the outward normal vector to 
the boundary OR of the region R. If the outward normal derivative of the harmonic function u(x, y) is specified, 


Ou ae 
= f, we get the boundary conditions of the second kind, usually referred to as the Neumann condition. For 


dn |ap 

the rectangular domain (0 < « < a, 0 < y < b), the Neumann boundary conditions are read as 
Ou Ou Ou Ou 
a = — = ga(y), —(x,0) = , —(x,b) = ‘ 11.4. 
Fe (O.4) = golds Fela) = gals FCO) = folw), Feed) = fila) (11.4.5) 


The functions go(y), ga(y), fo(x), fo(~) in the boundary conditions (11.4.5) cannot be chosen arbitrarily. Since we 
consider a steady state case, the total flux of heat across the boundary of R must be 0. This means that the total 
integral along the boundary must vanish: 


[sow y+ [savday+ [ fo(e)ae + [ f(x) da = 0. 


This is a necessary condition for a Neumann problem to have a solution. Moreover, this solution, if it exists, is not 
unique—any constant can be added. 

Note: Since the boundary of the rectangle has four corner points, we have to impose the compatibility 
conditions at these points. Usually, any irregularity at the boundary leads to a condition on the solution behavior 
in a neighborhood of that point. 


11.4.1 Laplace Equation in Polar Coordinates 


Previously, we solved several problems for partial differential equations using expansions with respect to eigenfunc- 
tions. The success of the method of separation of variables depended to a large extent on the fact that the domains 
under consideration were easily described in Cartesian coordinates. In this subsection, we address boundary value 
problems for a circular domain that are easily described in polar coordinates. 


We consider the steady state (or time-independent) temperature distribution in a circular disk of radius a with 
insulated faces. To accommodate the geometry of the disk, we express the temperature, which is denoted by u(r, ¥), 
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in terms of polar coordinates r and ¥, with « = r cos¥ and y= r sinv. Then u(r,¥) satisfies the two-dimensional 
Laplace equation in polar coordinates: 


Ou 1du 1 du 


v2 


If the function u(r, V) is specified at the circumference of radius a, we get the boundary condition of the first kind 
or Dirichlet condition: 
u(a,V) = fv), O0<V<2r. (11.4.7) 


Equation (11.4.6) subject to the boundary condition (11.4.7) is called the Dirichlet problem for the Laplace 
equation. There are two known Dirichlet problems: the inner (interior) problem when the solution is defined inside 
the circle (r < a), and the outer (exterior) problem, when the solution is defined outside the circle (r > a). 
In order for u(r,¥) to be a single-valued bounded function in the disk 0 < r < a, we require the function to be 
2n-periodic: 
u(r, 0) = u(r, 0 + 27) and u(0,0) < oo. (11.4.8) 


For an outer Dirichlet problem (r > a), the conditions (11.4.8) are replaced by 


u(r, 0) = u(r, 0 + 27) and lim u(r, v) < oo. 


Tr++oo 


To use the method of separation of variables, we seek partial, nontrivial 27-periodic solutions of the inner Laplace 
equation (11.4.6) in the form 


u(r, 0) = R(r) O(V), O<r<a, -t<U<T. 
Substituting into Eq. (11.4.6) and separating variables gives 


PR(r)+r Rr) _ 0") _, 
Rr) a)” 


where A is any constant. This leads to the two ordinary differential equations 


PR" (r) +r Ri (r) —AR(r) =0, (11.4.9) 
6" (0) + A\O(V) = 0. (11.4.10) 

By adding the periodic condition 
Q(9) = O(0 + 2m), (11.4.11) 


we get the eigenvalue problem (11.4.10), (11.4.11) for the function O(v). This Sturm—Liouville problem has nontrivial 
solutions only when is a nonnegative integer \ = n (n = 0,1,2,...) to which correspond eigenfunctions 0,,(0) = 
Gdn cosnd + by, sinnd, where a, and b, are arbitrary constants. 

Equation (11.4.9) is an Euler equation (see §4.6.2) and has the general solution 


Ri(r)=kir” +kor™” ifn>0, and Ro(r)=kitkelnr ifn=0, 


with some constant ki, k2. The logarithmic term cannot be accepted because the function u(r,#) is to remain 
bounded as either r ++ 0 or r +} +00, which the logarithmic term does not satisfy. Thus, the eigenfunction 
corresponding to A = 0 should be chosen as a constant. For 4 = n > 0, one of the coefficients k, or kz must be 
zero to make the function R,,(r) bounded either at r = 0 (inner problem) or at r + +00 (outer problem). Since we 
consider an inner Dirichlet problem, the partial nontrivial solutions will be 


Un(7, V) = Rp(r) OW) =r” (ay cosnd + by sin nv) , n=0,1,2,.... 
By forming an infinite series from these nontrivial solutions, we express u(r, ?) as their linear combination: 


u(r, 0) = + + S- r” (dn cosnd + by sin nv) . 


n>1 
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It is more convenient to write this series in the equivalent form 
u(r, 0) = alee S- (=) (a, cosnld + bp sinnd) (r <a). (11.4.12) 
2 n2>1 


The boundary condition (11.4.7) then requires that 


u(a, 0) = f(v) = gt S- (ay cosnv + by sinnv) , 


n>1 


which is the Fourier expansion of the function f(v). Its coefficients are obtained from the Euler—Fourier formulas 
(10.3.4), page 563. 
For the outer Dirichlet problem, the series solution is 


ao a\” : 
u(r, 0) = = “+ » (=) (a, cosnvd + b,psinnd) (r >a), (11.4.13) 
where coefficients a, and b, are determined by Eq. (10.3.4), page 563. | 
If the outward normal derivative is specified at the boundary such that 
Ou(r, V 
oun) = 99), O<0<2r, (1.4.14) 


then we have the boundary condition of the second kind or Neumann boundary condition. The corresponding 
boundary value problem (11.4.1), (11.4.14) is called the Neumann problem or the second boundary value problem. 
There are also two known problems—the outer (for r > a) and the inner (for r < a) problems. 

The procedure to solve the Neumann problem is exactly the same as before, and its solution is represented either 
by formula (11.4.12) for the inner problem or (11.4.13) for the outer problem. To satisfy the boundary condition 
(11.4.14), we have 


Ou(r, Y) 1 


Or | _, =g(0) = — d n (ap cosnd + b,sinnd) (inner problem), 
v 1 
Ou(r, V) = 90) =-- b> n (ap, cosnd +b, sinn’) (outer problem). 
u r= n>1 


Since the above Fourier series do not contain the free term ag, the term must be zero: 


1 us 
a == | g(0) dd = 0, 


—T 
which is the compatibility constraint. For the inner problem, the Euler—Fourier coefficients are 


a qT a qT 


An = — g(8) cos(n¥) dv, b, = — g(¥) sin(nd) dv. 


nT J_7z NT J_7z 


For the outer Neumann problem, the coefficients have the same values but opposite signs. 


1. Solve the second auxiliary interior Dirichlet auxiliary boundary problem in the rectangle V2u = 0,  u(x,0) = 0, 
u(a,b) = fo(x), u(0,y) = ula, y) = 0. 

2. Solve the third auxiliary interior Dirichlet auxiliary boundary problem in the rectangle V?u = 0, u(x,0) = u(a,b) = 0, 
u(0,y) = goly), ula,y) = 0. 

3. Solve the fourth auxiliary interior Dirichlet auxiliary boundary problem in the rectangle V?u = 0, —-u(a,0) = u(x, b) = 
u(0,y) =0, ula,y) = ga(y). 

4. Solve the first auxiliary Dirichlet boundary value problem in the rectangle 0 < x < 7,0 < y < 1, given fo(x) = 
5 sin3a — 1. 


5. Solve the second auxiliary Dirichlet boundary value problem in the rectangle 0 < x < 3,0 < y < 2, given f(x) = 
x(3— 2). 
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14. 


15. 
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Solve the third auxiliary Dirichlet boundary value problem in the rectangle 0 < x < 1, 0 < y < 1/2, given go(y) = 
7 sin(47y). 
Solve the fourth auxiliary Dirichlet boundary value problem in the rectangle 0 < x < 4,0 < y < 1/4, given ga(y) = 
5 sin(87y). 
Solve the Neumann problem for Laplace’s equation in the rectangle 0 < x < 2, 0 < y < 1 subject to the boundary 
conditions 


Ou _ Ou 

Oy y=0 Oy y=1 
Solve the Neumann problem for Laplace’s equation in the rectangle 0 < x < 2, 0 < y < 4 subject to the boundary 
conditions 


du 
02 | ,-9 


Ou 
Ox 


Ou 
Ox 


_ ou 
Ox 


= 0. 


y=4 


= - Oy 


x2=0 


Solve the third boundary problem for Laplace’s equation in the rectangle 0 < x < 1/2, 0 < y < 1 subject to the 
boundary conditions uz (0, y) = u(m/2, y) = u(x, 1) = 0, u(#,0) = 2. 


. Solve the third boundary problem for Laplace’s equation in the rectangle 0 < « < 7, 0 < y < 27m subject to the 


boundary conditions u(0,y) = us(m, y) = u(x, 0) = 0, u(a, 27) = sing. 


. Solve the third boundary problem for Laplace’s equation in the rectangle 0 < x < 4,0 < y < 1 subject to the boundary 


conditions uz(0,y) = 1—y, u(4, y) = uy(#,0) = u(a#, 1) = 0. 
Find the solution u(x, y) of Laplace’s equation (11.4.1) in the semi-infinite strip 0 < x < o0, 0 < y < 1 that satisfies 


the boundary conditions 

u(x, 0) = 0, u(x,1) =0, u(0, y) = 1 — cos(sy). 
A Newton potential is a harmonic function of the form v = v(r), where r is the distance to the origin, which is 
defined for all r > 0. Find all Newton potentials on R? that are solutions of the differential equation a (r ) =0. 
When a harmonic function in cylindrical coordinates does not depend on angle ¥, it is called axially symmetric. 
Assuming that partial nontrivial solutions of Laplace’s axially symmetric equation 


Urr + (1/r) Ur + Uzz = 0 


can be written in the form u(r,z) = R(r) Z(z), show that these functions R(r) and Z(z) are solutions of the following 
ordinary differential equations: 
rR +R’ +rR=0, Z" —-N’Z=0. 


The equation for R(r) is the Bessel equation of order zero. 


Consider Laplace’s equation in the domain a < r < b bounded by two circles r = a and r = b written in polar coordinates. 
Since the boundary of the annulus consists of two circles, the boundary conditions should be specified at each circle. This 


time, the origin is not in the problem domain. Therefore, there is no reason to discard the separation of variables solutions 


Inr and r” or r— 


” as we did when solving boundary value problems for circular domains. In Problems 16 through 20, solve 


Laplace’s equation (11.4.6) in the annulus having given inner radius a, given outer radius b, and subject to the given boundary 


conditions. 
16. (l<r<3) u(1,¥) =cos3¥, u(3, 0) = sin 20. 
17. (2<r<5) ur(2,0) =cos20, u(5, 0) = sin 4d. 
40, if0<0 
is C2rea1eH=™ er 
0, ifm <v < 27; 
0, if0<v<qT, 
cos50, ifm<0 < 2n. 
19. (l<r<5) u(1,¥) =cos5v, ur (5, 7) = cos 30. 
20. (2<r <3) in 0) =cos4¥, u,(3, V0) = sin 70. 


In Problems 21 through 26, you are asked to solve Laplace’s equation (11.4.6) in circular domains; therefore, it is 
appropriate to use polar coordinates. Using separation of variables, solve the corresponding outer or inner boundary 
value problems. 


21. 


Find the harmonic solution u(r, ¥) for r > 2 that satisfies the boundary condition u(2,0) = J. Note that this 
function is discontinuous at J = 0. 
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22. 
23. 


24. 
25. 
26. 


Determine the harmonic solution u(r, ¥/) for r < 3 that satisfies the boundary condition u(3, 0) = 2 — 2 cos 2v. 
Find the harmonic solution u(r, ¥) for r > 4 that satisfies the boundary condition 

MoO= | tance 

Find the harmonic solution u(r, ¥) for r < 1 that satisfies the boundary condition u(1, J) = 3 cos3v. 

Find the harmonic solution u(r, ¥) for r < 2 that satisfies the boundary condition u,(2,0) = sin 30 — 4 cos 5v. 


Find the harmonic solution u(r, ¥) for r > 3 that satisfies the boundary condition 


(3,0) 0, if0<V<1, 
Ur\9, = 
cos30, ifm@#<v < 2r. 


Consider Laplace’s equation in the circular sector 0 < r < a, 0 < J) < a. In Problems 27 through 29, solve Laplace’s 
equation (11.4.6) in the sector subject to the given boundary conditions. 


27. 


28. 


29. 


Find the harmonic function in the circular sector 0 < r < V2, 0 < 0 << 1/4 subject to the boundary conditions 
u(r,0) = 1, u(r, 7/4) = 147, u(V2, 0) = 142 sind. 

Find the harmonic function in the circular sector 0 < r < 1, 0 < 0 << m/2 subject to the boundary conditions 
u(r, 0) = r?, u(r, 7/2) = —r? — r3, u(1, 0) = cos 20 + sin 3¥. 

Find the harmonic function in the circular sector 0 < r < 1, 0 < J << a/4 subject to the boundary conditions 
u(r, 0) = 0, u(r, 7/4) =1+r*, u(1,d) = 1—cos 40 + sin 30. 
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aly 


The heat transfer (or conduction) equation uz = c? Vu, subject to various boundary conditions and the initial condition 
u(z,0) = f(x), has a unique solution that can be obtained by using the separation of variables method (also known as 
the Fourier method). 


. The separation of variables method is applicable only when the boundary conditions are homogeneous. Otherwise, it 


requires shifting the boundary data. 


. When the separation of variables method is applied to the one-dimensional heat equation uz = c?Urx with the Dirichlet 


boundary conditions u(0,t) = u(é,t) = 0, it leads to the Sturm-—Liouville problem X” + AX = 0, X(0) = X(£) = 0, 
where u(x, y) = X(x) T(t). Finding the eigenfunctions X,(x) = sin“, n = 1,2,..., we build the general solution as 
a sum of all partial nontrivial solutions: 


22 nme 
= At os 
u(x,t) = ) Cre °° sin 


n>1 


where the coefficients C, are determined by the Euler—Fourier formulas (11.1.10), page 600. 


. When boundary conditions are not of the first kind (Dirichlet), it leads to a Sturm-—Liouville problem subject other 


boundary conditions. Then there exists a series solution in the form 7,5, In(t) Xn(x), where Xn(z) are eigenfunctions 
of the corresponding Sturm—Liouville problem. 


. When the boundary conditions are not homogeneous, it requires a shift of these data to make them homogeneous and 


then application of the separation of variables method. 


. The wave equation wit = C’Uxz can also be solved by separation of variables. The solution of the corresponding initial 


boundary value problem (11.3.1), (11.3.2), and (11.3.5) is given by formula (11.3.10), which is an infinite sum of the 


normal modes 
t t 
Un(a#,t) = [40 cos (= ) + B,, sin (=) sin (>) ; 7 Ss 1, 285208 


The first normal mode is called the fundamental mode or fundamental standing wave; all other modes (standing waves) 
are known as overtones. 


. A steady state temperature distribution u(x, y) inside a plane domain satisfies the Laplace equation uzz + Uyy = 0 or 


simply V?u = 0, where V is the gradient operator. Any smooth solution of Laplace’s equation is called the harmonic 
function. 


. For Laplace’s equation in some planar domain R, it is common to consider three types of boundary conditions: the first 


kind or Dirichlet u|,, = f, Neumann or second kind 0u/On|,, = f, and the third kind uly, + kOu/On|y, = f, where 
f is a given function and Ou/On|5, is the outward normal derivative to the boundary. 


. The following trigonometric identities are proved to be useful in many calculations: 


2 cos? 6 = 1 + cos 26, 2 sin? 6 = 1 — cos 20. 
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Figure 11.3: Standing waves. 


Review Questions for Chapter 11 


Section 11.1 
In Problems 1 through 4, find a formal solution to the given initial boundary value problem subject to the Neumann 
boundary conditions 
Ut = Aer uz (0,t) = uz(Z,t) =0, u(a#,0) = f(x), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(x) are specified. 
1. a=9, €=1, and f(a) = cos(mx) — 10 cos(3rz). 
2. a=1,€=4, and f(x) = 5a (violates the compatibility constraint). 
3. a=9, €=7, and f(x) = 2? (violates the compatibility constraint). 
4. a=1, =1/2, and f(x) = cos? (27x). 


In Problems 5 through 8, find a formal solution to the given initial boundary value problem with boundary conditions of 
the third kind 
Ut = AUce u(0, t) = ua (Z, t) = 0, u(x, 0) = f(a), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(x) are specified. 
5. a=4,€=7, and f(x) =a sing. 
6. a=4, €=1, and f(x) = 2? — x (violates the compatibility constraint). 
7. a=1,£=7/2, and f(x) =x cosa (violates the compatibility constraint). 
8. a=1, £=1/2, and f(x) = 2(1/2— 2)’. 


In Problems 9 through 12, find a formal solution to the given initial boundary value problem with boundary conditions 
of the third kind 
Ut = AUce uz (0, t) _ ull, t) _ 0, u(x, 0) = f(a), 


when the coefficient of diffusivity a, the length of the rod @, and the initial temperature f(x) are specified. 
9. a= 16, = 2m, and f(z) = 5 cos %. 
10. a=4, €=1, and f(x) = 2? — 2x (violates the compatibility constraint). 
1l. a=4, €=7, and f(x) = cosa (violates the compatibility constraint). 
12. a=9, €=3, and f(x) = 2? (3-2). 


13. The ends of a thin, laterally insulated bar of length 7 are maintained at a temperature of zero degrees. At time t = 0, 
the temperature profile is u(z,0) = 59 sin(x). At time t = 1/2, the temperature u(z,t) at the center x = 7/2 of the 
bar has decreased to a value of about 8°C; that is, u(7/2,1/2) + 8. What is the thermal diffusivity of the bar? What 
is Ua(m, 1)? 


14. The ends of a thin, laterally insulated bar of length 2 are maintained at a temperature of zero degrees. At time t = 0, 
the temperature profile is u(z,0) = 100(1—cos7a). At time t = 1, the temperature at x = 1 has decreased to a value 
of about 49; that is, u(1,1) + 49. What is the thermal diffusivity of the bar? What is wz(0,1)? 
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15. 


Consider a rod of length @ = 1, one end of which is maintained at zero temperature, u(0,t) = 0, but another end x = 1 
of the rod is in a solution of water fixed at the same temperature of zero. (Zero refers to some reference temperature.) 
Using Newton’s law of cooling, the boundary conditions can be shown to be 


u(0, t) = 0, ue(1,t) + hu(1,t) = 0, 


where h > 0 is a constant that depends on the rate of heat flow across the boundary. Find the temperature in such 
insulated rod. 


Section 11.2 of Chapter 11 (Review) 


1. 


Assume that a thin, laterally insulated bar of 10 units length has its two ends maintained at constant temperatures, 
u(0,t) = 10 and u(40,t) = Tv, which is to be determined. At time t = 0, the initial temperature in the bar is known to 
be u(x,0) = a. A probe inserted into the bar center measures the temperature and finds it to be approximately 27°C 
and 29°C at times t = 1.45 and t = 2.5. Determine the unknown end point temperature JT; and the thermal diffusivity 
of the bar’s material. 


. Find all solutions of the form u(x,t) = X(x) T(¢) for each of the equations. 


(a) w= rune + Lux, (b) we = User + 2ua- 


In each Problem 3 through 7, use the separation of variables method to find a formal solution of the given initial boundary 
value problem. 


3. 
4. 
5. 
6. 
7. 


Ut =4Ure (0<a2<2), u(0,t) =0, u(2,t) =2e*, u(x,0) =a. 

Ut = 25 Use (O< a <5), u(0,t) = 20, u(5,t) = 10, u(w,0) = 20 — 2a + sin(272). 
Ut =9Uee +2 (0<2 <3), ux(0,t) = 0, u(3,t) = 90, u(x, 0) = 102. 

Ut = Ure +2(x—1)? (0<a@<1), u(0,t) =0, ue(1,t) = 0, u(x,0) = 0. 

ut = 4Uee + cos(rx) (0<a@ <2), ux(0,t) = 0, ux(2,t) = 0, u(x, 0) = 2 cos? (ra). 


In Problems 8 through 13, use separation of variables to find the series solution u(x,t) = }7,51 Xn(x) Tn(t) of the given initial 
boundary value problems for the heat conduction equation with non-smooth temperature profiles. Then plot w3o(x,t) for 
t=1, 5, 10, where war(a,t) = 0, Xn(x) Tn(t) is the truncated partial sum with M terms. Observe a striking characteristic 
of the heat equation expressed in the smoothing effect on the initial temperature distribution. 


8. 


9. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


a+3, if0<a4<2, 
1+4(5-2), if2<a<5. 


Ut = Ure (0< a <5), u(0,t) =3, u(5,t) =1, u(a#,0) = 


1, if0<a2<2, 


=3tert1(0<2<3), ux(0,t)=0, ue(3,t)=0, u(x, 0) = 
es a ue) i f2<2<3. 


2, if0<a<3, 
0, if3<a<4. 


ut =4Ure (O< a <2), u(0,t) =1, u(2,t)=1, u(x,0) = |cosza}. 


sin’(ra), if0<a <1, 
1, ifl<a2<3. 


ut = 8Ure (0< a <4), u(0,t) = 2, uz(4,t) =0, wen={ 


Ut =9Ure (O< 2 <3), uz(0,t) = 0, u(3,t) = 1, u(z,0) = 


0, if0<2<7/2, 


Ut = Ure (O<a<m), wu(0,t) =0, ,t)=1, u(x,0) = 
: ( ) (0,4) ae a ee ifm/2<a<T. 


Consider a bar 50 cm long that is made of a material for which a = 4 and whose ends (x = 0 and x = 0.5) are insulated. 
Suppose that the bar is heated by an external source with a rate which is proportional to 1— x”. Derive the formal 
series solution assuming the bar had initial zero temperature. 


A copper rod 5cm long with an insulated lateral surface has initial temperature u(x,0) = (1 — 2x)”, and at time 
t = 0 both of its ends (x = 0, x = 0.05) are maintained with a constant temperature of 40°C. Find the temperature 
distribution in the rod assuming that its thermal diffusivity is constant with the value a ¥ 1.11 x 107‘ m?/sec. 


Suppose that a tin rod 20cm long with an insulated lateral surface is heated to a uniform temperature of 20°C, and 
that at time ¢ = 0 one end (x = 0) is insulated while the other end (x = 0.2) is maintained with a constant temperature 
of 40°C. Find the formal series solution for the temperature u(x,t) of the rod. Assume that the thermal diffusivity is 
a constant, which is approximately equal to 4 x 107° m/sec at 20°C. 


Consider a silicon rod of length 30cm whose initial temperature is given by u(a,0) = x?(30 — 2)/15. Suppose that the 
thermal diffusivity is approximately a ~ 0.9 cm/sec, and that one end (a = 0) is insulated while the other end (x = 30) 
is maintained with a constant temperature of 20°C. Find the formal series solution for the temperature u(x,t) of the 
rod. 
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18. One face of the slab 0 < x < 2¢ is insulated while the other end x = 22 is kept at a constant temperature 20°C. The 
initial temperature distribution is given by u(z,0) = 0 for 0 < a < @, u(w,0) = 20 for € < x < 2. Derive the formal 
series solution. 


19. Let a niobium wire of length 1m be initially at the uniform temperature of 20°C. Suppose that at time t = 0, the 
end x = 0 is cooled to 5°C while the end x = 1 is heated to 55°C, and both are thereafter maintained at those 
temperatures. Find the temperature distribution in the wire at any time assuming the thermal diffusivity to a constant 
a 2.5 x 107° m/sec. 


Section 11.3 of Chapter 11 (Review) 


1. The derivation of the wave equation starts by obtaining an equation of the form ut = Css, where s is arc length. 
Upon introducing the more convenient coordinate, the spacial variable , its derivative is expressed as ds/dz = /1 4+ u2. 
Show that the wave equation becomes 


Utt = rea (1 + uz)? Une, 
which leads to the linear equation (11.3.1) when u2 is disregarded as being small. 


2. A string has length 7, and units are chosen so that c = 1 in the wave equation (11.3.1). Initially, the string is in its 


equilibrium position, with velocity Se (a, 0) = sina — sin(3az). Determine the displacement function u(z, t). 


3. Use the method of separation of variables to derive a formal solution to the telegraph equation wuz: + uz + u = Cue, 
when 471°c? > 30, in the finite domain 0 < x < £ subject to the boundary and initial conditions 


4, Find all product solutions u(«,t) = X(x) T(t) of the modified wave equation ure + ku = Cuce- 
Consider the initial boundary value problem for the wave equation 


wi =CuUsr, O<a<l, 0<t<t <oco; u(0,t)=u(Zt)=0, O0<t<oc; 
u(z,0) = f(z), ut(x,0) = v(x), 0<a<. 


In Problems 5 through 8, solve the problem for the given parameter values (c and £) and the given initial conditions. 


5. c= 1, ¢=2, f =0, g = cos? (5772). 7. c=3, €=6, f =cos7ra, g = 0. 
6. c=2,0=4,f=0, 8. c=m,€=2n, f =0, 
1—cos(Q27z), 0<a<2, a. 0 <2 <7; 
v= . = 
0, 2<2<A4; 0, m<au<2n. 


Consider an elastic rod of length @. The rod is set in motion with an initial displacement d(x) and initial velocity v(x). 
In each Problem 9 through 14, carry out the following steps given longitudinal velocity c, the length @, and the initial and 
boundary conditions. 


(a) Solve the wave equation uit = CUnn subject to the given initial and boundary conditions. 
(b) Plot u(x,t) versus x € [0,4] for several values of t = 1, 2, 10, and 20. 

(c) Plot u(a,t) versus t for 0 < t < 20 and for several values of « = 0, €/4, £/2, 30/4, and &. 
(a) 

9. ute = Ure (0< @ < 7/2), u(0,t) = u(m/2,t) = 0, u(x,0) = 2, u(x, 0) = (Qe — nm)’. 
10. wit = 25 tee (0 <a < 10), ux(0,t) = u(10,t) = 0, u(x, 0) = 0, us(x,0) = 2?(10 — 2). 
11. wit = 64tae (O< a4 <4), ux (0,t) = u(4,t) =0, u(#,0) =2(4—2), w(a,0) = 1. 


12. uit = 36tcz (0 < & < 3), u(0,t) = uz(3,t) = 0, u(x,0) = 0, w(x,0) = 0 for 0 < x < 1 and w(2,0) = x — 1 for 
l<2r<3. 


13. ure = 4use (0<@ <1), u(0,t) = u(1,t) =0, u(x, 0) =sinrx, ur(x,0) = 2 cos?(3rx) — 1. 
14. wit = 16 tae (O< a < 2), u(0,t) = ue (2,t) = 0, u(w,0) = sin(37a/4), ur(x,0) = cos(3mx/4). 


Draw a three-dimensional plot of wu versus x and t. 


Mechanical waves propagate through a material medium (solid, liquid, or gas) at a wave speed which depends on the 
inertial properties of that medium. There are two basic types of wave motion for mechanical waves: longitudinal waves and 
transverse waves. In a transverse wave, particles of the medium are displaced in a direction perpendicular to the direction of 
energy transport. In a longitudinal wave, particles of the medium are displaced in a direction parallel to energy transport. 
Longitudinal waves are observed, for instance, in elastic bars or rods when their vertical dimensions are small. By placing the 
z-axis along the bar’s direction so that its left end coincides with the origin, we can assume that its vibrations are uniform 
over each cross-section. Then the longitudinal displacement u(x,t) satisfies the one-dimensional wave equation (11.3.1). In 
Problems 15 through 20, find longitudinal displacements in rods under the given initial and boundary conditions. 
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15. 
16. 
17. 
18. 
19. 


20. 


21. 


22. 


23. 


24. 


25. 


26 


Utt = Ure (0O< a <7), u(0,t) = uz(a,t) =0, u(a,0) =z, u(x,0) = 0. 
ute = 4Ure (0< @ < 2), ue (0,t) = u(2,t) = 0, u(x,0) = cos(mx), ur(a,0) = sin(27a). 
Utt = 9Urx (0 <a <3), ux(0,t) = ux(2,t) = 0, u(x,0) = 27(3 — 2)”, ur(x,0) = sin(272). 
Ute = Cure — 2hur (0< a <£), u(0,t) = u(é,t) = 0, u(x, 0) = f(x), ue(a, 0) = 0. 
Ute = 25 Uae (0O< a4 <5), Ue(0,t) = u(5,¢) =0, u(ae,0) = cos 42, u(x, 0) = x(a — 5). 

0, O<a<il, 
Utt = Ura (0< a4 < 2), u(0,t) = ux(2,t) = 0, u(x,0) =0, we(x,0) = i ees 
Consider the wave equation with nonhomogeneous boundary conditions: 


(PDE) wt =Cure, 0<4<l,0<t<t*<o, 

(BC) u(0,t) =a, u(é,t)h=b, 0<t<t<o, 

(IC) ula, 0) = d(x), ur (x, 0) = v(z), 
where a and 6 are not both zero. This problem can be reduced to a similar problem with homogeneous boundary 
conditions by substitution u(x,t) = ui(x,t) + u(x,t), where ui(x,t) =a+ 44a. Unlike the case of the heat equation, 
the solution u(x,t) of the given wave equation does not have a limit in general as t + oo. Hence, ui(x,t) is not the 


time-asymptotic form of the solution. Show that the solution u(x,t) wiggles about wi(x,t). For this reason, a physicist 
still call ui(x, t) the equlibrium solution. 


Show the total energy 
£ £ 
E(t) =< / (u.)2dex + s/ (ue)?de, 
0 2 Jo 
where u(x,t) is a solution of the IBVP (11.3.1), (11.3.3), (11.3.5), is independent of time. 


If a piano string of length 0.5m has the fundamental frequency 261.626 Hz (it corresponds to “middle C” or “Do” in the 
European terminology), what is the numerical value of constant c in the wave equation? 


A wire of length 50cm is stretched between two pins. The velocity constant c in the corresponding wave equation 
(11.3.1) is approximately 560 m/sec. The wire is plucked at a point 10 centimeters from the end by displacing that point 
0.1 cm. This caused the initial wire profile to be the union of two straight lines connecting end points with the plucked 
point. Determine the displacement function of the wire, assuming zero initial velocity. 


To approximate the effect of an initial moment impulse P applied at the midpoint x = ¢/2 of a simply supported beam, 
solve the beam equation (11.3.16) subject to the boundary conditions (11.3.17) and the initial conditions (11.3.18), 
where f(x) = 0 and the initial velocity is 


P we é 

v(x) = Bpe? if BES ESTE, 
0, otherwise. 

Then find the limit as e > 0. 


. Solve simply supported beam equations (11.3.16), (11.3.17), which is put into vibration by constant initial velocity, so 
f(x) = 0 and g(x) = vo in Eq. (11.3.18). 
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1 


2 
3 


In P 


. Show that the real and imaginary parts of (2 + jy)” are harmonic functions on the plane R*, where n is a positive 
integer and j? = —1. 


. Find all product solutions u(x, y) = X(x) Y(y) of the Helmholtz equation uzz + Uyy + ku =0, where k > 0. 


. Apply separation of variables to obtain a formal series solution to the Poisson equation V7u = f(x,y) inside rectangle 
0<2<2,0<y< 7, subject to the homogeneous boundary condition u = 0 on all four sides of the rectangle. Consider 
f(x,y) = 1+ ay inside the rectangle. 


roblems 4 through 9, solve Laplace’s equation V?u = 0 in the rectangular domain (0 < x < a, 0 < y < 6) subject to the 


given boundary conditions. 


4. 


5 


6 
7 
8 


(0<a<1,0<y< 2) uz(0,y) = u(1, y) = 0, u(x, 0) = 0, uy (x, 2) = cos 27a. 
.(0<2<2,0<y<3) ul0,y) =y’, ul(2,y) = 0, u(x, 0) = 0, uy(x,3) = 0. 
.(0<a<1,0<y< 1) uc(0,y) =0, uc(1, y) = 3 cos3ry, uy(x,0) = uy(x, 1) = 0. 
.(0<a< In, O<y< 1) u(0,y) = u(27, y) = 0, uy(x, 0) = cosa, u(x, 1) = 0. 

a 


0<a<m, 0<y< 2r) u(0,y) =0, us(z,y) = 0, u(x, 0) = 0, uy(x, 27) = a. 
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9. (0<a<4, 0<y< 2) u(0,y) =0, us (4,y) =siny, uy(x,0) = uy(x, 2) = 0. 


10. Consider the general Dirichlet problem (11.4.1), (11.4.2) for the Laplace equation in a rectanglea <a <a,0<y<b. 
Its solution can be represented via the series 


u(xz,y) = yA sin (=) sinh a (b—y)+ So Bn sin (=) sinh y 


n21 n21 b 
nt nTx nee nre 
+ PS sinh = (a — x) sin 5 Y + me sin 5) Sin (> ¥ 


Find the values of coefficients An, Bn, Cn, and Dn. 


In Problems 11 through 16, a circular disk of radius a is given, as well as a function defined on the boundary r = a of the 
disk. Using separation of variables, solve the corresponding outer or inner boundary value problem. 


ll. r< 2, u(2,0) =8 sin3v. 144.r<4, u(4,0) =V0Q7r-V). 
12. r>3, u,(3,0) = 30? — 300 — 2°. 15. r<1, u,-(1,0) =8 sin 4d. 
13. r>1,  ur(1, 0) =3 cos3v. 16. r>2, u,-(2,0)=V0-—7. 


Consider Laplace’s equation in the domain a < r < 6 bounded by two circles r = a and r = b written in polar coordinates. 
Since the boundary of the annulus consists of two circles, the boundary conditions should be specified at each circle. In 


Problems 17 through 19, solve Laplace’s equation (11.4.6) in the annulus having given inner radius a, given outer radius b, 
and subject to the given boundary conditions. 


0 ifo0o<d in2v, if0<¥ 
17. (3<r<4)u(3,0)=2 ee: ae gene me em 
cos30, ifm<U< 2z; 0, ifam@<v < 2a. 


—4d, if v 2 if v 2 
is Gereaiasdt UPS A | pew” me ae 
; if 7/2<0 < 2r; 7x —40, if 37/2 <0 < 27. 
19. (2<r<5) u(2,¥) =sin3v, u(5,0) = cos 60. 
20. (l1<r <3) u(1,¥) =sin3v,  u-(3, 0) = sin 5v. 


Chapter 12 


Left: unique solution. Right: multiple solutions 


Boundary Value Problems 


As a result of separating variables in a partial differential equation presented in §11.2, we may encounter the 
nonhomogeneous differential equation 


-< [b(a) =| +q(x)u = Ap(x)ut f(x), (12.0.1) 
£ dx 
subject to some boundary conditions. Unlike homogeneous equations, the boundary value problem for the driven 
differential equation (12.0.1) does not need to have any solution (even trivial) or may have infinitely many solutions 
(see figure on this page). 

In this chapter, we will derive a procedure to find solutions of boundary value problems based on Green’s functions 
and discuss some Sturm—Liouville boundary value problems that play an important role in physics applications. Note 
that we used Green’s function previously for differential equations (§4.8) and initial value problems (85.5). 


12.1 Green’s Functions 


This section gives an introduction to Green’s functions and their applications for self-adjoint differential operators 
of the second order. We start with a motivational example. 


Example 12.1.1: Consider the cable or board of length @ shown in Figure 12.1 (page 630) that stays motionless 
along horizontal x-axis in its equilibrium position. It is pinned at « = 0 and x = @ and subject to a vertical load with 
a density of w(a) units of force per unit length. The loading density includes the cable weight. Under the loading, 
the cable deforms at distance y(x) from its equilibrium position. 

To derive a corresponding mathematical problem, we cut off a differential cable segment and impose the conditions 
of static equilibrium because the cable is motionless. Newton’s second law tells us that the sum of the forces acting 
upon the differential segment in both the horizontal and vertical directions is zero. Let Ta) be the tension at the 
point x. This force is applied tangentially to the cable curve because it offers no resistance to bending. Then the 
state equilibrium constraints are 


T(x + dz) cos@(x + dx) — T(x) cos@(x) =0 (projection on abscissa), 


12.1.1 
T(x+dz) sin@(x + dx) — T(x) sin0(x) = w(x) dx (projection on ordinate). ( ) 
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Figure 12.1: The cable is pinned at x = 0 and x = @. 


T(x + dz) 


x+dzx 


I 

I 

I 
av 


Figure 12.2: Forces acting on the differential cable element. 


Dividing both equations in (12.1.1) by da and letting dz ++ 0, we obtain 
< T(x) cosO(2)=0, — <-T(x) sin (2) = w(2) 
qa J (2) cos A(x) = 0, ag 2 (#) sin (x) = w(2). 


Since there is no motion in the horizontal direction, the z-component T() cos 6(x) of the tension must be a constant, 
which we denote by To. Then T(2) = To/ cos 6@(a), and the latter equation leads to 


d d d 
ce T(x) sin O(a) = 7 EE) sin O(a) = e To tan 0(x) = w(x). 
Using notation tan 6(a) = —dy/dx, where the minus sign arises from assumption that the downward position is 


positive, we get 
d d? w(x) der 
ap tne) =- Fa ve) = = Fea) 


Since the deformation is motionless at end points x = 0 and x = @, we obtain the following two-point boundary 


value problem for a positive differential operator L[D] = —D? = —d?/dz?: 
d’y 
Gaz =F), = -y0) = 0, _ (4) = 0. (12.1.2) 


By successive antidifferentiation of the equation, we get 


ule) = cox +e e-2)-5 fae f assioy-} fae f asse 


where cg and c; are constants of integration. Upon changing the order of integration, we find 


x L 
ula) = eox ber (€-2)— 5 | (x~s) f(s) as—5 | (s — x) f(s) ds. 
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Imposing the boundary conditions y(0) = 0 and y(¢) = 0, we determine the values of constants co and cy: 


U Ui 
at=5 f 8 f(s) ds, cot = 5 f (€ — s) f(s) ds. 


Thus, the deformation function is given by 


e 
ula) = f oa, Fat, (12.1.3) 
0 
where Ae 
= 0<tK<a, 
G(2,t)= 4 ae 2) 
a r<et<, 


is called the Green’s function for this boundary value problem (12.1.2). 


Now we turn our attention to the general case. Our starting point is the linear two-point boundary value problem 
for the self-adjoint (positive if g(x) > 0) differential operator L[a,D] = —Dp(a#)D + g(x), depending on the derivative 
operator D = d/dz: 


d 


da 


(a) | t+q(x)y=f(x), — aoy(0) — ary’(0) = 0, Boy(2) + Biy’(2) = 9, (12.1.4) 


on the interval 0 < a < ¢. The functions p(x), p’(x), q(x), and f(a) are assumed to be continuous on the closed 
interval [0, ¢], and 
lao] + lai] >0, [Bo] + [Bi] > 0. 


There is nothing special for choosing the left end point of the interval (0,¢) to be zero: any interval can be scaled 
into this particular case. It is convenient to introduce two boundary operators 


Boly] = aoy(0)—ary/(0) and — Bey] = Boy(¢) + Biy'(4). 
Then the boundary value problem can be written in compact form: 
Lx, Dy = f, Boly| _ 0, Bely| = 0, (12.1.5) 


where L[x,D] = —Dp(x)D + q(x) with D = d/dx. We consider self-adjoint differential operators of the second order 
for two reasons. First, all eigenvalues of a self-adjoint operator are real numbers and eigenfunctions corresponding 
to distinct eigenvalues are orthogonal. Second, a nonself-adjoint differential equation can be reduced (see §4.1.3) to 
a self-adjoint counterpart. Moreover, if the boundary value problem generated by the differential operator L|x,D] 
and the boundary conditions Bo, Be is nonself-adjoint, then the corresponding Green’s function is not symmetric 
and its eigenvectors are not orthogonal. 

Throughout this section we assume that the corresponding homogeneous problem (L[{x,D]y = 0, Boly] = 0, 
By[y] = 0) has only the trivial (identically zero) solution; this means that 4 = 0 is not an eigenvalue of the boundary 
value problem. It then follows from the Fredholm alternative (see [40]) that the inhomogeneous problem (12.1.4) 
has a unique solution. 

Our next task is to construct the Green’s function explicitly provided that the fundamental set of solutions 
{¢(x), 7(x)} to the equation L[x,D]y = 0 is known. That is, we assume that L[x,D]¢ = 0 and L/x,D]y) = 0 and 
the functions ¢(x) and 7(x) are linearly independent; namely, the Wronskian W[¢, ~|(x) = ¢w' — ¢'w of these 
functions is not zero on the interval [0,4]. Moreover, p(x) W[¢,7|(x) is a constant for a self-adjoint differential 
operator L[%,D] = —Dp(a%)D + g(x) because Dp(x)D W(x) = 0 for D = d/dz, which can be verified by substitution 
(see exercise 4 on page 634). 

To apply the formulas from §4.8 (the variation of parameters method), we seek a particular solution of L[x,D]y = f 
in the form 


y(x) = c1(x) (x) + ca(x) ¥ (2), (12.1.6) 


where 


Sy 


x) - FOU) gig ___F@) 42) _ 
POW OY a) Wed] 
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Integrating the above equations, we obtain explicit representations of c1(a) and co(z) in the quadrature form: 


* -fO) vo) _ f? 1060 ao, 
| OW ax(e) = f pO Ws, Wy dt + ca, (12.1.7) 


where c; and cy are constants of integration. Substituting these expressions into (12.1.6), we obtain the solution to 
the boundary value problem (12.1.5). This solution is the sum of a particular solution y,(a) of the nonhomogeneous 
equation L[x,D|y = f and the solution yp(x) = c1¢(x) + c2y)(x) of the homogeneous equation L[x,Djy = 0. Using 
formulas (12.1.7), we can represent y,(x) in integral form (12.1.3), where the kernel 


c1(a) = 


= OLiricl ne 
Pp , 


p(t) W[¢, d](t)’ 


is the Green’s function for the nonhomogeneous equation L[xz,D]y = f. Since (x) and W(za) are arbitrary linearly 
independent solutions of the homogeneous equation L[x,D]y = 0, the function y,(x) does not need to satisfy the 
boundary conditions. It is assumed that the coefficients c; and cp can be chosen in such a way that the sum 
Y =Y¥p + yn will satisfy the homogeneous boundary conditions Bo[y] = 0, Be[y] = 0. However, if we set ¢(x) = y1(z) 
to be a solution of L[y] = 0 satisfying the boundary condition Bo[y] = 0 at « = 0, and ~(x) = yo(x) to be a solution 
of Lly] = 0 satisfying the boundary condition By[y] = 0 at « = ¢, then no determination of arbitrary constants 
is required because yp, as given by integral formula (12.1.3) for the Green’s function, automatically satisfies the 
boundary conditions since the Green’s function (12.1.8) does. 

Note that if we set A = Dp(«)D and consider the boundary value problem (12.1.5) in the space of functions that 
satisfy the homogeneous boundary conditions Bo[y] = 0, Be[y] = 0, then it will resemble the problem (qI-— A)x =b 
that we discussed in §7.2.1 for matrices, and the Green’s function becomes an analog of the resolvent. The next 
theorem summarizes some basic properties of the Green’s functions. 


Theorem 12.1: Let G(x,t) be the Green’s function for the boundary value problem (12.1.4) that is assumed 
to have a unique solution. Then 


(a) G(a,t) is continuous on closed square [0, ¢] x [0, 4]; 
(b) G(z,t) is a symmetric function, G(x, t) = G(t, x); 
(c) for each fixed t, the partial derivatives OG/Ox and 0?G/0x? are continuous functions of x for x F t; 


(d) at x =t, the first partial derivative 0G/Ozx has a jump discontinuity: 


lim aor t)— lim oe t)=—-—; 
ast+0 Oe *” =) 0107 et) 


(e) for each fixed t, the function G(z,t) is a solution of the corresponding homogeneous boundary value 
problem 


(f) there is only one function satisfying the above properties. 


Example 12.1.2: Consider the nonself-adjoint boundary value problem 


y"+y'-2y=—-f), (0) =0, y(1) =0. 


Since the characteristic equation \? + \ — 2 = 0 has two real roots \ = 1 and \ = —2, the homogeneous differential 
equation y’” + y’ — 2y = 0 has two linearly independent solutions ¢(7) = e” and w(x) = e~?*. According to 
Eq. (12.1.3), a particular solution of the inhomogeneous equation can be represented in the quadrature form: y,(x) = 
J, Go(a,t) f(t) dt, where the kernel is 
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Then the solution of the given boundary value problem becomes 


=f Go(a, t) f(t) dt + cid(x) + cop (a). 


We choose constants c; and cz in such a way that the above function satisfies the given boundary conditions y(0) = 0 


and y’(1) = 0. This yields 
“aye jet) dee tf #0 Jer tat Te? [50 ye! dt 


+ soe (eo * —e \f 10 aS [ ceos0 


where G(a,t) is the kernel of this integral representation. As we see from the above formula, the Green’s function 
G(a, t) is not symmetric with respect to x and t, but it satisfies the boundary conditions: G(0,t) = 0 and G,(1, t) = 0. 
Its first derivative with respect to x has a jump of discontinuity at x = t: 


lim a —(az,t)-— lim os — (x,t) =- =-—-1. 


2 
ret+0 Ox at—0 Ox 3 
Example 12.1.3: Consider the self-adjoint boundary value problem for the operator L[x,D] = —Dx?D + 6: 
d 9 d / 
ap (Pay Y) FOV =F) (L<2<5), yf) =0, (5) =0. 


The corresponding homogeneous equation x?y” + 2xy’ — 6y = 0 is an example of the Euler equation (see §4.6.2), 
which has two linearly independent solutions ¢ = x? and 7 = x«~3. The Wronskian determinant of ¢ and w is 


W|¢, v(x) = det ik Ee | 


While we can use Eq. (4.8.8), page 240, to obtain the explicit expression (12.1.8), we proceed directly to show every 
step. Using variation of parameters (see §4.8), we represent the general solution of the nonhomogeneous equation as 


1 m 1 e 
y(x2) = sof t 3 f(t) dt+ sos | t? f(t) dt+ cz? + cor 3. 
x 1 
Since its derivative is 
! 2 ° 3 Sey ae -4 
y(z)=Fe t f(t)dt— <2 t* f(t) dt + 2ca — 3ce92", 
e 1 
we get the conditions on constants c; and cz to satisfy the given boundary conditions: 
2 ps 5 
y'(1) = =| t° f(t) dt + 2c) — 3c. =0, (5) = sf t f(t) dt + e157 + 25° = 0. 
1 1 


Solving this system of algebraic equations, we obtain 


—l s 2 —3 2 : 4,-3 —1,2 
= —__ 2t t) dt = — — t) dt. 
aan | (3t° + ) f(t)dt, co om | (5 5-1?) f(t) dt 


Cy 


This allows us to represent the solution in integral form: 


y(x) = ° ae, 2) f(t)dt = =a? [ t-? f(t) dt + sof ere dt 


— [ [—a? (3t? + 2t~) + 10a~? (54¢-8 — 5~14?)] f() de, 
1 
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where the Green’s function is 


0. es ; 25° 
—342 — ——__ + 2t-8 —34-3 forl<t< 
G(e,t) =< 4 9377" T6885 | )+ gare tS, for = 
ee os 3 22° 
9377 x?t-3 = 46885 xt? a 0377 (bre-* = ar?) ; for x < t < 5. 


Calculations show that the Green’s function satisfies the boundary conditions, G,(1,¢) = 0 and G(5,t) = 0. Also 


. OG . OG 1 
ge eee ae ee 


To see that the Green’s function is symmetric, we choose y1(x) = 3a? + 2273 and yo(x%) = 2? — 5°23 as two 
linearly independent solutions of the homogeneous differential equation x?y” + 2xy’ — 6y = 0. Their Wronskian 
is Wly1, yo|(2) = 46885 2~? ¥ 0 on the interval [1,5]. Since yj(1) = 0 and yo(5) = 0, we can use Eq. (12.1.8) to 
construct the Green’s function: 


1 (Sate) GP +2b*); L<r< 
go t< 


t) = ——_ 
Ce.) = Team * | (554-8 — 12) (30? + 20-2), 


1. For each of the boundary value problems, determine whether there are infinitely many solutions, a unique solution, or 
there is no solution. 


(a) y’+y=0, y(0)=0, y/(2x) =1. (c) y'+9y=0, y'(0)= 
(b) y”+4y=0, y/(0)=2, y(m) =0. (d) y’+9y=0, y'(0)= 


2. Show that the homogeneous second order ODE aa(x) y” (x) + ai(x) y’(x) + ao(x) y(x) = 0 can be put into self-adjoint 
form by multiplying by the integrating factor 


and find the form of p() and q(x) in Eq. (12.1.4) in terms of the coefficients a2(x), ai(x), and ao(x). 


3. In each exercise, find an integrating factor y(t) needed to convert the given differential equation into self-adjoint form 
/ 
(H(t) y(t) + H(t) a(t) y = H(t) g(t). 


(a) y” + 4y’ + 4y = cos 2t; (c) t/?y"” 447/74! 4 er y = t; 
(b) ety” —y' =e”; (d) (sint)y” — (cost) y’ +y =t#?. 


4. Show that the product of p(x) and the Wronskian W[¢, w](x) of two linearly independent solutions ¢, 7) to the second 
order differential equation L[x,D] y = 0 generated by the self-adjoint differential operator L[x,D] = —Dp(x)D+ q(x) is a 
constant. 


5. Derive the Green’s function for the given two-point boundary value problems. 


(a) -y" = f(z), y(0)—y'(0) =0, y() =0. 
(b) -y"—-y=f(z), y(0)=0, y"(r) =0. 

(c) -y"+y=f(@), y')=9, y() =0. 

(d) -y"+y= f(z), y(0)—y'(0) =0, y4) + y') =0. 

(e) cy" +y'= f(z), yO) <0, y)=0. (f) (1—2?)y"—2ay'= f(x), y'(0) =0, y(b) =0, b<1. 


6. Find the Green’s function for the given differential operator L[x,D], where D = d/dz. 


(a) De?*D + e”*; (c) DaD—9/a; 
(b) Da®pD+1/4; (d) DaD. 
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12.2 Green’s Functions for Linear Systems 


In this section, we consider boundary value problems for scalar linear differential equations of the n-th order and 
their reformulations as first order systems. We start with the existence and uniqueness theorem of a two point 
boundary value problem for second order scalar differential equations [20]. 


Theorem 12.2: Let p(x), q(x), and f(x) be functions continuous on 0 < x < ¢, where g(x) > 0. Let ao, a1, 
Go, 21 be constants, where |ao| + |a1| > 0 and |8o| + |f1| > 0. In addition, suppose that 


0 < aoa, 0< Bofi, and |ao| + |Bo| > 0. 
Then for any values a and £3, the boundary value problem 
-y" —plz)y +a(z)y=f(t), O<arK<8, 


agy(0) — ary’(0) =a, Boy(2) + Giy(2) = B, 


has a unique solution. 


In Chapter 6, we saw that systems of first order differential equations form a conceptual framework that includes 
the theory of n-th order scalar differential equations. Now we show that two-point boundary value problems for 
scalar equations can be recast as problems for first order systems. 

Let P(x) be an (n x n) matrix with continuous entries on the interval [0, ¢]. Let f(x) = (f1(x),..-, fn(a))? be an 
(n x 1) column vector whose component functions f;(x), i = 1,2,...,n, are also continuous on [0, ¢]. In the language 
of Chapter 6, P(x) and f(a) are continuous matrix-valued functions defined on [0,4]. Let B!® and Bl! be given 
constant (n x n) matrices and a be a known constant n-column vector. 

Consider the linear inhomogeneous first order system subject to the given boundary conditions 


y (x) =P(a)y(e) +£(2), — BMly(0) + BYy(0) = a. (12.2.1) 


Without any loss of generality, the vector a can be chosen as zero. Indeed, let z(x) be any column function that 
satisfies the given boundary conditions 
Bz (0) + BMz(e) = a. 


Then the n-column vector u(#) = y(x) — z(x) is a solution of a similar two-point boundary value problem with 
homogeneous boundary conditions: 


u(x) = P(x)u(x) + P(x)2(x) —2'(x)+£(x), BOu(o)+BMu(e) =o. 


In the vector equation, P(x)z(x) — z’(x) + f(x) can be considered as a known column vector g(x) and the above 
problem can be written in the vector form with homogeneous boundary conditions: u’(xz) = P(«#)u(2) + g(x), 
Blu(0) + BMu(é) = 0. Now suppose that a fundamental matrix &(2) for the homogeneous equation is known, so 


d®/dz = P(x«)® and det (x) £0 on (0, 4). 
Using the variation of parameters formula (8.3.3), page 455, we find the general solution of the nonhomogeneous 


vector equation y’(#) = P(x)y(x) + f(x) to be 


pose et 5 B(x) 


x L 
| ®~!(t) £(t) at— [ ®~'(t) f(t) a. 
0 x 


where c is an (n x 1) vector of arbitrary constants. Now we choose this column vector c so that the boundary 
conditions in Eq. (12.2.1) are satisfied: 


1 . 1 
BO 6(0)c — 5 Boao) [ &~'(t) f(t) dt + BY G()c + 5 Bea [ @-1(t) f(t)dt=a. 
0 0 


This leads to the vector equation with respect to c: 


1 £ 
B#6(0) + B41 6(2)| c = — |B 6(0) — B4 a(2)| | S(t) f(t) dt+a. 
2 0 
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We rewrite this system of equations in vector form: 


Bc=b, (12.2.2) 
where 
B= B@(0) + BY a0) ism X n matrix, 
1 é (12.2.3) 
b=5 [B"@(0) 2 BMG(0)| | &-1(t)£(t)dt+a_ is n-colomn vector. 
0 


If the matrix B is invertible (det B 4 0), then equation (12.2.2) has a unique solution c = B~'b and the boundary 
value problem (12.2.1) has the unique solution 


y(x) = &(x) Bb +5 8(2) [ero f(t) af ®~'(t) f(t) dt 


= / * G(x, 1) f(t) de, (12.2.4) 
0 


where the kernel G(z,t) is called the Green’s matrix-function. If the matrix B is singular, then the vector equation 
(12.2.2) has either no solution or infinitely many solutions. The latter happens when vector b is orthogonal to any 
solution z of the adjoint homogeneous equation B*z = O (see §7.2.1). For a singular matrix B, the homogeneous 
boundary value problem y’(2) = P(x)y(x), B!ly(0) + Bl! y(@) = 0 has nontrivial solutions. 

If the matrix B is not singular (det B 4 0), then the unique solution (12.2.4) does not depend upon any particular 
fundamental matrix. Indeed, according to Corollary 8.6, page 436, any two fundamental matrices ®(2) and W(x) to 
the homogeneous vector equation y’ (x) = P(x)y(«) differ by a constant multiple. Therefore, there exists a constant 
n Xn matrix C such that ®(x) = W(x) C and det C 4 0. Then expressions 


®(7)B and &(x)6~'(t) 
do not depend on particular choice of the fundamental matrix, and ® can be replaced by ®(x) = W(x) C. Indeed, 
B = B60) + B41 G(0) = [B°lw(0) # Bw (0)| C. 
Then its inverse is 
B-=c! [B"w00) r BMY (0) on 
Hence 


®(x)B-! = B(x) Co! [Bw eo BMY) = U(x) [Bw +BUW(O)| 


Similarly, it can be shown that ®(x)®~1(t) and b do not depend on what fundamental matrix has been chosen. 


Example 12.2.1: Consider the boundary value problem (12.1.5), page 631, for the self-adjoint differential operator 
L|x,D] = —Dp(x)D + q(x), where D = d/dx. We reformulate this scalar problem in vector form by introducing a 


2-column vector - . 
y(x) = (y(x), yo(2))” = (y(2), p(x) y'(a)) 
Then the scalar equation L[x,D]y = f will be equivalent to the vector equation 


Fee l= lew “| BE -[eeal: 


The homogeneous boundary conditions apy(0) — a1y’(0) = 0 and Soy(£) + B1y’(2) = 0 are incorporated as follows: 


oP v@+ [5 ax/no] 2 = [ol 


Therefore, the corresponding matrices become 


P(z) = Fe “| Bole |‘ aa Bae (° 0 
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Example 12.2.2: (Example 12.1.2 revisited) Upon introducing the 2-column vector y = (y1(z), yo(x))? = 
(y(x), y’(x))*, we rewrite the two-point boundary value problem in the vector form: 


selma] = 2“) fae *Lrei)s &<2< 


oo PO +]y zfxar= [ol 


The fundamental matrix for the homogeneous system of first order equations is 


1 = 1 
Be) =e =e ™| *, 2 +3e ip where A= [) Al 


Since ®(0) =I, we have from Eq. (12.2.3) that 
_ plo] [1] _ {1 0 0 0] a_l 3 0 
a #0) =|) + [a 1]° ~3|2(e-e-2) e+2e-?|’ 


and 
a 3 0 ” souk 
cm 6 F (e?-e) -e-2 | i eo Ea 


Example 12.2.3: (Beam equation ) The beam deflection equation, considered in §11.3.1, is 
Un + C’Usees = a(2) (0<a< 2), 


where c is assumed to be a positive constant and q(z) is the distributed load. Application of separation of vari- 
ables method (or Laplace transformation) leads to an ordinary differential equation subject to cantilever boundary 
conditions: 


dty y(0)=0, (0) =0, 
at eu = f(a) (O<a< 2), y (2) =0, y’"(e) —0. 


In this equation, 4 is a positive constant depending upon the radian frequency of the periodic loading and the 
physical properties of the beam, while f(x) represents the strength of the loading at point x along the beam. The 
left end (x = 0) of the beam is assumed to be anchored, and the right end (x = £) is free. 

Now we rewrite this scalar two-point boundary value problem in vector form by introducing a 4-column vector 


of unknowns 
y(x) = (yi (x), yo(a), ys(x), ya(@))” = (u(x), y'(2), 9" (x), y'" (a))”. 


Then the scalar beam equation will be equivalent to the vector equation: y’ = Ay +f, where the square 4 x 4 
matrix A and the known column-vector f(z) are identified from the equation: 


yi (x) 0 1 0 0] Jy(z) 0 

d | yo(x) 0 0 1 OF J ye(x) 0 

lil (0-8 0 absent Yess 
ya(x) m0 0 Of Lya(x) f(x) 


Its general solution can be written (see §8.3.1) as 


Le a i 
y(x) = 7 i ee te\dr— 5 / el) tr) dre 8" <, 
0) x 


where ¢ = (c1, C2, €3,¢4)? is a column vector of arbitrary constants. Since the matrix A has four distinct eigenvalues 
Aig = tu and A3,.4 = +jpy, its fundamental exponential matrix becomes 
cosh pit + cos pt + sinhpt+<4 sinut = coshyt—2, cost Fy sinh wt — Z sin pt 
at 1 | wsinhpt — p sin pt cosh pt + cos pt + sinhut+<4 sinut cosh pt — % cos pt 
e 9 pw? cosh ut — 2 cospt yw sinh pt — p sin pt cosh pit + cos pt 7 sinh pt + a sin pt 


pw? sinh pt+ psint pw? coshyt— pw? cospt a sinh pt — p sin pt cosh pt + cos put 
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Then 
ee ne be is ao — )) 

—s 7) | H(cosh ele — 7) — cos (a — 7 ; 
y(z) [ f( ) Me (sinh p(x = +) + sin (ax — T)) d 

? (cosh u(x — T) + cos p(x — 7)) 

; sinh u(x — T) — sin p(x — T) 
= _s, f(r) (cosh p(x — T) — cos (x — T)) 
4u3 Jo, y? (sinh u(x — 7) + sin x(a — 7)) 
3 (cosh (a — T) + cos p(x — T)) 


dr+ec. 


The boundary constraints arising from the cantilever connections become 


U U 
C1 = a ; f(r) [-sinh(ur) + sin(ut)) dt, c2 = af f(r) [cosh(uuz7) — cos(ur)] dr, 


£ ¢ 
c3 = i i: f(r) [sinh(ur) + sin(ur)| dt, ca = -; | f(r) [cosh(tu7) + cos(wr)] dr. 


1. Show that the Green’s matrix G(z,t) for the two-point boundary value problem (12.2.1) with homogeneous boundary 
conditions (a = 0) is 


G(z,t) = 5 ®(x) (B80) 4 BS) _ [BS(0) = B(0)| ®~'(t) 
1 | B(x) 6-1 (2), 0<t<a, 
“2 | -@(e) S(t), o<t<e 


2. Rewrite the given boundary value problem as an equivalent boundary value problem for a first order system (12.2.1). 
(a) (x?y')’-6y=—f(z) 1<2<2, y'(1)=0, y(2)=0. 
(b) (xy')'—-9y/a=—-f(x) 1<a<3, y(1)=0, y'(3)=0. 
(c) (e- y')'-3e-**y=—f(z) O<a<1, yO0)=0, 2y(1)+y'(1) =0. 
(d) (ec y')'+2e%y=-f(z) O<a2<1, y(0)—2y’(0) =0, y(1) =0. 
3. In each exercise from the previous problem, determine the Green’s matrix for the corresponding first order system. 


4. In each exercise, you are given boundary conditions for the two-point boundary value problem (12.2.1), where 
_— {38 —13 —_ ly (a) _ {ay 
P = E | 5) y(x) ~ ia ? a= a2 . 


Note that the fundamental matrix for the corresponding homogeneous equation is given to be e®”. Form the matrix 
B from Eq. (12.2.3) and determine whether the boundary value problem has a unique solution for every f(a) and a. 


(a) yi(0) = a4, yo(m) = a2; (c) yi(0) — yo(0) = a1, y1(7) + y2(7) = a2; 
(b) yi(0) = a1, y1(7) = 22; (d) yi(0) + y2(0) = a1, yi(m) + y2(m) = ar. 


5. Show that the two-point boundary value problem 
y=Ay (0<t<1)  w(0)=1, yo(1) =2, y(1) =3, 


where 
yi(t) 1 2 0 
yi=|ywt)], A= ]0 -1 O], 
y3(t) 3 0 1 


has a unique solution and find it. 


6. Consider the two-point boundary value problem 


7 ! 


Upon introducing a new variable t = i a(x) dx, reduce the given differential equation to the constant coefficient one 
and solve the corresponding boundary value problem. 
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12.3. Singular Sturm—Liouville Problems 


In certain applications, we come across Sturm—Liouville boundary value problems that do not satisfy all conditions 
(see §10.1) to be qualified as “regular” problems. To be more precise, we consider the self-adjoint (positive if 
q(x) > 0) differential operator (10.1.8): 


L = L|x,D] = —D (p(x) D) + q(x), D=d/dz, (12.3.1) 


where p(x), p(x) and q(x) are real-valued continuous functions on [a, 6]. It is assumed that the operator (12.3.1) 
acts on smooth functions that are defined on the interval [a, b] subject to the boundary conditions of the third kind 


aoy(a) — Boy'(a)=0, — ary(b) + Biy'(b) = 0. (12.3.2) 


For every such operator (12.3.1) and a positive smooth function p(a), we consider the Sturm—Liouville problem that 
consists of the differential equation generated by the self-adjoint differential expression 


La,Dly = dp(a)y or —(p(x)y’)’+G(@)y=Av(a)y, a <a <b, (12.3.3) 


together with the boundary conditions (12.3.2). 


Definition 12.1: The Sturm—Liouville boundary value problem (12.3.3), (12.3.2) is said to be regular if the 


functions p(x), p’(x), g(x), 1/p(x), and p(a) are continuous, and p(x) > 0, p(x) > 0 on the closed interval [a, }]. 


Now we consider a certain class of boundary value problems for the differential operator L[x,D] in which coef- 
ficients may be zero at some endpoints. When one or both of the boundary points in a Sturm—Liouville problem 
goes to oo, or when the coefficient p(x) in the differential operator (12.3.1) diverges at an endpoint, the problem 
becomes singular. Usually the boundary condition at the singular endpoint is forfeited, imposing only the finiteness 
condition instead. The corresponding system is called a singular Sturm—Liouville problem on the interval [a, b] 
if one of the following conditions is fulfilled: 


e p(a) = 0 and boundary condition at « = a is dropped, but assumed y(a) < co; 
e p(b) = 0 and boundary condition at x = b is dropped, but assumed y(b) < co; 
e p(a) = p(b) = 0 and no boundary conditions; y(x) is assumed to be a square integrable function. 


Accordingly, there are four different situations each arising from the zeroes of p(x). The corresponding differential 
equations may contain two real parameters a and @ along with a nonnegative integer n, used to identify the 
eigenvalue. 


1. If the function p(x) has two distinct zeroes at end points « = a and x = b. Then by appropriate translation 
and scaling, the Jacobi differential equation is discovered: 


(1-27) y"+((8-a)—-(2+a4+B)z)y’tn(n+atB+1)y=0. (12.3.4) 


2. If p(x) has a single zero, the generalized (or associated) Laguerre equation (discovered by N. Ya. Sonin in 1880) 
emerges: 
ry’ +(a+1—2)y'+ny=0. (12.3.5) 


3. If there is no zero, but the interval is infinite, we find the Hermite equation: 
y” —2ay' + 2ny=0 (—o0o < x < oo). (12.3.6) 


4. If p(a) has a double zero, the Bessel equation is discovered: 


zy" + (ax+ B)y'—n(nta—1)y=0. (12.3.7) 


In the following, we discuss these four cases along with some of their applications in partial differential equations. 
However, instead of focusing on the general case of the Jacobi equation, we consider its two important particular 
cases—the Chebyshev and Legendre equations, discussed in §12.4. 
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Example 12.3.1: The boundary value problem 
(a? y')’ + (222 —n?)y=0, O<2<4, y(2)=0, 


is not a regular Sturm—Liouville problem but a singular one. For this equation, we have p(x) = p(x) = x? and 
q(x) =n. Since the functions p(a) and p(x) are not positive at 2 = 0, we have an example of a singular Sturm-— 
Liouville problem. 


12.3.1 Green’s Function 


Consider the equation 
zy” + a¢P(x)y’ + Q(x)y = 0, (12.3.8) 


where P(x) and Q(a) are real-valued analytic functions of a real variable about the origin: 
P(x) = po + pitt pra? +---, Q(z) =q+nr+ qa? +---, (12.3.9) 
with convergent power series expansions. For small values of x, the equation (12.3.8) resembles 
zu" + eP(0)u' + Q(0)u = 0. 


This is Euler’s differential equation, which we discussed in 84.6.2. Recall that solutions of the Euler equation are 
obtained by looking for functions of the form u(#) = «™. It is reasonable to assume that equation (12.3.8) has a 
solution close to x” for small values of x. In other words, near the regular singular point x = 0, we seek a solution 
of Eq. (12.3.8) in the form 


Co 
y(z) =2™[eo teeter +-.-)= S- aw, 
n=0 


where co # 0. Substituting this formula into Eq. (12.3.8), we get the indicial equation m(m—1)+ P(0)m+Q(0) = 0, 
which is assumed to have two real roots, m, > mz. We make the transformation y = 2’”'u and substitute it into 
Eq. (12.3.8); this yields 

i(m, — 1) +m, P(2) + Q(z) 


zu” + [2m, + P(x)\u’ + oreo u = 0. 


Let p(x) = 2m, + P(x) and q(x) = [mi(m — 1) +m P(x) + Q(x)]/x. The limit 


myi(m, — 1) +m P(a) + Q(x) — [mi(mi — 1) +m, P(O) + Q(0)) 


a g(a) ~ ot @ 
im im POR=PO 4 jn LO- OO 
x—0+ x x—0+ x 


= mP'(0+)+Q/(0+) 
is finite. So, we actually need to consider the equation 
cu” + p(x)u’ + q(x)u = 0, (12.3.10) 


where p and q are continuous in some interval containing the origin and p(x) is differentiable in it. Also, since 
1— P(0) = m1 + me, we have po = p(0) = 2m; + P(0) = 14 (m1 — m2) > 1. 

We multiply equation tu” + p(t)u’ + q(t)u = 0 by a function G(t, 2) and integrate with respect to t by parts from 
0 to x. This leads to 


x 


tu'G(t, )|j=0 - [wey — p(t)G(t, x)] u’ dt +f q(t)u(t)G(t, x) dt = 0. 


If the function G(,t) satisfies the following equation and the conditions: 


. [x G(a, t)] — p(z)G(a,t) = 1, G(t,t) =0, im, ¢ G(z,t) =0, (12.3.11) 
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then it is called a Green’s function, and 
wie ue)+ | (OCG Dulide (12.3.12) 
0 


To solve Eq. (12.3.11), we set Y(t) = tG(t,x), then the equation for Y(t) becomes Y’ — p(t)Y(t)/t = 1, or, after 
multiplication by an integrating factor u(t) = exp {- f PG) att, we get 


ro= | f mparec 


with C = [" y(t) dt to satisfy the condition G(x, x) = 0. Let po = p(0), then 


u(t) = en{-[ HO ach = ep {— [Math pf f PO=P ae} =t-”° F(t), 


where F'(t) = exp {- i se) at}. We can rearrange terms to obtain 
po fh FO) 


Ej 
where F(t) is a continuous positive function in the neighborhood of the origin because lim pe) bo = p'(0). Since 
=> 
F(a) is bounded in some interval [0, h], there exist positive constants A and B such that 0< A < F(x) < B in (0, Al. 


Then for po > 1 we have 
B * dé B ee 
G(a,t)| < ery ae -(<) 
ee) t §P0 A(po — 1) a 
ax 


B [*dé B 
oS} aS 
lee. < 5 f Z 4 "5 


forallO <t<a<h. If pp =1, then 


So the function G(a,t) is integrable and, in any case, for every x such that 0 < x < A the limit of the product 
xG(a,t) tends to zero: lim,-494 7 G(a,t) = 0. 

Now we have to prove that a solution of the integral equation (12.3.12) satisfies the differential equation (12.3.10). 
Assume that u(x) is a solution of the integral equation (12.3.12) for 0 <a<h. Then 


: ar. EO) f* alduceyers 
i NO caC ITO Ce | a 


ss 
o 
t 


om Fo 
u'(2) = a + [ q(t)u(t)Gue(t, x) dt = — seule) — p(x)u' (x), 
becaus 
—— tPo-1 F(z) {Po-1 F(z) 
G, (x,t) = —- FQ) aro” G,(a, 2) = = Gex(t, x) = F® aro p(x) 


Hence xu” + p(x)u' (x) + q(x)u = 0. 


12.3.2 Orthogonality of Bessel Functions 


We start with a famous Bessel function (see §4.9). Other examples of singular Sturm—Liouville problems and their 
solutions are provided in the next section. 

The Bessel functions J,,(z) are used to express eigenfunctions $,,(#) = Jy (av.nv/é) of the singular Sturm—Liouville 
problem 


d d 7 
eShge \ae | ha y=0, 0<a<8, y(0)<~m, y(4) =0, 
dx dx x 
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corresponding to the eigenvalues A, = Galt where Q,,,, is the n-th positive root of J,(z) = 0. Let us consider 
the parametric Bessel equation (4.9.8), page 249, containing parameters a and (: 


gy! +ary! +(a%2?—v?)y=0 and 2? y+ ay! + (622? —v*)y=0. 
We know from §4.9 that the functions u(#) = J,(ax) and v(#) = J,(Gx) are their solutions. Using Lagrange’s 
identity (10.2.3) on page 555, we obtain 


e 
(A? — a7) | x J,(ax) J,(Bx) dx = J) (al) J, (Be) — Jr(at) Ji (Be). 


Upon choosing af and (é to be zeroes either of the Bessel function (J, (a) = 0 and J,(80) = 0) or its derivative 
(J! (af) = 0 and J! (62) = 0), the right-hand side of the above formula vanishes and we get the orthogonality relation: 


e 
| x Jy(ax) J, (bx) dx = 0, ifaF B. (12.3.14) 
0 


The first few approximations to the positive roots of the equation J,(a) = 0 are given in the following table. 


2.404825557696 5.520078110286 8.653727912911 11.79153443901 


3.831705970208 7.015586669816 10.17346813507  13.32369193631 
For any positive a = 8, we have 
£ 
1 
uIP? = | vJ2(ax) de = 35 {lal I)(a0)) + (a2 -v*) R(at)}. (12.3.15) 
0 a 
The formula (12.3.15) is simplified when a is chosen to be a root of either J,(aé) = 0 or J) (al) = 0. The orthogonality 
property (12.3.14) allows one to expand an “arbitrary” function into the Fourier—Bessel series of order v: 


A A : ; 
f() =5— A, Jy (aor), =o f(x) JL(ra,) x dz, (12.3.16) 


r>1 
where {a;},.5 is a sequence of all roots of J,(aé) = 0 and || J,||? = PJ 41(a)/2. If {ar},.51 is a sequence of all roots 
of J/(aé) = 0, then expansion (12.3.16) is valid with the square norm ||J,||? = 3 G = 4) J? (al). At first glance, 
expansion (12.3.16) has only theoretical meaning with no chance to use it manually. However, a computer solver 
makes this task pretty manageable, as the following examples show. For instance, Mathematica and Maple share two 
dedicated built-in symbols, BesselJ and Bessel JZero/BesselJZeros, to evaluate the Bessel function and find its 


roots, respectively. MATLAB has two similar commands—besselj and besseljzero. Maxima uses bessel_j, while 
Sage utilizes the nomenclatures bessel_J, bessel_Y, bessel_I, and bessel K. 


Example 12.3.2: Consider the function f(x) = x(1—.2) on the interval [0,1]. We expand it into the Fourier—Bessel 

series with respect to two Bessel functions: Jo(a) and J;(x). Recall that Jo(0) = 1 and J;(0) = 0. Since f(0) = 0, 

we expect that the series (12.3.16) with respect to Ji(x) will give a better approximation to f(z). First, we calculate 

the square norms of the Bessel functions, corresponding to first six roots of J,(a) = 0: 

r=1 r= 2 r=3 r=4 r=5 r=6 
[a= 
A 


Then we calculate the values of coefficients A, (r = 1,2,...,5) in Eq. (12.3.16). Next we build N-th term 
finite sum approximation and plot this partial sum with N = 5 terms in Fig. 12.3. The mean square error of 
approximation with respect to the Bessel functions of order 0 is about As(Jo) ¥ 0.000111967, and with respect to 
the Bessel functions of order 1 it is much smaller, As(J,) * 7.21913 x 10-7. The fact that Jo(0) = 1 hinders the 
approximation at x = 0 with respect to Bessel functions of the zero order. 


! into the Fourier—Bessel series: 


i 1 © TD, (xOp 
(2) =A du(ear), A= f POE igs 
0 


= 2 2 
1l+z2 ee} 1+2 


Example 12.3.3: Now we expand the function g(x) = (1 + 2”) 


Since g(0) = 1/2 4 0, we expect a better approximation of the Fourier—Bessel series with respect to the Bessel 
function of order 0. Indeed, the mean square error of 5-term approximation with respect to the Bessel functions of 
order 0 is about As(Jo) © 0.0122404, and with respect to the Bessel functions of order 1 is much larger, As(J,) © 
0.0442942. 
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Figure 12.3: Example 12.3.1. The graph of the N-th partial sum approximations with N = 5 terms with respect to 
(a) Jo(x) and (b) Ji (x), plotted with Mathematica. 


Example 12.3.4: Consider a thin elastic membrane that vibrates in accordance with the two-dimensional wave 
equation 
Utt = oa (Uae + tig) ’ x + y” < a’. 


Here, a is the radius of a circular membrane; it is convenient to use wave equation in polar coordinates 
c 9 1 il 
U=C | Upp += Ur + = Uo | 5 O<r<a, 0<6< 2rn. (12.3.17) 
r r 


Assuming that the displacement u(r,6,¢) from equilibrium is zero at the rim, we get the following boundary condi- 
tions: 


|w(0, ,t)| < oo (bounded at the origin), 
u(a,6,t) = 0 (fixed edge), (12.3.18) 
u(r, 0,t) = u(r, 6 + 27, t) (single-valued function). 


We seek a solution of the wave equation (12.3.17) when the initial displacement and velocity are specified 
u(r, 0,t = 0) = fil(r, 9), ut(r, 0,t = 0) = fo(r, 6). (12.3.19) 
In accordance with separation of variables, we assume that partial nontrivial solutions have the form 
u(r, 0,t) = v(r, 0) T(E). 


Then we obtain the equation for T(t): 


T(t) +CAT(t) =0, 
and the following eigenvalue problem for v: 
10 Ov 1 0?u 
—_—_-— — _ = < 
a(t) + age 0 (0<r<a), 
subject to the boundary conditions (12.3.18). 


Let us assume again that 
u(r, 0) = R(r) O(8). 


Substituting this form of the solution in our equation and dividing by R(r) O(), we obtain 


TB gi a pa, ae. BE ay 
Rdr\' dr gee Rdr\' ar ee ee 


which leads to 
6” + WO =0. 


The function u(r, @) must be a single-valued and differentiable function, as should 0(6) be. Since 0(@) is a periodic 
function with period 27, i.e., O(@) = O(8 + 27), we get 4p = n”, a positive integer; otherwise, a nontrivial periodic 
solution does not exist. Therefore, 


0,,(0) = A, cosné + B,, sin nd, n=0,1,2,..., 
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2 


where A, and B, are some constants. Using this value of w = n*, we get the following singular Sturm—Liouville 


problem for R(r): 
1d dR n? 
eens — ——— = < 
Gakic ) Rr) 0 (0<r<a), 


r dr 


subject to the corresponding boundary conditions 


Introducing a new variable = V\r and setting R(r) = R (+) = y(x), we obtain Bessel’s equation of the n-th 
order: 


with the boundary conditions 


y (ava) = 0, |y(0)| < co. 


The only function that satisfies (up to an arbitrary constant multiple) the Bessel’s equation, which is bounded at 
the origin, is Jn(x) = Jn (rv). From the first boundary condition, it follows that J, (avX) = = 0. If it is the 
m-th root of the equation J,(j:) = 0, then 


ue 
ee n=0,1,2,...; m=1,2,..., (12.3.20) 

a 
is the sequence of eigenvalues. The eigenfunction corresponding to this eigenvalue is 


ust) 
Ram(r) =Jn | vr}. (12.3.21) 


Using Rnim(r) and ©,,(6), we form partial nontrivial solutions 


Un,m(1,9) = In (= ‘) (An m cosné + Bn m sinné) 


that we have to multiply by Tym(t), the general solution of T” + c?An mT = 0: 
Tn m(t) = An.m COS (CAn mt) + bn,m Sin (CAn mt) - 


Summing all nontrivial solution, we get the general solution of the wave equation (12.3.17) subject to the boundary 
conditions (12.3.18): 


(n) 
(r, 0, t) = cos nd So [Qn.m COS (CAnjm) t + bnjm Sin (CAn,mt)] In (= ) 


n>0 mel 


+ S- sin nd So [A n.m COS (CAn mt) + Brim sin (CAn,mt)] In (= *) 


n21 m21 


that contains four arbitrary constants. For their determination, we use the initial conditions: 


isi 
u(r, 0,0) = >> SS an.mcosnd + S © An.m sinnd Jn (= 3 r) = ntro 


m21 |n>0 nol 


ur 
tis(t,0,0) = >> Anim SS bn.m cosnd + S~ Bnmsinnd In (= 7 ‘) = fo(r, 6). 


mel n20 n21 
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The coefficients of the above Fourier-Bessel expansions can be obtained using orthogonal relations. For instance, 


1 Qr 1 a (n) 
An.m = - | sin nb dd ——,; | rdr fi(r,®) In pur). 
T Jo Il Jnl ” 


Now suppose that the initial displacement is f1(r,0) = (r — a)? sin 20 while the initial velocity is zero. These initial 
conditions dictate that dnjm = bnm = Bnym = 0, and Anm = 0 for n # 2. The coefficients Ag, must satisfy the 
relation 


= 
a 
bo 
5] 

ll 
a 
3 
Q 
3 
= 
| 
= 

bo 
bo 
= 
aN 
eS 
KS 
3 
Ne 


(=i) 2 cima (2) 
where H,(z) = be oF or (=) is the s-th Struve function and pm’ (m =1,2,...) are roots 
= T(kK+3)0(k+s4+3) 2 
k>0 
of the equation J2(j:) = 0. The square norm (12.3.15) is simplified as ||.J2||? = B(u ”) /2. 


1. Expand the function f(«) = 2? into Fourier—Bessel series (12.3.16) on the interval [0,1] with N = 5 terms with respect 
to the Bessel function of order 0 and order 1. Which of these two approximations gives better result? 


2. Expand the function g(x) = 1— x? into Fourier—Bessel series on the interval [0, 1] with N = 5 terms with respect to the 
Bessel function of order 0 and order 1. Which of these two approximations gives better result? 


3. Prove the Dini!” series expansion: 
(2) =o Ar Jn (rar), reenter [ f(x) In (rar) x dx, 
= (a2l2 — n? + h2€?) [In (are)] 
where a, is defined by the boundary condition hJn (al) +a J}, (al) = 0. 
4. Prove the Fourier—Bessel series expansion: 


2a? ¢ 
=co+ bone (rar), Cr = (22 — nr) [JolaO/ | f()Jo(xar) x da, 


ral 


£ 
where a, is defined by the boundary condition Jj (aé) = 0 and co = 4 [ f(x) ada. 
5. Consider the singular Sturm—Liouville problem 


—(y')' = Ar y(z) (0<a< 8), 
y(x), y'(x) bounded as x+> 0, y(t) = 0. 
Show that Ao = 0 is an eigenvalue of this problem corresponding to the eigenfunction yo(x) = 1. If \ > 0, show formally 
that the eigenfunctions are given by ¢n(x) = Jo (x./an/£), where a, is the n-th positive root (in increasing order) of 
the equation Jo(./a) = 0. 


6. By evaluating at x = 1, 2, 3 and performing the appropriate numerical integration, give empirical evidence that the 
following formulas may indeed be correct. You may want to show that these integrals satisfy the appropriate differential 
equations subject to corresponding initial conditions. 


'M7Ulisse Dini (1845-1918) was an Italian mathematician and politician born in Pisa. 
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a ese "opel een : _2 * cos (#8) 4 
(8) Jo(a) == | c0s (# e080) a (>) (ey =2 f BE g 


7. Evaluating numerically at x = 1, 2, 3, give empirical evidence that each of the following formulas may indeed be correct. 


(a) Jo(x) +2 yy Jor(x) = 1; (c) 2 So(-1 )* Jons1(x) = sine. 
k>1 k>1 

(b) Jo(x) +2 So(-1 )* Jon (a = COS Z; 
k>1 


For each of the functions in Exercises 8 through 22 on the specified interval [0, £], obtain 7°_, A; Jo (war), the five-term 
partial sum of the Fourier—Bessel series and graph it along with the given function. Then repeat the calculations for the 
Fourier—Bessel finite sum approximation ae B,J; («B8,) with respect to the Bessel function of the order 1. Which of these 
two approximations gives better result? 


<a<l, 
(x) 19. f(x) = 2" yeshe pla 2 
9. f(x) = Vz, 6=4; l-a, 1l<r<2, 
10. f(a) =e"*, = 2; 
ll. f(x) =e*/(1+ 27), 0=3/2; “" O0K< 1 
Fle) =e /(. +2"), 6=3/ eee ec 
12. f(x) = cos* a, € = 2n; 1, l<a<2, 
13. f(x) =1— V2, = 1/2; 
14. f(w) =a sina, 2= 7; a1. f(a) = sing, O<a<a/2, )) 
15. f(x) =e” sina, € = 7/2; (new) a 1, n/2Q<ga<n, ’ 
16. f(a) = |x —-1|, 2=2; 
17 f(x) =|sinz|, £ = 27; z, O0<2z<il, 
< 1, 22. = ? < »f= 3 
18. f(a) = x, O<a< 0=2: f(x) 1 l<a<2 3 
1, 1l<e<2, 3-2, 2<¢2<¢3 


23. Using separation of variables, solve the initial value problem for the wave equation in polar coordinates 
ru _ 2 Ou ldu, 1 Pu 
Ot? Or? or Orr? 06? 

u(a,0,t)=0, wu(r,6,0) = f(r.0), uz(r, 0,0) = 0. 


(O<r<a), 


24. Consider the steady state temperature distribution u(x, y) in semi-infinite strip 0 < # < co, 0 < y < a. Physically 
speaking, of course, this is an unrealistic idealization, perhaps concocted to simulate a rectangular configuration when 
one of the sides is much larger than another. Using sine-Fourier integral transform 


Sw) = [ Fle) sin (wo) de =2 [°F w) sin (we) du, 


solve the following Dirichlet problem: 
Una + Uyy = 0, ula, 0) — 0, u(a, T) = f(a), u(0, y) =0. 


25. Expand the function f(a) = x into Fourier—Bessel series (12.3.16) on the interval [0,1] with 5 terms with respect to 
the Bessel function of order 2 and order 3. Which of these two approximations gives better result? 
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12.4 Orthogonal Polynomials 


Let us choose a set of linearly independent integer powers q,(a) = x” (n = 0,1,2,...) and some interval (a, b), which 
can be infinite. For a fixed weight function p(x), this set {¢n(x)}ns0 is not orthogonal. However, we can construct a 
linearly independent set of polynomials p,,(a) (each of degree n, n = 0,1,2,...) that are orthogonal on that interval 
with weight p(a). Some famous orthogonal polynomials are summarized in the following table!!®. 


Legendre 
Chebyshev 


Chebyshev—Hermite 
Chebyshev—Laguerre 


These orthogonal polynomials (as well as many others) are eigenfunctions of the corresponding singular Sturm— 
Liouville problem, generated by a self-adjoint differential equation (12.3.3), page 639. These polynomials have 
numerous applications, and their properties have been intensively analyzed. In this section, we concentrate on only 
one very important property—orthogonality, followed by corresponding orthogonal expansions with respect to these 
polynomials. 


12.4.1 Chebyshev’s Polynomials 
The differential equation on the interval (—1,1) with a positive parameter 
2 


—_* y= 0 (12.4.1) 


d 
a ae 7 . ( { 2 ‘) 
(l—a*)y" —ay' +rA*y=0 or ie ary )+ — 


is called Chebyshev’s!!® equation. By imposing conditions at the end points for solution y(x) to be finite, we 
obtain the singular Sturm—Liouville problem that has eigenvalues A = n, integers. Any point from the interval 
(—1,1) is an ordinary point for Chebyshev’s equation. Thus, any series solution will converge for || < 1. Assuming 


that 
Co 
k=0 


substitution into Chebyshev’s equation (12.4.1) gives us 


So anya(k +2)(k + Dek ~ Yo auth —1)a Do oaks! +S aaa = 


k=0 
or = 
Zaz + aor” + [6a3 — a1 + Aa] e+ Ss [(k + 2)(k + Llagye(d? — k?)ax] v* = 0. 
k=2 


Equating coefficients of like powers of x to zero, we obtain the following relations, listed in the table. 


Power of x Coefficients Recurrence 

x 2a2 + A*ap = 0 or ag = —r* ao /2 

a 3+ 2a3 + (A? — 1)ay =0 or a3 = (1— A?) a1/6 
x" 4+ 3a4 + (A? — 2?)az =0 or a4 = —(A? — 2?) ap/12 
ak (k+2)(k+ lapse +(A2—k2)an =0 or Gere = - haha she 


'18Polynomials Pp (x) were introduced in 1785 by the French mathematician, Adrien-Marie Legendre (1752-1833), who made important 
contributions to special functions, elliptic integrals, number theory, and the calculus of variations. Other polynomials, Tn (x), Hn(x), and 
LIn(ax) were defined by the Russian mathematician, Pafnuty Chebyshev (1821-1894), in 1859. A generalization of Laguerre polynomials 
were discovered by Russian professors Yu. V. Sokhotskii (1842-1927) and somewhat later Nikolay Sonin (1849-1915). In 1864, the French 
mathematician Charles Hermite (1822-1901) studied polynomials Hy (x), and the French mathematician Edmond Laguerre (1834-1886) 
analyzed Ly (x) in 1879. See dlmf.nist.gov for other special functions. 

119Pafnuty L. Chebyshev (1821-1894), professor at St. Petersburg University, was the first to study solutions of this equation. The 
collected works of this eminent savant are available in Russian and French. 
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From the recurrence relation 
\? — k? 


(k+2)(k+1) 


we can determine the values of all coefficients by its direct substitution to be 


akg = — ak, 


N2(A2 — 22)(A2 — 42) «+. (A2 — (2m — 2)?) 


— f_1\ym 

dam = (1) (m)! 90, 

a (0? —19)(92 ~ 3%) (02 — 2m —1)%) 
—1 — 3*)--- —(2m-1 
Q2m+1 = (—1) a. 
(2m +1)! 

When A = n is a positive integer, one of these recurrences will terminate depending on the parity of n. In this case, 
the coefficients an42, Qn44, ..- are all equal to zero. Then we obtain a polynomial of degree n as one of the solutions 


of Eq. (12.4.1). If we set the leading coefficient a, of this polynomial to be 2"~+, then a polynomial solution is called 
a Chebyshev polynomial of the first kind and denoted by T,,(x). The letter T is used because of the alternative 
transliteration of the name Chebyshev as Tchebycheff (actually, Chebyshev himself used five distinct spellings with 
Latin letters and two Russian spellings). 

It is customary to define Chebyshev’s polynomials via either the following formula: 


Tn(x) = cos(narccos x) (\z| < 1) (12.4.2) 
or the recurrence relation (for arbitrary 2) 
Tn(x) = 24 Th_i(x) — Th-2(2), To(x) =1, Ti(x) =a. (12.4.3) 
Let us list the first few Chebyshev polynomials: 
To(a) =2a¢7—1, T3(x)=423—32, Ty(x) =1—8e74 82", T(x) = 16x° — 20x? + 5a. 


Chebyshev’s polynomials possess so many remarkable properties and can be defined in many ways that we simply 
cannot present them (see [11, 34]). The reader can find some of their properties in the exercises. We provide here 
the orthogonality property: 


. ae a dr _f &, n #0, 
[Pola Tala) ae = 0, meén, [ nom -{? 


Chebyshev’s polynomial of the second kind, denoted by 


sin[(n + 1) arccos 2] 


U,(x) = for |x| <1 
(x) — a 
are solutions of the following boundary value problem 
(l—2?)y” —32y' +n(n+1)y=0, U,(1) =n+1=(-1)"U(-1). (12.4.4) 


The Chebyshev polynomials of first and second kind turn out to be the best choice for most applications, mainly 
due to the good convergence properties of the corresponding series: 


f(z) = S + a Cn Tn (2), =? f « f(a )as- =f f (cos @) cos(né) dé. (12.4.5) 


We isolate the coefficient co because ||T;,|/? = 7/2 if n 4 0 and |/TZo||? = 7. For Chebyshev’s polynomials of the 
second kind, we have an expansion similar to Eq. (12.4.5): 


=" aU, (e), =2f f(x) Un(a) V1 — 2? de. (12.4.6) 
n20 


To invoke the Chebyshev polynomial, the latest version of MATLAB uses the commands orthpoly: : chebyshevi (n, 
x) and orthpoly::chebyshev2(n, x) for these polynomials of the first and second kind, respectively. Mathematica 
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and Maple share almost the same name ChebyshevT[n,x] and ChebyshevT(n, x), respectively. Also, Maple has 
a special command chebyshev(f,x,eps) (when the package numapprox is invoked) to expand the function f into 
series with respect to Chebyshev polynomials. Mazima uses chebyshev_t(n, x) and chebyshev_u(n, x), while 
Sage utilizes chebyshev_T(n, x) and chebyshev_U(n, x). SymPy uses chebyt(n,x) and chebyu(n,x). 

There is an open-source special package, called Chebfun, written in MATLAB, for numerical evaluation of Cheby- 
shev expansions with functions to 15-digit accuracy. It was proposed in 2002 by the famous British mathematician 
Lloyd N. Trefethen and his student Zachary Battles. The mathematical basis of the system combines tools of 
Chebyshev expansions, fast Fourier transform, barycentric interpolation, recursive zerofinding, and automatic dif- 
ferentiation. 


Example 12.4.1: We expand the function g(x) = (1+ x7)~' into the Chebyshev series (12.4.5) on the interval 
[—1, 1]. To achieve it, we calculate the first few coefficients in the series (12.4.5): 
1 f) g(a) i ey. 17 C6 99 
o=- 8 _dg=-—=, @ =4-372, ==—-12, 2=70-—. 
0 i: vi— ve v2 2 v2 2 v2 
Note that odd numbered coefficients are all zeroes. The Chebyshev truncated series with 4 terms (which is a 
polynomial of degree 6), 


+ co To(x) + c4 Ti(x) + ce To (2), 


gives a very accurate approximation of g(x): it has a mean square error of about 1.71413 x 10~®. Next we use 
expansion (12.4.6): 
ug(x) = 2(V2 — 1) + ag U2(x) + a4 Ua(x) + a6 Ue (2), 


where 


ag =2(7-5v2), aq = 2 (29V2— 41), ag = 2 (239 - 1692), 
: _ 2 lA 2 
since ag = =| Sa 


1+ a? 
because it has a mean square error of about 1.84542 x 107°. 


= 2(/2—1). This finite sum u¢(x) also gives a very good approximation to g(x) 


12.4.2 Legendre’s Equation 


The closest relative to the Chebyshev polynomial of the second kind is the Legendre polynomial, conventionally 
denoted as P,(«), which is an eigenfunction of the singular Sturm—Liouville problem: 


~ ja — 29) | +Ay=0 (-l<a<l1), y(-1)<o, y(1) <o. 


Setting » to be equal to the eigenvalue \ = v(v + 1) (v > 0), we obtain 
(1—2?)y” — Qary' + v(v +1)y =0, (12.4.7) 


where v is a nonnegative real number. Despite the fact that this is a differential equation of order 2, it is known as 
Legendre’s equation of order v. Eq. (12.4.7) is frequently encountered in physics and other numerous problems, 
especially in those exhibiting spherical symmetry. 

These polynomials have a default script in all software packages: MATLAB — orthpoly :: legendre(n, x); 
Mathematica and Maple—LegendreP[n, x] and LegendreP(n, x), respectively, and Maxima-—legendre_p(n,m,x). 
Note that Maple has short-cut P(n,x) when the package orthopoly is invoked. SymPy uses the same command as 
MATLAB to evaluate the Legendre polynomials. Sage utilizes legendre P(x,n). 

Legendre’s equation has regular singular points at « = 1, = —1, and x = ov, and ordinary points elsewhere. 
For simplicity, we consider only the case when v = n, a positive integer. A solution of Eq. (12.4.7) has a Maclaurin 
expansion that converges when |x| < 1, so we set 


y(x) = S- ap x”, 
k=0 
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We substitute the series and its first two derivatives into Eq. (12.4.7) to obtain 


k(k — 1) a, 2 #20 5 kona 14 +v(vt+1) >> a,2* = 


la: 


(1—2”) 


> 
II 


2 


or 
ym —1) ) ay 2*- 75" at —l)agez 3 akan +m (v + 1) az, x® = 0. 


By shifting the index of summation in the first series in this expression, we find that 


co 


(+2)(b-+ 1agyon*— 5° k(k—1aga* — > 2ka 2 +m (v+1)a,2* =0. 
k=2 k=1 


Me 


> 
ll 
3° 


Next, we collect like terms to obtain 
1-2ag+v(v +1) ao + [2-303 —2a, +V(V+ 1) ai] x 


9 {(k+2)(k +1) dpye — k(k — 1) ag — 2kay + v(v +1) ax} 2* =0. 


k=2 
Setting the sums of the coefficients of like powers of x equal to zero gives!”° 
1-2a2+v(v+1l)ao = 0, 
2-3a3+(v+2)(v—l1)a, = 0, 


(kK+2)(kK+l)agpget(vtkt+1)\(v—k) ap 


| 
i) 
> 
W 
iw) 


(12.4.8) 


These relations show us that all coefficients a, from k > 2 can be determined onward in terms of ap and a,, which 
leads to 


peaaeas jt - Wee 1) nan we sys) o—...] 
v+2)(v—1) (v + 4)(v + 2)(v — 1)(v —3) 


Although the bracketed series in this solution are rather unwieldy, it could be shown that if v is not an integer, then 
each of them converges when |x| < 1 and diverges when |x| > 1. The series also diverges when x = +1, though this 
is not easy to prove. When v is an integer, however, one of the series terminates (depending on the parity of v) and, 
therefore, is a polynomial. This polynomial P,(x), called Legendre’s polynomial, is uniquely defined by setting 
P,(1) =1. This condition defines the coefficient ao to be 1 and the coefficient a; to be zero. 

Legendre’s polynomials diverge at the singular point z = oo. The efficient way to define Legendre’s polynomials 
gives!*! Rodrigues’s formula: 


[n/2| 
1.- id” ; 1 n\ (2n—2k\ 
Pe) = oa an —1)"= > e() ( A ) maa (12.4.9) 


The first six Legendre polynomials are seen to be 


Po(x) = 1, Pi(x) =a, P(x) = 5 (32? — 1), P3(x) = = (52° — 32) , 


oe 


1 1 
Pala) = 5 (35a* — 3027+3), P5(x) = 5 (632° — 702° + 152) . 


120We use the relations v(v + 1) — 2 = (v + 2)(v —1) and —k(k—1) —2k+v(v 41) = v(v +1) —k(kK41) = (v + k4+1)\(v—k). 
!21Benjamin Olinde Rodrigues (1795-1851), more commonly known as Olinde Rodrigues, was a French banker, mathematician, and 
social reformer; he derived this formula in 1816. 
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-1.5 -1 -0.5 ie) 0.5 1 1:5 


Figure 12.4: Graphs of the Legendre polynomials P,, (a) for n = 2,3,4,5, plotted with MATLAB. 


The Legendre polynomials are interconnected by the following relations, known as recurrence formulas: 


< a Pa) a Pe, (12.4.10) 
P) (a2) —2P._4(a) —n Pp_-i(z) =0, (12.4.11) 
(n+ 1)Pr4i(x) = (2n + 1)a P,(x) — nPp_i(a). (12.4.12) 


The latter difference equation is well suited for computations. Some important quadratures of Legendre polynomials 
can be evaluated with the aid of the following theorems (their proofs are left as exercises, but also can be found in 
[29]) . 


Theorem 12.3: 


u ap 5 0, ifm<n 
= m+1 2 
/ v™ Py (a) dar PSEC eae ge ae 


Theorem 12.4: 4 
0 ifmAn 
IP ((@)P2 (a8) Cle = { aes: : 
i “ ifn=m. 


2n+1? 


Corollary 12.1: Let q(x) be a polynomial of degree less than n, where n > 1. Then 


/ P, (x) q(x) da = 0. 


-1 


Theorem 12.5: The Legendre polynomial P,,(#) has n distinct zeros in the open interval (—1, 1). 


The orthogonality property of Legendre’s polynomials (Theorem 12.4) leads to an orthogonal expansion: 


He~D+SE+) a, Pe), ag =F? f pea) Pula) (12.413) 


2 2 
n>0 
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Example 12.4.2: (Example 12.4.1 revisited) | Expanding the function g(x) = (1+ 2?)~' into the Legendre 
series (12.4.13) and keeping only 6 terms, we obtain 

Ig (x) =ao + a2 P2(x) + a4 P,(x) + a6 Pe(x), 


where 


ao = = a= — - =, A = — - —. 


ee ee ee ee 5 4 


Its mean square error is about 0.0333262. 


0 dx 3-7 _ 17% 20 _ 161 41n 


Example 12.4.3: Imagine a solid ball of radius 1. We introduce the spherical coordinates (r,6,¢) in this ball so 
that the origin coincides with the center of the ball. Let steady state temperature be rotationally invariant, so it 
does not depend on the azimuthal angle ¢. If the temperature inside the ball is denoted by u(r, 4), then it will satisfy 


Laplace’s equation 
1 0 ( ,0u 1 Ou 
— — +— | sind — } =0. 12.4.14 
r? Or ¢ oe) +a (sino =) : ( ) 


We seek partial nontrivial solutions of this equation in the form u(r,6) = R(r)O(6). Substituting into (12.4.14) 


gives 
d ( dR(r) 1 d/. ,d@\_ 
ea (r a ) +R 5-H (sme GZ) = 


id /(,dR(r)\__ 1 1 d/( do 
"ar J) Osme@ do\o" 

The right-hand side depends only on @ and the left-hand side depends only on r. We conclude that both sides are 
equal to a constant that we denote by A. This leads to two separate equations, one for R(r) and one for 0(6). We 


start with the latter: 


Separating variables, we get 


1 d/. ,dO 


We make the change of variables 
x =cos8, y(x) = O(6). 
With standard identities 
9 5 d dé d 1 od 
sin*O@=1-2 a 


we convert our equation into 


d dy 
“i=. 2 = 
= (( oy) + ay 0 


This is equivalent to Legendre’s equation (12.4.7). Its eigenvalues are \ = n(n + 1), where n is nonnegative integer, 
and corresponding eigenfunctions are y(z) = P, (x) = P,,(cos6@). 
Our next task is to solve the equation for R(r): 


4 (eS) = n(n +1) R(r). 


Since this equation is of Euler’s form (see §8.1.1, page 437), it has a polynomial solution 
Ralr) =car® +d,r-* + for n > 0. 


Here, of course, c, and d,, are some arbitrary constants. Since the function R,(r) must be bounded at r = 0, we 
must set d, equal 0. When n = 0, the general solution becomes 


Ro(r) = co + do Inr. 


Again, we set dg = 0 because the logarithm function is unbounded at the origin, and obtain the sequence of 
eigenfunctions R,(r) = cpr", n = 0,1,2,.... Putting this information together with our solution in 6, we find 
partial nontrivial solutions of Laplace’s equation to be 


Un(r, 0) = Cnr” Pp (cos 8), n=0,1,2,.... 
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Now we invoke the familiar idea of the Fourier method, and write our general solution as the sum over all possible 
partial nontrivial solutions: 


u(r, 0) = S- Cnt” Pp (cos 6). 


n>0 


For Dirichlet boundary condition, the function u(1,6@) = f(@) is specified, so we need to determine the values of 
coefficients c,, from the Fourier—Legendre expansion: 


u(1,0) = f(@) = S> en Pp(cos 6). 


n>0 


Using Eq. (12.4.13), we can determine coefficients c, accordingly. 


Legendre’s differential equation has two linearly independent solutions. If vy = n is a positive integer, one solution 
is the Legendre polynomial (12.4.9). Another solution, having a logarithmic singularity at points +1, is called the 
Legendre function of the second kind. It may be defined recursively according to Eq. (12.4.12) with the 
following first few functions: 


1, l+2 z,1+2 3a2-1, l+ae 32 
C5 a eg I gg 


12.4.3 Hermite’s Polynomials 
The classical one dimensional harmonic oscillator in quantum mechanics is described by the Schrodinger equation 


2m 


"+ SF (E- V6) b =0, 


where 7(&) is the state of a particle of mass m in the potential V(€) with energy EH. The constant hi = h/(27) = 
6.626 x 10~°4m?kg/s is called the reduced Planck constant, or Dirac constant. We will suppose that 7 depends only 
on the position €, and that the potential V is defined by V(€) = Ee, which corresponds to the elastic force —ké. 
Hence, we get the equation 


2m k 
w+ sy (e-$e) v= ’ 
and the values of energy F must be determined from the condition that the solution is bounded in the whole line 


—co < € < ow. Let us introduce new parameters: 


2 mk 2nE 
a= ap = 72 


(a > 0). 


The constant a? is known, but pu is a parameter to be determined by solving a corresponding Sturm—Liouville 


problem. So we get 
w" + (ap — 07é) p =0. 
By introducing a new independent variable x = €,/a, we reduce the above equation to the following one: 


py" + (u— 2?) p =0. (12.4.15) 
This linear differential equation has an irregular singular point at infinity (a = oo). If we seek a solution as a product 
(a) =e Py(2), 
then y(x) must satisfy the differential equation 
y” — 2ry' +2d0\y = 0, -co<@<m, (12.4.16) 
which is called Hermite’s equation. Here, we set 2A = yw — 1. Eq. (12.4.16) can be rewritten in self-adjoint form: 


d ee dy nee 
— — 2 *y=0. 12.4.1 
ae le =| +2re~" y=0 ( 7) 
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The Chebyshev—Hermite polynomials, or simply Hermite polynomials, are defined as the eigenfunctions of 
Eq. (12.4.17) on an infinitely straight line —oo < x < oo that tends to infinity not faster than a polynomial as 
Z— oo. 

Since Eq. (12.4.16) has no singular points in the finite plane, x = 0 is an ordinary point of the equation. We 
shall look for a solution in the form of the power series 


y(x) = S- a; x*, 
k=0 
Substitution into Eq. (12.4.16) leads to 


[an4o(k + 2)(k +1) — 2kay + 2ra,] 2* = 0. 


aE: 


> 
ll 


0 

Hence, we obtain the recurrence for its coefficients to be 
2(k — X) 

(3 ae RST a: 

2 (k+2(k+D * 


The coefficients a9 and a, are arbitrary and all others can be found from this difference equation. Thus, we have 
the general solution: 


. 2*(-A)(2— A) +--+ (2K -2—-A) op 
y(x) = ao fre Mee Oe a 


21 — AML -A+2)---(L-A+I2K-2) ony 
- (Qk +)! 7 

k=1 
valid for all finite x. If X = n, a positive integer, then this solution always has a polynomial solution. If n is an 
even integer, the multiple of ag contains a terminating series, each term for k > (n+ 2)/2 being zero. If n is an odd 
integer, the multiple of a; is a polynomial because each term is zero for k > (n + 2)/2. So the result is 


n 
x d —x 


dam”? 


ne. 
ra 
SS" 
> 
= 
ax 
© 
al 
3 
| 
i) 
> 
| 
as 
8 
iw) 
Foe 
bo 
—— 
8 
| 
| a. 
NN _ 
3 
“i 
8 
i] 
ia 
bo 
| 
Faia 8 
| 
~~" 
3 
o 


in which |4]| stands for the greatest integer < 4, called the floor of $. The polynomial H,,(x) is the Hermite 


polynomial. Using this relation, we evaluate the first few polynomials: 
Ho(t)=1, Hi(x)=22, He(x)=4a?-2, H3(x)= 823-122, Hya(x) = 16a* — 48a? + 12. 
It can be shown that the Hermite polynomials satisfy the differential equation: 
Hi (2) =2nH,ilz) or He) = 22 A,(@) — Anji), 
from which follows the recurrence relation: 
Ayn41(x“) — 2a H, (x) + 2n Hy-1(a) = 0 WH 12 ais 


Next, differentiation shows that H,,(x) are eigenfunctions of the Sturm—Liouville problem (12.4.16) corresponding 
to eigenvalues A, = n. The functions w,(xz) = H,(x) ew /2 (n = 0,1,2,...), called Hermite functions, are 
eigenfunctions to the differential operator —D? + x”, so that —w!!(x) + xu, (x) = (2n 4+ 1) dn(z). 


The Hermite polynomials form an orthogonal system with weight p(x) = e7 : 


a 


[Psoorsteren [ tnirttorsen Sauge HMEe 


This allows us to define the following expansion: 


f@o=S ae"? h@, a= FEE [. f(x)H,(a)e~” /? de. (12.4.18) 


k>0 
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In MATLAB, Hermite’s polynomial can be defined by orthpoly: :hermite(n, x), while Maxima uses a similar 
command hermite(n,x). Maple and Mathematica share the same script: HermiteH(n, x) and HermiteHIn, x], 
respectively. Note that Maple has a short-cut for this polynomial: H(n,x) when the package orthopoly is invoked. 
Sage and SymPy utilize hermite(n,x). 


Example 12.4.4: (Example 12.4.1 revisited) The function g(x) = (1 +27)! can be expanded into the 
Hermite series (12.4.18), where coefficients c,, are evaluated only numerically: 


ol [- Hit) 
ca Wni/ar J_4 142? 


Since c24.41 = 0 (k = 0,1,2,...) and other coefficients with even indices decrease pretty fast, 


en? /2 de, n=0,1,2,.... 


co © 0.9272709, co ¥ 0.0116536, cy © 0.00674567, ce © 0.0001393194718, 


finite sums with relatively small number of terms give good approximations. For instance, the mean square error of 
such approximation with 11 terms (five of them are identically zero) is about 0.00608485677, while with 31 terms, 
it is about 0.0013093589. 


Hermite approximation with 11 terms Hermite approximation with 31 terms 


12.4.4 Laguerre’s Polynomials 


The differential equation 
ey” +(l—a)y'+Ay=0, 0<2<00, (12.4.19) 


is called Laguerre’s equation. We rewrite this equation in a self-adjoint form: 


d _, dy Ree 
+ [re st] 4 26 y=0, 0<4< om. 


The Laguerre polynomials are eigenfunctions of the latter equation subject to the following conditions: the solution 
should be bounded at x = 0 and tend to infinity not faster than a finite power of x as x —> oo. 

The point 2 = 0 is a regular singular point of Eq. (12.4.19) with indicial equation o? = 0. Since it has a double 
root a1 = 02 = 0, it is natural to look for a solution as a Maclaurin series 


y(x“) = S- a, x*, 
k=0 
Substituting this series into Eq. (12.4.19) yields 


So [anzik(k +1) + ap4i(k +1) — kag + Aax] c* = 0 


lee) 
k=0 
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or 
ee) 


S- [anti(k + 1)? — ay(k — A)] 2* = 0. 
k=0 


We obtain the recurrence for its coefficients to be 
k—xX 


=>— k=0,1,2,.... 
Qk+1 (+1) Qk; 0,1, 2, 


If A = nis a positive integer, Eq. (12.4.19) has a polynomial solution. Choosing ao in such a way that the coefficient of 
the highest power of «* is equal to (—1)", we obtain the Chebyshev—Laguerre, or simply Laguerre polynomials: 


k=0 
They can be defined in the form: 
d tet 
L,(x) = . we ) n= 1,2, 
x 
We list the first six polynomials: 
Lo(x) = 1; 3! L3(x) = —a° + 9a? — 182 + 6; 
Iy(a) = -a +1; Al La(x) = x* — 16x? + 722? — 96x + 24; 
2! Lo(x) =a? —4e+2; 5! L5(2) = —2° + 250+ — 200x7 + 600x? — 600x + 120. 


Laguerre polynomials form an orthogonal system with weight e~* on [0, co): 


oo _ _ 0, ifm x n, 
| Lm (x) Ln (a) e~* dx = { (n!)?,  ifm=n. 


Since the orthogonality relation contains multiple (n!)?, it is convenient to consider normalized polynomials 


ej eis, 
n! n! dx” 


This allows us to find coefficients in the Laguerre expansion: 


f(x) = S- Cn ln(2), 0<4< oa, (12.4.20) 


n>0 


of a smooth function f(x) on the semi-infinite interval (0, co): 
Cn = | f(x) l,(x)e~* da, n=0,1,2,.... (12.4.21) 
0 


Mathematica has a default command for the normalized Laguerre polynomials: LaguerreL[n, x]; Maple uses 
L(n,x) when the package orthopoly is invoked. Similarly, MATLAB uses laguerreL(n, x) and Mazima has 
laguerre(n, x). SymPy utilizes laguerre(n,a,x), where a = 0 corresponds to the ordinary Laguerre polyno- 
mials. 

and so on c2 & 0.123969, cz = 0.0497043, ca ~ 0.0134013, c5 ~ —0.00378451, cg » —0.011112, .... The above 
figure shows approximation (in blue) of the function f(x) = (1+ x*)~' (in black) by partial Laguerre sum with 
N =7 terms. 


1—<at 1 
1. Using generating functions gr(z,t) = = 


———- d t) = ——————_- sh that 
a a ie rere 


Talo) = 5 [(# + \/ x? i). | (x x? it) 
Un(a) |(#+ve=1)"" - (e-veF=1)""). 


1 
2a? —1 


12.4. 


Example 12.4.5: Now we expand the function f(x) = 
(1 + x”)~' on the interval (0,00) using Laguerre polyno- 
mials. 
the expansion (12.4.20): 
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First, we calculate a few of the first coefficients in 


co pe 
= da + 0.62145 
Co | 1 er) x ’ 


_— | (1 = 2) de = 0.278072, 
0 +a 


. Show that for nonnegative integers n and m, 


1 


Ta(2)Tm(a) = 5 [ 


Trjl@) + Tinie) - 
Show that both Chebyshev polynomials, T,,(x) and Un(x), satisfy the recurrence 


Yn41 = 22Yn —Yn-1, n=1,2,.... 


. Prove the following recurrence formulas: 


(a) 12,1(x) — Ta(x) Tn+2(e) = 1-2”, (b) (1= 2?) Ti(2) =n (Tne) — eTa(e)). 


Show that the Chebyshev equation (1 — a”) y” — xy!’ +n?y =0 can be reduced to the harmonic equation y” + n?y = 0 
using substitution x = cos 0. 


Using the exponential generating function g(z,t) = exp ( (2axt - -) =») H,( , prove the recurrence for Hermite 
n>0 


polynomials: Hn+i(@) = 2% Hn(x) — 2n Hy_-1(2). 


1 t 
Using the generating function for the Laguerre polynomials g(x,t) = 77 &P ‘ = -} = .y L,,(a)t", prove the 
n>0 


recurrence (n + 1) Ln4i(a) = (2n +1— 2) Ln(x) —nLy_1(2). 


8. Prove the recurrence relations (12.4.10) — (12.4.11). 


9. Using the generating function 


10. 
11. 
12. 
13. 


14. 


15. 


Gt Pr ( 
gle, t) = 7S ee 
prove the recurrence (12.4.12). 
Prove the formula (x? — 1) P(x) = na Pn(x) — n Py-1(a). 
Prove the formula (a? — 1) Pi(a) = —(n + 1)a Pp(x) + (n +1) Proi(a). 
Prove the recurrence for Legendre’s polynomials: x P/(x) — P,_1(x) — n Pn(x) = 0. 
Use the recurrence relation for the Legendre polynomials to prove that for all integers n > 0: 
(a) Pa(-1) = (-1)"; 
0, n is odd, 
(>) PaO) = { CLyywra 298000 


Tien? mis even. 


Show that ‘ 


0, ifn>0, 
Pa(z)ae = 2, ifn=0. 


Let w = (x? — 1)", and let w denote the nth derivative of w. 


-1 


(a) Use integration by parts to prove that 


1 1 
/ w™w™ da = (ent [ (1 — a)" da. 


=f -1 
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16. 
Ti: 
18. 


19. 
20. 
21. 


22: 


23. 


24. 


25. 


26. 
27. 


28. 
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(b) Prove that 


ll ae 7 (n 1)? g2n+1 
[o-*) oder ICES 


Using the previous result, prove Theorem 12.3. 
Prove Theorem 12.4. 


Show that Qs(z) = 5 Pala) in eee 


e-1 2" 13° 

Prove Corollary 12.1 (page 651). 

Prove Theorem 12.5 (page 651). 

Obtain the Legendre polynomial from Laplace’s integral formula 


Py(x) = -/° (z+ v2—1 cos)" 8. 
0 


Tv 


Find first three coefficients in the expansion (12.4.13) of the function 


vz, O<a“Ee 
r)= 
f(@) ‘ oP 


? 


di 
<0. 


Prove the Maclaurin expansion for the Legendre polynomial: 


1 afn : ". oafn nok+t 
rata) = 9° (7) (@— 1)" "@+ 1" = 9" Soe ai 2 ) 
k=0 k=0 


Find the first three coefficients in the expansion of the function 


cos, O0<6<7/2, 
f(0) = 
0, T/2<K0<0 
in a series of the form 
= >> an Pp (cos 8) , O0<@<rm. 


n>0 


Obtain the Legendre functions of the second kind Qo(x) and Qi(a) by means of 


nla) = Fol) | aay 


Expand arccos x into the series with respect to Chebyshev polynomials of the first kind. 


In each exercise, solve the given initial boundary value problem. 


(a) ws = User +1 (0< a < 2), (b) ut =Uca + 7/4 (0 < & < 7/2), 
u(0,t)=1, u(2,t) =3, uz (0,t)=0, u(m/2,t) = 1, 
u(#,0) =a? —2 +1. u(x, 0) = cos(3 ae — cos 22. 

(c) Ut = Use — 2 (0< a < 1/2), (d) w=Uueete' (0<a<7), 
u(0,t)=0, us2(1/2,t) = 1, uz(0,t)=1, uae(a,t) = -1, 
u(x, 0) = sin(7a). u(x,0) = sina. 


Consider a vertically hanging string of length @ subject to the horizontal force with harmonically density distribution 
F(ax,t) = A sinwt per unit length. Let u(z,t) be the horizontal displacement of the spring from the vertical equilibrium 
position at the point x and time t. Then u(z,t) is a solution of the following partial differential equation 


Oru 20 Ou 
nee (e an ot) 4+ Plo t)/p, u(0,t)<oo, u(Z,t) =0, 


where the density of the string is assumed to be p = 1. Find u(az,t) assuming that u(x,0) = u(a2,0) = 0. 
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12.5 Nonhomogeneous Boundary Value Problems 


In this section, we introduce nonhomogeneous boundary value problems. This topic is very important in applications 
and covered thoroughly in many books. Previously in §11.2, we showed how to solve nonhomogeneous equations 
using eigenvalue expansion (see also Problems 1 and 2 on page 665). We start with the following statement, known 
as the Fredholm!” alternative theorem. 


Theorem 12.6: For a given value of jz, either the nonhomogeneous problem 
L{x, Djy = up(x) y(x) + f(x),  Boly]=90, Bey] = 9, (12.5.1) 


where L[x,D] = —Dp(«)D + q(x), D = d/dz, and the boundary operators are Bo[y] = aoy(0) — ary'(0), Bey] = 
Boy() + B1y'(2), has a unique solution for each smooth function f(x) from the domain of La, D] (if u is not an 
eigenvalue of the corresponding homogeneous Sturm-—Liouville boundary value problem), or else the homogeneous 
problem has a nontrivial solution. If 4 = Am is the eigenvalue, the nonhomogeneous boundary value problem 
(12.5.1) has no solutions unless f is orthogonal to ¢m (x). 


We assume that the solution y = ¢(a) of the nonhomogeneous problem (12.5.1) admits eigenfunction 
expansion: 


= S- Andn(2), (12.5.2) 

n2>1 
with respect to known set {@n(#)}n>1 of all normalized eigenfunctions (so lon ||? = fe el x) dz = 1) corre- 
sponding to distinct eigenvalues Ay < Az <--: < An <--: for the homogeneous Reais wee value 


problem: L[x,D]y = Ap(x) y(x), Boly] = Bely] = 0. Substituting the series (12.5.2) into the differential expression 
L\x,D]¢ = L[d|(x) and using the equation L[¢,](x) = Anp(x) n(x), we obtain 


=1|>° andn] (2) = an Llbn|(x) = p(x Se AnAn On(x 
n2>1 no>1 


where the interchange of summation and differentiation is assumed to be justified. We substitute this series into the 
differential equation (12.5.1): 


S- AnAnp(x) On = = pola (x) So An n(x (x). 


nol no1 


Expanding f(x)/p(a) into the series with respect to eigenfunctions, we get 


p(x) = S- nn ), 


n=1 
where 


ti e 
C= | oe p(x) bn (x) da = | f(x) bn(a) da, HAD ees (12.5.3) 


After substituting the series for ¢(a), L[@](x), and f(x), we find that 


S- AnAnP(L) On = pplz (x) So An dn (x (x) > CnOn(z). 


n>1 n>1 no>1 


Upon collecting terms and canceling the common nonzero factor p(), we obtain 


[An — 1) Qn — Cn] On(x) = 0. (12.5.4) 


Ms 


n=1 


122The Swedish mathematician Erik Ivar Fredholm (1866-1927) is best remembered for his work on integral equations and spectral 
theory. 
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If Eq. (12.5.4) is to hold for each x in the interval [0, ¢], then the coefficient of ¢,,(x) must be zero for each n: 


(An —H)@n=Cn =F n= 5 EAGER, gel OAs: (12.5.5) 
n— HE 


Therefore, if wy A An for n = 1, 2,3,..., the solution becomes 


y=9(z) = 


dn(2), (12.5.6) 


where the coefficients c,, are determined from Eq. (12.5.3). While we did not prove that the series (12.5.6) converges 
uniformly to a smooth function that possesses two continuous derivatives, it can be done with even less stringent 
conditions on the forcing term f(x). Thus we obtain a formal solution, and it is reasonable to expect that the series 
(12.5.6) pointwise converges. 

Now suppose that yz is equal to one of the eigenvalues of the corresponding homogeneous problem, say, 4 = Am. 
In this case, the coefficient of ¢,, in the expansion (12.5.4) becomes —Cm¢m(x), which forces us to assume that 
Cm = 0. Again, we must consider two cases. 

In the event that 4 = A» and c,, 4 0, there is no value of a, that satisfies Eq. (12.5.5), and therefore the 
nonhomogeneous problem Eq. (12.5.1) has no solution. 


When pb = Am and Cm = i. f(x) dm(x) dx = 0, Eq. (12.5.5) is satisfied regardless of the value of am; in other 
words, Gm, remains arbitrary. In this case, the nonhomogeneous two-point boundary value problem (12.5.1) has 
infinitely many solutions. 


Example 12.5.1: Solve the boundary value problem 
y" +4y=—-a*, (0) -y'(0) = 0, (1) =0. 


Solution. We seek the solution as a series (12.5.2) with respect to the set of eigenfunctions {¢,(x)} of the corre- 
sponding homogeneous Sturm—Liouville problem: 


y+r4y=0, (0) — y"(0) = 0, -y(1) = 0. 
Since the general solution of the equation y” + Ay = 0 is y = c1 cos (Vix) +c sin (Vie), with arbitrary constants 
1,2, the given boundary conditions yield 
cy — V X02 =0, C1 COs (vr) + cg sin (vr) = 0. 

In order for the Sturm—Liouville boundary value problem, y” + Ay = 0 and y(0) — y’(0) = 0, y(1) = 0, to havea 
nontrivial solution, the parameter 4. = VX must be a solution of the following transcendent equation: 

fs cos jt + sin pw = 0. (12.5.7) 
This equation has infinite many roots, “n,n = 1,2,.... The first few eigenvalues can be found numerically, 

Ar = (11)? © (2.02876)? = 4.115858365694522, Az x 24.1393, Az ~ 63.66. 

For large n, their values are approximately 


Qn — 1)?n? 
dn CRO torn =4,5,6,.... 
4 
Assuming that the solution is given by Eq. (12.5.2) 
y= se andn(2), on(x) = V/An Cos (Vnz) ++ sin ( Xn) : 


n2>1 


we find its coefficients from Eq. (12.5.3): 


ie" 
"On = 4 
where the c, are expansion coefficients of the forcing term f(x) = x: 
1 a 2 bn(x)d al MP? sina/Xy, — 2 (2+, co8/A, 
Cn =H TT x ny (0 ed ee OI 
lon ||? Jo An 2s/Xn (2+ An) — 20/An Co82V/ An + (An — 1) sin 2VA, 
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Example 12.5.2: (Forced vibrations ) Consider a mass m attached to a coil spring of length ¢), the upper 
end of which is securely fastened (see Fig. 12.5). The mass causes an elongation ¢ of the spring in the downward 
(positive) direction. At that point the mass remains at rest. There are two forces acting at the point where the 
mass is attached: the gravitational force, or weight of the mass acting downward, and the spring force that acts 
upward. The gravitational force has magnitude mg, where g is the acceleration due to gravity. The spring force is 
proportional to the elongation @: F, = —K, which is known as Hooke’s law. The constant of proportionality « is 
called the spring constant. Since the mass is in equilibrium, we have 


mg — Ke =0. 


Figure 12.5: A spring-mass system. 


We will now assume that the spring-mass system is also subject to an external periodic force A sinwt. In this 
case, the mass will undergo forced vibrations. Let y(t) measures positive downward displacement of the mass from 
its equilibrium position at time t. To describe the future movement of the mass, we assume that it moves along a 
vertical line through the center of gravity, and its direction is always from the mass toward the point of equilibrium. 

Newton’s second law of motion states that the force F acting on this particle moving with varying velocity v is 
equal to the time rate of change of the momentum mv: 


d(mv) 


F= 
dt 


=m, where v=y 


is the velocity. Equating the two forces and applying Hooke’s law and the external force, we get 


d? . ; 
mp = —Ky + FO) => my+tKy=Asinut. 
For simplicity, we ignore friction and air resistance. We again suppose that the mass is at its equilibrium position 
initially, and when ¢t = T seconds, and seek a solution to the nonhomogeneous equation subject to the Dirichlet 
boundary conditions: 

my+Ky=A sinut, y(0) = y(T) = 0. (12.5.8) 


Following the paradigm we have set up for solving nonhomogeneous boundary value problems, we consider the 
accompanied Sturm—Liouville problem: 


md" +Ky=-dd, (0) = o(T) =0. 


Since the differential operator mD? + « in the left-hand side is negative, we assign the right-hand side to —\ ¢. The 
eigenvalues are obtained by setting \/(« + A)/m equal to a multiple of 7/T; hence 


2 
An =m () —kK, M2 1,2,332 253 
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The corresponding eigenfunctions have the form 


ee ee 
T 


Expanding the forcing term into the Fourier series, we get 


nat 
A sinwt = eer 
sin w Sof T 


n>1 
where the Fourier coefficients are obtained according to the Euler—Fourier formulas (10.5.2), page 582: 
2nn(—1)”" sin(wT) 


aA t ——— 
f= =| sinwt sin ——- dt = A x wT? — nen” 
T Jo L 1, if Tw = kn for some k. 


if Tw £ nz, 


We seek a solution of the given boundary value problem in the form of infinite series: 


This series satisfies the homogeneous boundary conditions because every eigenfunction ¢,,(t) does. Now we substitute 
these series into the differential equation (m D? + K) y = f to obtain 


2 t ie t 
—m > en () sin +k cn sin = 7 fa sin . 


n>1 n>1 no>1 


Uniting these three series into one, we have 


2 t 
S- [enn (+) + Kn — fa sin = = 0. 


n2>1 
Due to uniqueness of Fourier series, we equate every coefficient to zero and get 


crt a _ AT?2nn(—1)" sin(wT) 


nt \ 2 
—enm (=) + en — fn =0 = Cn = 5 


T T2k — mn? 72 T2k —mn 


? 


12 


if Tw # nz, for alln = 1,2,3,.... When Tw = kz for some k, we get the particular solution 


provided that T?« 4 k?x?m. If T?« = k?x?m, the boundary value problem has no solution. 


Example 12.5.3: Consider a physical problem that serves as a crude model of a centrifuge. Suppose that we have 
a horizontally mounted tube of length @ that is rotated about a fixed pivot with a constant angular acceleration. A 
particle having mass m is initially injected with velocity vp at the pivot point of the tube. It can slide freely in a 
rotated tube. The particle migrates radially outward and its motion within the frictionless tube can be described in 
terms of polar coordinates r (distance from the pivot) and @ (angle). 

Since the particle location is r = re,, its velocity vector becomes 


d d ; : 
a = ay (rer) =re,+reg. 


Using equations (12.5.9), we find the acceleration vector to be 


r= (#- 16?) er + (rd + 276) eg. 
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In polar coordinates (r, 6), the rectangular projections are 
x =r cos@é and y = r sin, where both r and @ vary as 
functions of time t. As shown in the figure at left, the unit 
vectors in radial and angular directions are 


e, = eee): (sin 0)j, (12.5.9) 

eg = —(sin 6)i + (cos @)j. 
In contrast to the unit vectors i and j, along the abscissa 
and ordinate, respectively, the unit vectors e,(t) and eg (t) 
vary with time. They remain mutually perpendicular and 
of unit length, but they change direction. Their differen- 
tiation yields 


a 
dt 


e, = —0(sin 0)i + 6(cos @)j = beg, <2 = —de,. 


Applying Newton’s second law of motion in the radial direction, we obtain the following differential equation 


*(e) —r(e) (8) = 00, 


where f(t) is an external force acting on the particle (we assume for simplicity that the particle has unit mass). If 
the tube begins to rotate from rest with a constant angular acceleration, then 6 = at, for some constant a. In this 
case the radial equation of motion becomes 


#(t) — r(t) (at)” =f). 


Suppose we are interested in finding the initial velocity vp = 7(0) of the particle that lead the particle to exit the 
tube at some prescribed later time t = T. This gives us the boundary conditions r(0) = 0, r(T) = @ and finally the 
boundary value problem: 

r(t) — r(t) (at)? = f(t), r(0)=0, r(T)=2. (12.5.10) 
To reduce the problem (12.5.10) to one with homogeneous boundary conditions, we choose a function v(t) that 
satisfies the given boundary conditions, v(0) = 0 and v(T’) = é. While there exist infinite many such functions, a 
particular choice does not effect the final answer because the boundary value problem (12.5.10) has a unique solution. 
So we set v(t) = t@/T. Then, subtracting v(t) from r(t), we get the difference y(t) = r(t) — v(t) that is a solution of 
a similar problem with homogeneous boundary conditions 


yj — y(t) at? = g(t), y(0)=0, y»(T) =0, (12.5.11) 


where g(t) = f(t) + a7t?/T is a known function. The two-point boundary value problem (12.5.11) has the solution 
presented in quadrature form 


T 
y(t) = | G(t, s) 9(s) ds, 


where G(t, s) is the corresponding Green’s function. The homogeneous equation #(t) — r(t) (at)* = 0 has two linearly 
independent solutions 


ri(t) = Vila (=) ,  ro(t) = Vt Kiya (=) (12.5.12) 


where I; /4(z) and Ky,4(z) are modified Bessel functions of order 1/4 (see §4.9.3, page 250). To use the explicit formula 
(12.1.8) on page 632, we need to determine two linearly independent solutions ¢(t) and w(t) of the homogeneous 
equation 7 — y(t) a7t? = 0 that satisfy the boundary conditions 


~(0) =0 and W(T) =0, 


respectively. Since ¢(t) = ri(t) = Vt Lis (F) is already known, we construct w(t) as a linear combination of 
functions (12.5.12): 


w(t) = re(t) —eri(t) = Vt Kiya (=) —evthys (=) 
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From the boundary condition ~)(T) = 0, it follows that 


cantryinty=Kys(%) sry (22), nye () 40 


Since their Wronskian is W[¢(t), w(#)] = —2, we can use formula (12.1.8), page 632, to construct the Green’s function 
explicitly. 


Example 12.5.4: Consider a heat conduction problem for a straight bar of uniform cross section and homogeneous 
material. Let the z-axis be chosen to lie along the axis of the bar and let x = 0 and x = @ denote the end points of 
the bar. Suppose further that the temperature u(x,t) at the end points « = 0 and a = ¢ is maintained fixed. Then 
the temperature distribution within the bar is a solution of the following initial boundary value problem: 

du O*u 


Bt © BaP? u(a@=0)=To, wla=l)=Ty, u(t =0)= f(z). 


Applying the Laplace transform, we get the boundary value problem for the ordinary differential equation 
d2 
( - os) uX=f(x), O<a2<£, uw(0)=T/rA, uh’ (O=Te/d, 
x 


where u/(zx,) is the Laplace transform of the unknown function u(z,t). Its solution is 


u(x, d) = 2 cosh (= vy) ra (+ es 2 cosh (: vx) (sinn € vx)) sinh (< vy) ; 


Here sinhz = te — te* and coshz = te + de are hyperbolic functions. Using properties of hyperbolic 


functions, we rewrite the Laplace transform u”(a,\) in the more convenient form: 


—z 


u(x, A) = oo sinh (£3) ae anny, (12.5.13) 


Therefore, we need to find the inverse Laplace transform of the function: 


Sih nat (a—b)VX _ ,—(at+b)VX 
5(t, 0,8) = £7» | Sibavr | _ pa [ee ie PM, oy 
d sinh bv d (1 _ ee) 
depending on two parameters a,b. Using the sum of geometric series (1 — q)~' = 7,59 g® with g = e~26V?, we 
represent the required function ®(t, a,b) as the series: 
1 a) ew eae = e7 (b-a)va S- e 7 2kbva _ cs en (atb)va x e7 2kbVA (a <b). 
? ? r r 
k>0 k>0 
The inverse Laplace transform of the basic element in this series is represented via the error function: 
1 a a 
Te le | =1-erf (=) = Erf (=) ' 
F We 2\/t 
9 co 
where Erf(x) = —= e-™ dx =1- erf(x) is the complementary error function (also commonly denoted as ‘erfc’). 
fg 


This allows us to find its “explicit” expression: 
2kb — b 2kb 
(t,0,0) = So ert (8) ae () ay, 
k>0 avi k>0 avi 


Then the solution of the given initial boundary value problem becomes 


ue.t) = 76 (1,2,2) -T@ (124). 
Cc Cc Cc Cc 
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1. Solve the Dirichlet problem for the unit disc {(a,y) : 27 +y? <1} 


0? u(r, 0) 2 1 Ou(r,@) | 1 0? u(r, 0) 


Or2 r Or r2 O02 =; u(1, 6) = F(8); 


where the boundary function f(0) is defined by 
(a) f(9)=0+ 4]; (c) f(0) = 0; 
(b) f(@) = cos(0/2); (d) f(0) =sin? 0. 
2. Show that the Dirichlet problem for the disc {(a,y) : x? + y? < a?} of radius a: 
V*u(r, 8) = 0, u(a,0) = f(6), 


where f(0) is a given function, has the solution 


ne i’ aa f(9) ad 
1 On J. a? — 2ar cos(6 — 9) + r? : 


The above formula is known as the Poisson integral. 


3. Solve each of the given problems by means of an eigenfunction expansion. 


(a) y”+y=27, y(0) =y(2) =0; (c) y”+3y=a(8—2), y'(0) =y'(r) =0; 
(b) y” + 2y =—2*, y(0) =y'(r) =0; (d) y”+4y=2, y(0)=y'(1)+y(1) =0. 


4. In each exercise, determine a formal eigenfunction series expansion for the solution of the given problem. State the 
value of u for which the solution exists. Assume that the problem has a unique solution for given function f(x). 


(a) y’+yy=—f, y'(0) =y(F) =0; (c) y’+py= —f, y'(0) =y'(z) =0; 
(b) y” + ny =—f, y(0) =y' (4) =0; (d) y’ +hy=—f, yO) =y') + y(1) =0. 


5. In each exercise, determine whether there is any value of the constant c for which the problem has a solution. Find the 
solution for each such case. 


(a) y+ my=c—ax+27, y(0)=y(1) =0; (c) y’ +my=ct+z, y(0) =y' (5) =0; 
(b) y” +40?y=c—a+2", y(0) =y(1) =0; (d) y"+4y=c, y'(0) =y'(m) =0. 


6. The steady temperature distribution u(r,@) in spherical coordinates not depending on azimuthal angle satisfies the 


equation V?u = 0, where 
2 1020 1 Of. O 
=sS> — ——— G—}. 
¥ r? Or (" Or = 72 sind 00 \°" 80 


Show that its solution, which is regular at 6 = 0,7, is 


u(r, 0) = > (Anr” + Bnr—"~*) Pn(cos6), 


nZ>0 
for some constants A, and B,. 


7. Show that the azimuthal symmetric solution of the inner Dirichlet problem for sphere of radius r = a 
V'u=0 (r<a),  u(a,0) = f(6), 


where f(0) is a given function, is 


u(r,0) = S> (n+ 5) (“)" P,, (cos 6) [ f(9) Pa(cos¥) sind dd. 


a 
n2>0 
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Summary for Chapter 12 


8. Consider the steady state temperature distribution 
u(z,y) in a rectangular slab O < a2 < 2,0 < y <1. 
The left and bottom edges are in direct contact a heat 
sink maintained at zero degrees, but the right edge is 
partially shielded from the sink through leaky insula- 
tion, giving rise to the boundary condition of the third 
kind: ou + «u = 0, for some given positive constant k. 
The temperature of the upper edge is maintained at the 
generic prescribed temperature u(x, 1) = f(a). Solve the 


corresponding boundary value problem for the Laplace 
equation V?u = 0 inside the slab. 


Summary for Chapter 12 


1. Consider the linear two-point boundary value problem for self-adjoint (positive if g(a) > 0) differential operator L[{x,D] = 


—Dp(x)D + q(x), depending on the derivative operator D = d/dz: 
d d 
pet [o() | +9(x)y= f(x), — aoy(0) —aay’(0) =0, Boyle) + Biy’(2) = 0, (12.1.4) 


on the interval 0 < x < £. The functions p(x) > 0, p’(x), q(x), and f(x) are assumed to be continuous on the closed 
interval [0, 2], and 
lao] + lax] >0, [Bo] + |B:| > 0. 
It is convenient to introduce two boundary operators 
Boly] = aoy(0) —ary"(0) and _—Be[y] = Boy(£) + Bry" (2). 
Then the boundary value problem can be written in compact form: 


L[z,Dy=f, Boly)=0, Bey] =0. (12.1.5) 


. Let é(a) and w(x) be two linearly independent solutions to the homogeneous equation L[x,D]y = 0. Ifthe boundary value 
problem L[x,D]y = 0, Bo[y] = 0, Be[y] = 0 has only trivial solution, then the nonhomogeneous equation L[x,D]y = f 
has a solution expressed in quadrature form 


y(@) = : G(a,t) f(t) dt, (12.1.3) 
where the kernel G(z, t), 
-o(tl@) gg epey 
xt) ad POW dt)? 
G(x, t) “ —$(a) W(t) eres, (12.1.8) 
( ~S oS) 


p(t) W[¢, b](t)’ 
is called the Green’s function. The denominator p(t) W[¢,~](t) in Eq. (12.1.8) is a constant for the self-adjoint 
differential operator L[x,D] = —Dp(x)D + q(z). 
. The boundary value problem (12.1.4) can be reformulated in vector form: 
y(t) =P(x)y(z)+£(x),  By(0) + B4y(2) = a, (12.2.1) 
where P(x) is an (n x n) continuous matrix function, B" and B! are given constant (n x n) matrices, and f(a) and 
q@ are n-column vectors. 


. The Sturm—Liouville boundary value problem 
L|x,Djy=Ap(z)y a<a<b, aoy(a)—Boy'(a)=0, — ary(b) + Bry’(b) = 0, 


is said to be regular if the functions p(x), p’(x), q(x), 1/p(x), and p(x) are continuous, and p(x) > 0, p(x) > 0 on the 
closed interval [a, 0]. 


. When one or both of the boundary points in a Sturm—Liouville problem goes to oo, or when the coefficient 1/p(a) in 
the differential operator (12.3.1) diverges at an endpoint, the problem becomes singular. 


. The Fredholm alternative theorem: For a given value of yu, either the nonhomogeneous problem 
Lx, Dly = up(x)y(a)+ f(x),  — Boly]=0, Bey] = 0, (12.5.1) 


where L[x,D] = —Dp(x)D + q(x) and the boundary operators are Bo[y] = aoy(0) — ary’(0), Bely] = Boy(l) + Biy’(2), 
has a unique solution for each continuous function f(x) (if w is not an eigenvalue of the corresponding homogeneous 
Sturm-—Liouville boundary value problem), or else the homogeneous problem has a nontrivial solution. 
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Review Questions for Chapter 12 


Section 12.1 


1. Derive the Green’s function for the given two-point boundary value problem. 


(a) -y" = f(x), y(0)—y'(0) = 0, y() +y'(1) = 0. 
(b) -y"—y= f(z), y'(0)=0, y(m) + y'(m) =0. 
(c) yd f(z), y(0) =0, yA) =0. 

(d) -y"—-y=f(z), y(0)=y'O), y(m) =0. 

(e) (x 7y'(a))' + 22~*y(a) = 


2. Find the Green’s function for the given differential operator L[x,D], where D = d/dz. 


(a) De~**D + 2e-**; (c) DaD—4/z; 
(b) Da®D+1/16; (d) D(a+1)?D 


3. Find the Green’s function for the nonself-adjoint two-point boundary value problem, given two linearly independent 
solutions for the corresponding homogeneous equation. 


(a) (7-1)? y"” —2(@—1)y'+2y= f(x), y/(-1) =0, y(0) =O, given y: = x —1 and yo = 2? —- 1. 
(b) (xwsinz+cosz)y” —a2 cosry’+ycosx= f(x), y(0)=0, y(r) = 0, given y1 = 2, y2 = cosa. 
4. Show that the eigenvalues of the Sturm—Liouville two-point boundary value problem (see Eq. (12.1.4) on page 631), 


L{x,D]y = Ap(x) y(z), Boly] =0, Bely] = 0, 
are positive, provided that p(x) > 0, g(a) > 0 and the coefficients in the boundary operators Bo and By are not negative. 
5. In each exercise, the Green’s function is given for the boundary value problem (12.1.4): 
— (p(x) 9") +a(z)y = F(z), — aoy(0) — ary’(0) = 0,  Boy(1) + Bry’(1) = 0. 


Determine the functions p(x), g(x), and possible values of the constants ao, a1, Bo, (1. 


- aie ; _ 2 
(a) G(a,t)=2x (c—4)(2+1t), O<tK<a, i). Cage =x a cos(l—2), O< 
(¢-4)(2+2z), r<t<l,; cs sinx cos(l—t), «we 


h 2x — (tanh 2) sinh 2 h2t+sinh2t), O0<t<z, 
@4ej=10e)s (cosh 2a ven ) sinh 22) (cos os ), i 
(cosh 2x + sinh 22)(cosh 2t — tanh 2 sinh 2), a<t<l; 
cos(l—2)sint, O<t<a, 
(d) G(z,t) = coal * ; 
sinz cos(l—t), w<t<l. 


6. Find the Green’s function for the nonself-adjoint two-point boundary value problem, given two linearly independent 
solutions for the corresponding homogeneous equation. 


(a) (2? +4) y"”—2cy'+2y= f(x), y/(0) =0, y(2) =0, given yi = 2 and y2 = 27 —4. 

(b) (xcosa — sing —cosx)y” +(x—1)sinzy’—ysing = f(x), y(0)=0, y(1) =0, given y) =a—-1, yo = sing. 
(c) (2? +4) y"+ (4-22) y'+2y= f(x), y'(0) =0, y(2) =0, given yi: = 2”, yi = 2-2. 

(d) (cos? x) y” +2 sin2ry’+(1+sin?z) y= f(z), y(0)=0, y'(7) =0, given y1 = x cosz, y2 = cosa. 


7. Show that the solution y(#) = sina + c cosa + 2a — 1 — 7 of the boundary value problem 


y" +y =2a—-1-7, y' (0) = 3, y(m/2) = 0, 


cannot be obtained as the sum y = yp;,(x)+Y p(x), where yp, is the solution of the homogeneous equation y” +y = 0 subject 
to the given boundary conditions and yp is the solution of the given differential equation subject to the homogeneous 
boundary conditions, y’(0) = y(m/2) = 0. 

Section 12.2 of Chapter 12 (Review) 


1. Express the given boundary value problem as an equivalent boundary value problem for a first order system (12.2.1). 


(a) (x?y')'-2y=—-f(x) 1<a2<3, y(1)—y/(1) =0, y(3)+3y'(3) =0. 
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(b) 2°(2+2)y"+2xry'-2y=—-f(x) l<x<2, y(1)—y'(1)=0, y(2) =0. 
(c) (e** y’)’ —10e* y= —f(z) O<a<1, 5y(0)—-y’(0)=0, y’(1) =0. 
(d) y’+ay'=—-f(x) O<2#<1, y'(0)=0, y(1)=0. 
2. In each exercise from the previous problem, determine the Green’s matrix for the corresponding first order system. 


3. In each exercise, you are given boundary conditions for the two-point boundary value problem (12.2.1), where 


2 -5 yi(x) ay 
P= c)= a= : 
k “3 oe fee a2 
Note that the fundamental matrix for the corresponding homogeneous equation is given to be e®*. Form the matrix 


B from Eq. (12.2.3) and determine whether the boundary value problem has a unique solution for every f(a) and 
a = (a1, Q2). 


(a) y2(0) = a1, yo(m) = a2; (c) y1(0) — y2(0) = 0, yim) + ya) = a2; 
(b) y(0) = a1, y2(m) = —a1; (d) y1(0) + y2(0) =0, yi) + ya() = a2. 


Section 12.4 of Chapter 12 (Review) 
1. Obtain the first two Legendre coefficients for f(x) = e*”: 


1 ft 1 sinha 
= az ax J =yea? a —a\y 
ao sie x 5a (e e ) a. 
= = a4 onde =3 (Se Se) 
2 Jy a a 


2. Suppose that on an isolated sphere of radius a the electrostatic potential varies as V(a,0) = Voe® sod (with some 
constants Vo and a). Assuming that the electrostatic potential in charge-free space satisfies the Laplace equation with 
axial symmetry (no ¢ dependence), derive its series representation 


V(r,0) = Ss — P,(cos 8) 


n>0 


and find the first two values of the coefficients bo and 6; explicitly. 
Section 12.5 of Chapter 12 (Review) 


1. Let G(a,t) be the Green’s function for the regular Sturm-—Liouville boundary value problem 
Lyl]=ey+f, Boly]=0, Bely] = 0, 


where L{x,D] = —Dp(x)D + q(x), depending on the derivative operator D = d/dz, and Bo[y] = aoy(0) — aiy’(0), 
Bely] = Boy(£) + Biy'(é). Using the method of eigenfunction expansions, derive the eigenfunction formula for G(z, t): 


where {¢n}n>1 is an orthonormal system of eigenfunctions with corresponding eigenvalues {An}n>1 for 
Lly}=Ay,  Boly] =0, Bely] = 0. 


Assume that py is not an eigenvalue. 


2. In each exercise, find a formal eigenfunction expansion for the solution to the given nonhomogeneous boundary value 
problem, if it exists. 


(a) y” +3y =4 sin 2x — 23 sin7z, y(0)=0, y(m)=0. 
(b) y” +9y =sin3z — 16 sin5z, y(0)=0, y(r)=0. 
(c) y” +7’y =8 sin3ra, y(0)=0, y’(1/2) =0. 

(d) y”+3y=1-42?, y’'(0)=0, y(1/2) =0. 


3. Find a formal solution to the vibrating string problem governed by the given initial boundary value problem: 


ti =Uce +t (0<a<n), u(0,t)=u(r,t)=0, u(ev,0) =sinz, u(#,0) = 5 sin 2x — 3 sin 5z. 
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Poincare—Bendixson theorem, 533, 539 

Point Quadrature, 5, 40, 95, 147 
asymptotically stable, 484 Quasi-homogeneous function, 67, 126 


critical, 439, 486 


asp R, 291 
oe aad Radial acceleration, 346 
fixed, 486 
i Radian, 346 
singular, 40, 406 ie eae ‘ 
stable, 484 adiocarbon dating, 


Ralson method, 180 

Rayleigh equation, 534 

Rayleigh, Lord, 524, 534 
RC-series circuit, 3 

Real part, 576 

Rectangular window function, 291 
Recurrence, 138, 147, 648, 651 


stationary, 439, 484, 486 
unstable, 484 

Poisson equation, 627 

Poisson integral, 665 

Polar coordinates, 448 

Polar form of complex number, 212 
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olking, ohn, Fibonacci, 139 
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homogeneous, 139, 140 
inhomogeneous, 139 
nonhomogeneous, 139 
order, 138 
relatively stable, 143 
unstable, 143 
Reference solution, 167 
Region of asymptotic stability, 485 
Regular Sturm—Liouville problem, 558 


annulled, 400, 424 
Chebyshev, 647, 648 
Chebyshev—Laguerre, 656 
Hermite, 647, 654 
Laguerre, 647, 655, 656 
Legendre, 647, 650 
minimal, 400, 424 
orthogonal, 647 


Positive definite function, 523 Repeller, 41, 444, 447, 451, 476, 485 
Positive matrix, 392 Residue, 308, 333, 406 
Positive operator, 545 Resistance, 341 
Positive semidefinite function, 523 Resolvent, 383, 400, 405, 423, 424 
Potential function, 7, 73, 126 Resonance, aa 
Precision, 166 Response, 86 
Predator-prey model, 505 Rest point, 486 
Predictor-corrector method, 153 Restoring force, 344 
Principal branch, 213 Riccati equation, 99, 109, 112, 194, 196, 197, 221, 253, 
Principle of reflection, 611 258, 264, 268 
Principle of superposition, 191, 224 canonical form, 101 
Problem special, 102 
boundary value, 255 Riccati, Jacopo, 99 
Cauchy, 8, 188, 255, 362 Riccati, Vincenzo, 94 
Fermi—Pasta—Ulam, 475 Richardson, Lewis, 3 
Henon—Heiles, 520 Robin, Gustave, 609 
initial value, 8, 188, 255, 362 Rodrigues’s formula, 650 
perturbed, 166 Rodrigues, Olinde, 650 
pursuit, 537 Root mean square, 554 
Sturm-—Liouville, 545, 546, 561, 562 Rosen—Morse equation, 544 
well-posed, 166 Rossler system, 538 
Propagated truncation error, 168 Rossler, Otto, 538 
Propagator matrix, 436, 442 Rotational kinetic energy, 346 
Proper node, 449, 452 Round-off error, 172, 182 
Prufer substitution, 591 Rounding error, 172 
Prufer variables, 592 Runge, Carle, 174 


Prufer, Ernst, 591 Runge-Kutta formula 
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Butcher, 181 stationary, 39 

classical, 178 steady state, 465 

Heun third order, 180 trivial, 432, 546 

nearly optimal, 180 Sonin, Nikolay, 639, 647 

optimal second order, 180 Sound, 559 

second order implicit, 181 Source point, 41, 451 
Runge-Kutta method, 174, 182 Special function, 18 

, Special Riccati equation, 102 

Saddle point, 444, 445, 451, 452, 477 Specific heat, 598 
Saddle-node bifurcation, 117 


Spectral representation, 269 
Sage, 16, 43, 70, 199, 215, 252, 291, 332, 333, 356, 361, Spectrum, 386, 423, 546 


383, 389, 642, 649, 655 
Scalar product, 356, 553 
Scholes, Myron, 605 
Schrodinger equation, 552, 591, 653 
Schwartz, Laurent, 298 
Second order differential equation, 188 
Self-adjoint matrix, 358 
Self-adjoint operator, 255, 553 
Self-regulating equation, 47 
Selkov, Evgenii, 543 
Separable equation, 40, 126 
Separation of variables method, 598 


Spherical pendulum, 375 
Spiral point, 447, 451, 452 
Spiral source, 447 

Spline, 163 

Spring constant, 344 

Square integrable function, 553 
Square wave, 587 

Squine, 372 

Stability test, 47 

Stable critical point, 444, 484 
Stable equilibrium solution, 47, 444 


Separatrix, 115, 445, 485, 515 peers ee 
. Standing wave, 614 
Shift rule, 282 
Star, 449 


Signum function, 291 
Similar matrices, 391 
Similarity rule, 282 
Simple eigenvalue, 387 
Simple root, 303 

Sine amplitude, 373 


State of the system, 439 

Stationary point, 439, 484, 486 
Stationary solution, 39 

Steady state solution, 465, 486, 607, 617 
Stefan’s law, 52, 134, 135 


Sine integral function, 18, 579 Steklov, Vladimir, 557 


Sine-Fourier transform, 646 Stiffness, 181 
Single-step method, 146 Streamline, 4, 11, 371, 439, 486 


Singular matrix, 381 Strict Lyapunov function, 522 


Singular point, 39, 40, 86, 119, 189, 406 Strong Lyapunov function, 522 
Singular solution, 9, 189 Strongly irregular system, 536 


Singularity, 119 Strutt, John, 534 
Sink, 41, 451, 476 Struve function, 645 
SIRS nipcel 525, 528 Sturm theorem, 207 
Slope field, 11, 486 Sturm, Jacques, 546 
Smith, Henry John, 392 Sturm-—Liouville problem, 545, 546, 561, 562, 639 
Sobolev, Sergei, 298 singular, 639 
Sokhotskii, Julian, 647 Subrcritical pitchfork, 118 
Solow model, 69 Supercritical pitchfork, 118 
Solow, Robert, 69 Superposition principle, 191, 224, 432 
Solution, 4, 371 Swan, Trevor, 69 
equilibrium, 39, 47 Sylvester auxiliary matrix, 401, 424 
exceptional, 536 Sylvester formula, 424 
explicit, 5 Sylvester, James, 357, 401 
fundamental set, 433 Symmetric matrix, 358 
general, 7, 8, 224, 434 SymPy, 17, 199, 215, 252, 291, 383, 389, 649, 655, 656 
implicit, 5 System of differential equations, 3 
nontrivial, 189, 598 almost linear, 494 
particular, 8 autonomous, 362 


singular, 9, 189 degenerate, 368 
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Tangent line method, 146, 182 
Tangential velocity, 346 
Taylor series, 174 
Taylor series method, 162 
Taylor’s algorithm, 163, 182 
Telegraph equation, 626 
Theorem 
Abel, 201, 435 
Bendixson’s negative criterion, 531, 539 
Cayley—Hamilton, 400 
Dirichlet, 570 
Dulac, 532 
existence and uniqueness, 189 
Fredholm, 659 
Grobman-—Hartman, 493, 539 
Levinson-Smith, 529 
Lozinsky, 28 
Lyapunov, 522, 523, 539 
Peano, 23 
Picard, 23 
Poincare—Bendixson, 533, 539 
Sturm, 207 
Thermal conductivity, 598 
Thermal diffusivity, 598 
Third order Kutta method, 180 
Third order method, 176 
Thomson, William, 254 
Three-term polynomial approximation, 163 
Torque, 347 
Torricelli’s equation, 52 
Trace, 360, 374, 388 
Trajectory, 11, 371, 439, 486 
Transcritical bifurcation, 117 
Transposition, 356, 358 
Transverse waves, 626 
Trapezoid rule, 151, 182 
Trefethen, Lloyd, 649 
Triangular wave function, 290 
Trigonometric polynomial, 563 
Trivial solution, 201, 432, 546, 631 
Truncation error, 167 
Two-sided Laplace transform, 279 


Unimodular matrix, 392 

Unit step function, 291 

Unstable critical point, 444, 484 
Unstable equilibrium solution, 47, 444 
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Validity interval, 27, 119, 121, 371, 484 
van der Pol equation, 4, 524, 535 

van der Pol, Balthasar, 524 
Vandermonde determinant, 209 


Variable coefficients recurrence, 139 
Variation of parameters, 239, 257, 455 
Vector differential equation, 432 
Verhulst equation, 96 

Verhulst, Pierre, 500 

Volta, Alessandro, 341 

Voltage source, 342 
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Wave equation, 610, 623 
Wave operator, 610 
Wavelength, 614 

Weber function, 248, 257 
Weber, Heinrich, 248 
Weber-—Fechner law, 185 
Well-posed problem, 166 
Wilbraham, Henry, 578 
Winplot, 11 

Wronski, Josef, 199 
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Young’s modulus, 548 


Zhukovsky, Nikolai, 174 


681 


< \ Taylor & Francis 
hi! bivadbos Group 


http://taylorandfrancis.com 


